Strong Data Scientist (NLP / GenAI)
π Who we are
Adaptiq is a technology hub helping fast-growing product companies build and scale high-performing R&D teams. We partner with innovative startups and established tech businesses to deliver cutting-edge solutions across industries.
π§ About the Product
Our platform is a cloud-based AI-driven workspace that automates statistical analysis validation and generation for clinical research. It serves large pharmaceutical and biotech clients by extracting, validating, and producing complex tabular outputs and regulatory deliverables.
The system handles high volumes of hierarchical tables, figures and listings, applying both classical and generative NLP to accelerate review cycles, reduce manual double-programming and maintain a full audit trail.
π― Your Role
We are looking for a Strong Data Scientist with a focus on NLP and Generative AI to drive the development of intelligent systems that automate complex analytical workflows.
This is a research-driven role, where you will take ownership of problems end-to-end β from understanding the data, to experimenting with approaches and delivering solutions that can be integrated into production.
π§ What youβll do
- Define and drive the AI research roadmap, mentoring peers on practical implementation.
- Design, develop and evaluate NLP and tabular-data algorithms using GenAI, retrieval-augmented generation (RAG), deep learning, classical ML, NER and rule-based methods.
- Explore large clinical datasets, perform data cleaning and feature engineering for downstream model training.
- Build and maintain data pipelines for extraction, transformation and preprocessing of structured and semi-structured inputs.
- Stay current on state-of-the-art techniques in NLP, generative AI and tabular-data analysis, and integrate best practices.
- Collaborate with cross-functional teams, including software developers, DevOps engineers to integrate the research solutions into production.
β What weβre looking for
- 3β5 years of industry experience with NLP as a core focus
- Experience working with structured or tabular data combined with NLP
- 2+ years of hands-on experience with deep learning methods and frameworks (e.g. PyTorch, TensorFlow).
- Strong hands-on experience with Generative AI / LLM-based systems (e.g. RAG, structured output generation, text-to-SQL)
- Solid understanding of classical NLP techniques (NER, parsing, rule-based methods)
- Proven ability to build and experiment with different approaches, not just implement predefined solutions
- Experience delivering AI solutions in real-world / production environments
- Strong Python skills for data analysis, experimentation, and prototyping
- MSc or PhD in CS, ML, Data Science, or related field
- Strong English communication skills
βοΈ Nice to have
- Familiarity with cloud-based NLP platforms and MLOps tooling.
- Experience with large-scale table analytics or regulatory statistical outputs.
π What we offer
- 20 working days of paid vacation + public holidays
- Full accounting & legal support
- Fully remote setup + co-working option
- High-performance equipment
- Competitive compensation with regular performance reviews
π‘ Why this role is interesting
- Work at the intersection of NLP, GenAI, and real-world healthcare impact
- Solve non-trivial data problems (tables + text + regulations)
- Influence a live product used by global pharma companies
- Strong focus on research + practical implementation
Required skills experience
| Data Science | 3 years |
| NLP | 3 years |
Required languages
| English | B2 - Upper Intermediate |