Senior ML Engineer

Location: Remote

Working timezone: Toronto / North America overlap (afternoon → late evening EU time);
Engagement: Full-time, long-term

We’re looking for a versatile and hands-on Machine Learning Engineer to join our client's team and take ownership of ML systems from early research through to real-world deployment. In this role, you’ll work across the full lifecycle - shaping datasets, exploring new approaches, building models, and running them reliably in production. T

his is a good fit if you enjoy fast iteration, practical experimentation, and seeing your work directly influence product outcomes.

Responsibilities:

ML solution design: Explore, evaluate, and implement machine learning approaches to solve complex product and business challenges using modern model architectures.
Experimentation workflows: Set up and maintain pipelines that support fast prototyping, testing ideas, and validating assumptions.
Structured experimentation: Plan, run, and document experiments in a reproducible way, including tuning hyperparameters, comparing architectures, and testing data variants using tools like MLflow or Weights & Biases.
Production rollout: Bring models into production using established CI/CD workflows and model serving infrastructure.
Model observability: Build monitoring and alerting around deployed models to track accuracy, drift, latency, and overall system health.
Inference optimization: Improve runtime performance and cost efficiency through techniques such as quantization, pruning, and distillation.
Training data ownership: Create, refine, and maintain high-quality datasets, including data augmentation and curation strategies.
Modern ML methods: Keep up with current advances in the field and apply state-of-the-art approaches, particularly in the area of large language models.

Requirements:

3+ years of hands-on experience building and deploying machine learning models in production systems.
Strong Python skills and experience writing production-grade ML code.
Solid experience with deep learning frameworks, especially PyTorch.
Practical background in hyperparameter tuning and optimization using tools such as Optuna or Ray Tune.
A disciplined approach to experimentation, with strong habits around reproducibility and tracking.
Hands-on work with LLMs, including fine-tuning, prompt design, retrieval-augmented generation, and efficient inference.
Experience applying model optimization techniques (e.g. quantization, pruning).
Proven ability to design and build datasets rather than only consuming existing ones.
Bachelor’s or Master’s degree in Computer Science, AI, Data Science, or a closely related field.

Nice to have:

Experience with advanced architectures such as Transformers or Mixture-of-Experts models.
Contributions to open-source projects or personal ML projects that demonstrate curiosity and depth.
Practical understanding of end-to-end MLOps, including tools like Docker, Kubernetes, MLflow, Kubeflow, or Prometheus.

Required skills experience

Python	5 years
PyTorch	3 years
RAG	2 years

Required languages

English

B2 - Upper Intermediate

Optuna, Ray Tune, quantization, bitsandbytes, pruning, Transformers, Mixtures of Experts, Docker, Kubernetes, MLflow

Published 16 January

35 views

13 applications

50% read

To apply for this and other jobs on Djinni login or signup.

Only from 5 years of experience
Full Remote
Worldwide
Countries where we consider candidates
- English B2 - Upper Intermediate

ML / AI

Python	5 years
PyTorch	3 years
RAG	2 years

Employment: Fulltime
Domain: Healthcare / MedTech
Outstaff
Test task is needed

Apply for the job

50% read

0% responded

📊 Average salary range of similar jobs in analytics →