MLOps Engineer

Hi!

Thank you for taking some time to look at our requisition. We are a US-based company working on an AI product in the entertainment space. Our app is geared towards children and we are working with major film companies.

We have a strong, distributed team, mostly in Europe. We're looking for an experienced person to help use with MLOps.

 

Core MLOps Responsibilities:

  • Model Deployment: Convert ComfyUI workflows to production Python pipelines
  • Infrastructure Management: Multi-provider GPU orchestration (RunPod + future providers)
  • CI/CD for ML: Automated model deployment and rollback systems
  • Monitoring & Observability: Pipeline performance, model drift, and system health
  • Scalability: Serverless GPU management and load balancing
  • Model Lifecycle: Version control, and hot-swapping of LoRAs

 

AI/ML Pipeline (Critical):

  • Deep experience with Diffusion models (Stable Diffusion, Flux)
  • Hands-on ComfyUI to Python conversion experience
  • Computer vision libraries: OpenCV, PIL, torchvision
  • Model inference optimization (batching, memory management)
  • Experience with diffuser library
  • Experience with ControlNets, LoRA, and inpainting workflows
  • Experience with GroundingDINO, SAM

 

Backend Development:

  • FastAPI/Python (mid/senior level)
  • Async programming and queue management
  • PostgreSQL/AlloyDb
  • RESTful API design with proper error handling

 

DevOps/Infrastructure:

  • Docker containerization
  • Google Cloud Platform (GCS, Cloud Run, CloudBuild)
  • Git Actions
  • CI/CD pipeline setup
  • GPU Providers Platform (RunPod nice to have)

 

GPU/Serverless:

  • RunPod API integration (preferred) or other GPU providers
  • GPU memory optimization
  • Cold start minimization strategies
  • Multi-provider orchestration patterns

 

Monitoring & Observability:

  • Custom metrics for ML pipelines
  • Performance monitoring and alerting
  • Integration with data warehouse systems

 

Nice-to-Have:

  • Previous work with content generation platforms
  • Experience with model serving frameworks (TorchServe, TensorRT)
  • Experience with training/fine-tuning image generation models (e.g, Stable Diffusion, Flux with LoRA)

 

 

 

 

Required skills experience

Python 5 years
FastAPI 2.5 years
REST API 5 years
Docker 5 years
CI/CD Pipelines 3 years

Required languages

English B1 - Intermediate
Python, Comfy UI, FastAPI, REST API, Docker, CI/CD pipelines, Model Deployment, PostgreSQL, GCP, Hugging Face
Published 16 July 2025 ยท Updated 27 January
Statistics:
28 views
ยท
5 applications
20% read
To apply for this and other jobs on Djinni login or signup.
Loading...