Middle AI/ML Backend Engineer (Python, Computer Vision, NLP/LLM)

We are looking for a Middle AI/ML Backend Engineer to join a long-term project focused on large-scale video analysis and intelligent metadata extraction. The ideal candidate will have strong Python skills and hands-on experience in Computer Vision and NLP/LLM model integration.

 

Requirements:

* 4+ years of professional experience with Python
* Hands-on experience in Computer Vision (object detection, OCR, facial recognition)
* Practical experience integrating and optimizing  NLP/LLM models


Will be a plus:

* Experience with video frame analysis, image embeddings, and multimodal AI (vision + text)
* Familiarity with Whisper, OCR pipelines, and multilingual transcription
* Understanding of DevOps practices, high-load system design, and performance optimization
* Knowledge of MongoDB, REST APIs, and microservice architectures
* Experience with AWS, GCP, or on-prem GPU environments

 

Responsibilities:

* Design and implement backend AI pipelines in Python
* Work on computer vision tasks: object detection, OCR, facial recognition
* Integrate and optimize NLP/LLM models for transcription and content understanding
* Develop scalable backend services for large-scale video processing
* Collaborate with cross-functional teams to deliver end-to-end AI-powered features

 

Tech Stack:
Python, MongoDB, REST APIs
Computer Vision, OCR, LLMs, Whisper, NLP
Microservices, cloud & on-prem GPU infrastructure

 

Project:
A cutting-edge platform that processes and analyzes massive volumes of video data by extracting frames and applying advanced AI techniques (computer vision, OCR, facial recognition, LLMs) to derive metadata. This metadata powers automation and decision-making processes, combining backend rule engines with AI-driven pipelines for both cloud and on-prem deployments.

Published 15 August
83 views
ยท
8 applications
88% read
ยท
75% responded
Last responded yesterday
To apply for this and other jobs on Djinni login or signup.
Loading...