Middle AI/ML Backend Engineer (Python, Computer Vision, NLP/LLM)
We are looking for a Middle AI/ML Backend Engineer to join a long-term project focused on large-scale video analysis and intelligent metadata extraction. The ideal candidate will have strong Python skills and hands-on experience in Computer Vision and NLP/LLM model integration.
Requirements:
* 4+ years of professional experience with Python
* Hands-on experience in Computer Vision (object detection, OCR, facial recognition)
* Practical experience integrating and optimizing NLP/LLM models
Will be a plus:
* Experience with video frame analysis, image embeddings, and multimodal AI (vision + text)
* Familiarity with Whisper, OCR pipelines, and multilingual transcription
* Understanding of DevOps practices, high-load system design, and performance optimization
* Knowledge of MongoDB, REST APIs, and microservice architectures
* Experience with AWS, GCP, or on-prem GPU environments
Responsibilities:
* Design and implement backend AI pipelines in Python
* Work on computer vision tasks: object detection, OCR, facial recognition
* Integrate and optimize NLP/LLM models for transcription and content understanding
* Develop scalable backend services for large-scale video processing
* Collaborate with cross-functional teams to deliver end-to-end AI-powered features
Tech Stack:
Python, MongoDB, REST APIs
Computer Vision, OCR, LLMs, Whisper, NLP
Microservices, cloud & on-prem GPU infrastructure
Project:
A cutting-edge platform that processes and analyzes massive volumes of video data by extracting frames and applying advanced AI techniques (computer vision, OCR, facial recognition, LLMs) to derive metadata. This metadata powers automation and decision-making processes, combining backend rule engines with AI-driven pipelines for both cloud and on-prem deployments.