Senior AI/ML Engineer – GenAI
About PredictSpring
PredictSpring is a market-leading company shaping the future of omni-channel retail and modern POS technology. We help global retail and lifestyle brands deliver seamless, modern, and data-driven customer experiences across digital and in-store channels.
We are looking for a highly skilled Senior AI/ML Engineer – GenAI to design, develop, and deploy enterprise-grade AI solutions with a strong focus on Generative AI, Large Language Models, Retrieval-Augmented Generation, and Agentic AI systems.
In this role, you will build production AI applications, multi-agent workflows, scalable ML pipelines, and cloud-native AI solutions that create business value across our platform and operations.
Responsibilities
As a Senior AI/ML Engineer – GenAI, you will:
- Design, develop, and deploy enterprise Generative AI applications using foundation models such as GPT, Claude, Gemini, Llama, Mistral, and other open-source LLMs.
- Build advanced RAG solutions using vector databases, embeddings, reranking techniques, and hybrid search architectures.
- Develop prompt engineering strategies, evaluation frameworks, guardrails, and hallucination mitigation techniques.
- Fine-tune and optimize LLMs using parameter-efficient fine-tuning (PEFT) methods such as LoRA and QLoRA, as well as reinforcement learning approaches.
- Integrate LLMs with enterprise applications, APIs, databases, and business workflows.
- Design and implement autonomous and semi-autonomous AI agents capable of reasoning, planning, memory management, tool usage, and workflow orchestration.
- Build multi-agent systems using frameworks and protocols such as LangGraph, LangChain, AutoGen, CrewAI, Semantic Kernel, MCP, A2A, or similar technologies.
- Build and maintain end-to-end machine learning pipelines, including data ingestion, feature engineering, model training, evaluation, deployment, and monitoring.
- Develop predictive models, recommendation systems, anomaly detection solutions, NLP applications, and deep learning models.
- Design CI/CD pipelines for machine learning and AI applications.
- Deploy and manage AI workloads using Docker, Kubernetes, and cloud-native services.
- Monitor model drift, performance degradation, latency, and data quality.
- Architect cloud-native AI solutions on AWS, Azure, or Google Cloud Platform.
- Partner with product managers, software engineers, data engineers, and business stakeholders to translate business requirements into AI solutions.
- Mentor junior engineers and contribute to architecture reviews, technical strategy, and AI best practices.
Requirements
We are looking for someone with:
- English proficiency at upper C1 level or higher, fluent in both speech and writing.
- Ability to confidently participate in client-facing discussions, technical workshops, architecture reviews, stakeholder presentations, and cross-functional collaboration.
- Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Data Science, Machine Learning, Engineering, or a related field.
- 7+ years of software engineering, machine learning, or AI experience.
- At least 3 years of hands-on experience developing and deploying Generative AI applications in production environments.
- Strong Python programming skills with experience building scalable backend systems.
- Extensive experience with machine learning frameworks including PyTorch, TensorFlow, scikit-learn, and Hugging Face.
- Proven experience developing RAG architectures and knowledge retrieval systems.
- Experience building Agentic AI solutions and multi-agent workflows.
- Strong understanding of vector databases, embeddings, semantic search, and retrieval optimization.
- Experience with cloud platforms including AWS, Azure, or GCP.
- Strong understanding of MLOps principles, CI/CD, containerization, and model lifecycle management.
Preferred Qualifications
Nice to have:
- Experience with managed AI services such as AWS SageMaker, AWS Bedrock, Azure OpenAI, Azure ML, or Vertex AI.
- Experience with distributed computing platforms such as Databricks, Spark, or Ray.
- Experience with AI governance, responsible AI practices, model monitoring, and production AI observability.
- Experience with secure AI architectures, role-based permissions, auditability, and compliance controls.
- Experience in retail, e-commerce, omni-channel commerce, or POS technology is a strong plus.
Why PredictSpring?
- Join a market-leading company shaping the future of omni-channel retail and modern POS.
- Work on high-impact challenges at the intersection of commerce, data, AI, and customer experience.
- Collaborate with a talented global team in a flexible remote work environment.
- Enjoy opportunities for growth, learning, and professional development.
- Contribute to products used by some of the world’s most recognized retail and lifestyle brands.
- Be part of an innovative, collaborative, and data-driven culture focused on delivering measurable business value.
Required languages
English: C1 (Advanced)