Softvery Solutions

Senior ML / LLM Consultant (Audit of AI Recommender ) Offline

 

We already have an ML Engineer on the project (Python, classical ML, RAG, LangChain, vector DBs, AWS.) and an MVP AI solution in place.
 

We’re looking for a short-term consultant to review, stress-test, and improve this solution.Objective of the Engagement
 

Your main goal will be to audit the current AI/ML implementation (support chatbot + recommendation engine) and provide clear, actionable recommendations to make it:
 

  • More accurate and trustworthy for users
  • Provide solutions for the speed optimization
  • Technically robust and maintainable for the team
  • Cost-efficient and scalable as usage grows

     

Key Responsibilities
1. Architecture & Code Review
 

  • Review current AI/ML architecture: data flows, pipeline, model choices, and integrations with mobile backend and admin panel.
  • Review codebase (Python, ML libraries, LangChain/RAG pipelines, vector DB usage, prompt design, evaluation scripts).
  • Assess MLOps practices: environments, versioning, deployment, monitoring, rollback strategy.

     

2. AI Support Chat & RAG Analysis
 

  • Evaluate the virtual assistant:
    • Retrieval quality (documents, embeddings, chunking strategy).
    • Prompting strategy and guardrails (hallucination risk, tone, safety).
    • Latency, cost, and failure handling for LLM calls.
  • Propose improvements to:
    • Retrieval and ranking
    • Prompt templates / system messages
    • Evaluation methods (e.g., answer quality, hallucination rate).

       

3. Cost, Performance & Scalability
 

  • Review current infrastructure and model choices for response time and cost (LLM provider, embeddings, vector DB).
  • Suggest optimizations: caching strategy, batch calls, model selection (cheaper/faster models for certain flows), index design, etc.

     

4. Collaboration & Knowledge Transfer
 

  • Work closely with the existing ML Engineer to:
    • Validate proposed changes
    • Prioritize a realistic improvement roadmap
    • Share best practices in LLM/RAG & recommender systems.
  • Deliver clear documentation and a short handover so the team can continue without you.

     

Required Profile
Must-have
 

  • Solid experience as a ML Engineer / ML Architect / LLM Engineer working on production systems.
  • Strong hands-on skills with Python and common ML / data stack (Pandas, scikit-learn, PyTorch or similar; SQL).
  • Proven experience designing and shipping LLM-based applications, specifically:
    • RAG pipelines (vector DBs, embeddings, retrieval, ranking).
    • Prompt engineering and evaluation.
    • Integrating LLMs into real products (APIs, mobile/web backends).
  • Experience with recommendation systems (content-based, rules/hybrid, ranking) in real-world products.
  • Good understanding of MLOps: versioning, CI/CD for models, monitoring, logging, alerting.
  • Ability to perform an independent audit: challenge existing choices respectfully, explain trade-offs to non-ML stakeholders.
  • Comfortable writing clear technical documentation and presenting findings.

     

Nice-to-have
 

  • Background in consumer apps, travel, or edtech (recommendation, search, personalization).
  • Awareness of data privacy and compliance (LGPD, GDPR or similar) in ML/AI solutions.
  • Type: Freelance / consulting

Required languages

English B1 - Intermediate

The job ad is no longer active

Look at the current jobs ML / AI →

Loading...