AI Systems Engineer (LLM / Agents / RAG)
HT Anywhere
Responds Quickly
$$$$
Product
AI Systems Engineer (LLM / Agents / RAG)
Remote | Full-time | Flexible Schedule
Our client is looking for a strong AI Systems Engineer to help design and build production-grade AI infrastructure, agent-based systems, and workflow automation.
You’ll work directly with the Head of AI on building the company’s internal AI ecosystem: multi-agent systems, RAG pipelines, AI-powered analytics, automation workflows, integrations, and operational AI tooling.
This role is ideal for someone who enjoys building real AI systems end-to-end — not just experimenting with prompts, but designing scalable architectures and shipping production-ready solutions.
Responsibilities
- Design and develop production AI/LLM systems
- Build agent-based workflows and multi-agent orchestration systems with memory, routing, and tool usage
- Develop end-to-end RAG pipelines and knowledge base systems:
- ingestion
- chunking
- embeddings
- retrieval
- vector databases
- citation tracing
- hallucination mitigation
- Build AI-powered workflow automation for marketing, analytics, operations, and reporting
- Integrate LLM systems with CRMs, APIs, external platforms, and internal tools
- Develop integrations using REST APIs, webhooks, and third-party services
- Participate in building unified analytics and attribution systems across multiple channels and platforms
- Support creative/content AI pipelines (image/video generation workflows)
- Improve reliability, observability, latency, and cost-efficiency of AI systems
- Deploy and maintain AI systems in production environments
- Work closely with the Head of AI on architecture and AI ecosystem development
Requirements
- 2+ years of hands-on experience building production AI/LLM systems (not educational or pet projects)
- Strong Python skills
- Experience building production backend systems and integrations
- Practical experience with:
- agent orchestration frameworks (LangGraph-class frameworks)
- tool/function calling
- structured outputs
- guardrails
- multi-agent systems
- Strong understanding of RAG systems and retrieval pipelines
- Experience with vector databases:
- Pinecone
- Qdrant
- pgvector
- Experience with PostgreSQL and data-layer architecture
- Understanding of:
- embeddings
- chunking strategies
- retrieval quality
- citation tracing
- hallucination mitigation
- Experience with workflow automation tools:
- n8n
- Make
- Zapier
- or similar
- Basic LLMOps understanding:
- evals
- observability (Langfuse-class tooling)
- latency optimization
- cost optimization
- fallback model chains
- Experience working with OpenAI and/or Anthropic Claude APIs
- Understanding of MCP (Model Context Protocol)
- Experience with REST APIs, webhooks, and external integrations
- English level: B1+
Nice to Have
- Experience with AI automation for marketing, analytics, or operations
- Experience with e-commerce ecosystems:
- Shopify
- Amazon
- TikTok Shop
- marketplace analytics
- Experience with attribution or marketing analytics systems
- Experience with image/video generation workflows or ComfyUI
- Understanding of privacy/compliance concepts (CCPA-class requirements)
- Portfolio or GitHub with production AI projects
What Matters in This Role
- Strong ownership mindset
- Ability to translate business problems into working AI architectures
- Production-oriented engineering mindset
- Ability to work independently in async environments
- Focus on measurable and reliable AI systems rather than experimental demos
We Offer
- Fully remote work from anywhere
- Flexible schedule
- Direct collaboration with the Head of AI
- Modern AI stack and real production AI challenges
- High ownership and technical freedom
- Paid vacation and sick days
Required skills experience
| Python / Backend Engineering | 4.5 years |
| LLM / AI systems | 1.5 years |
| Multi-Agent Systems | 1 year |
| LangGraph / Agent Orchestration Frameworks | 1.5 years |
| RAG Pipelines | 1.5 years |
+ 9 more
| Vector Databases (Qdrant / Pinecone / pgvector) | 1.5 years |
| PostgreSQL | 3.5 years |
| API Integrations / REST / Webhooks | 3.5 years |
| Workflow Automation (n8n / Make / Zapier) | 1 year |
| OpenAI / Claude APIs | 6 months |
| Structured Outputs / Tool Calling / Guardrails | 6 months |
| LLMOps / Observability / Evals | 6 months |
| AI System Architecture | 1 year |
| Production Deployments | 2 years |
Required languages
| English | B1 - Intermediate |
* FastAPI, * LangChain, LangGraph, * Docker, * AWS, GCP
Published 16 June
10 views
·
2 applications
📊
Average salary range of similar jobs in
analytics →
Loading...