Senior Amazon Bedrock Engineer, AI Infrastructure and Cost Optimization
Location: Remote (U.S.) | Type: Full-time or Contract
Company: Aesthetics360 (A360)
About A360
Aesthetics360 is an AI-native platform transforming elective medicine.
We combine domain-trained models, live transcription, and automated marketing into one continuous intelligence system.
Every workflow, from consult to care plan, runs on AWS, with Amazon Bedrock at the core of our generative-AI architecture.
We're scaling fast and need a Bedrock specialist who can make our inference layer faster, cheaper, and smarter.
The Role
You'll own Bedrock orchestration inside A360: optimizing prompt routing, model performance, and API utilization, and pairing that work with DevOps rigor and cost governance.
You'll design scalable, secure, and financially efficient ways to serve multi-model workloads (Claude, Titan, Mistral, Llama 3 via Bedrock) to thousands of real-time users.
What You'll Do
- Architect, deploy, and optimize Bedrock-based LLM pipelines powering transcription, summarization, and consultation workflows.
- Implement multi-model orchestration and intelligent routing for cost/performance balance (e.g., Haiku → Claude 3 → Titan RAG chains).
- Build automated monitoring, logging, and cost dashboards for token usage, latency, and throughput (CloudWatch + Cost Explorer + custom metrics).
- Engineer caching and batching strategies to reduce Bedrock inference cost per request.
- Manage CI/CD for prompt chain deployment using AWS CDK, Fargate, and Lambda.
- Collaborate with AI R&D to profile prompt performance, fine-tune templates, and benchmark models.
- Enforce FinOps best practices: budget alerts, anomaly detection, and savings-plan recommendations.
- Ensure enterprise-grade uptime, compliance, and security across Bedrock environments.
What You Bring
- Deep expertise in Amazon Bedrock (multi-model orchestration, API integration, cost tuning).
- 5+ years of hands-on AWS architecture or DevOps experience (ECS, Lambda, Aurora Serverless, API Gateway).
- Proficiency with AWS CDK / Terraform, CloudWatch, and Cost Explorer APIs.
- Experience optimizing token budgets, latency, and context-window trade-offs in LLMs.
- Familiarity with LangChain, SageMaker, or Supabase is a plus.
- Strong Python and shell scripting skills; ability to build internal CLI tools for cost and performance audits.
- Mindset: equal parts AI architect and FinOps engineer; you design with performance and price in mind.
Required languages
| English | B2 - Upper Intermediate |