Senior DevOps / Infrastructure Engineer (AWS, Terraform)
We are looking for a Senior DevOps / Cloud Infrastructure Engineer to design, build, and operate AWS infrastructure for an agentic research assistant in the Pharma / Life Sciences domain, with planned expansion into other research areas.
This is a hands-on infrastructure ownership role, covering cloud architecture, Kubernetes operations, CI/CD, and production reliability.
We are recruiting on behalf of the client. Full project details will be shared with shortlisted candidates.
Project Scope
- Design, build, and operate AWS infrastructure for a production-grade agentic research assistant
- Own Infrastructure as Code (Terraform) across multiple environments (dev / stage / prod)
- Build and maintain CI/CD pipelines for infrastructure and application deployments
- Ensure reliability, security, observability, and cost control
- Prepare runbooks, support incident response, and improve operational maturity
- Collaborate closely with backend and data teams (Python / Java, APIs, data products)
Responsibilities
- Design and operate scalable, secure, and reliable AWS infrastructure
- Own and evolve the Terraform codebase (modules, environments, state)
- Build CI/CD pipelines using GitHub Actions for infra and app deployments
- Operate and support Kubernetes (EKS) clusters in production
- Implement observability, alerting, and operational best practices
- Participate in incident handling, root cause analysis, and postmortems
- Work closely with engineering teams to support deployment and runtime needs
Must-Have Requirements
AWS (Production Experience)
Hands-on experience with:
- EKS: cluster setup and operations, node groups, upgrades, autoscaling, ingress, IAM/OIDC
- ECR: image publishing, lifecycle policies, CI integration
- VPC: subnets, routing, NAT / IGW, security groups, NACLs, endpoints, peering (as needed)
- RDS (PostgreSQL): provisioning, HA, backups & restore, basic tuning, monitoring
- Neptune: operational understanding (networking, access control, backups, monitoring)
- EC2: supporting workloads or legacy components when required
- IAM: least-privilege access models, roles & policies, secure access patterns
Infrastructure as Code (Terraform)
Strong hands-on experience with:
- Terraform module design
- State management and remote backends
- Workspaces / environment separation
- Plan / apply workflows
- Secrets and configuration management patterns
- Change management and safe rollouts
Kubernetes & Packaging
- Strong Kubernetes operations experience
- Helm for packaging and deployments (charts, values management, release lifecycle)
CI/CD
Practical experience with GitHub Actions:
- Build / test / deploy pipelines
- Environment promotion and approvals
- Secrets management
- Infrastructure pipelines (terraform fmt / validate / plan / apply)
Operations & Quality
- Observability fundamentals: logging, metrics, tracing, alerting
- SLO / SLA thinking
- Incident handling, postmortems, and runbooks
- Security best practices: secure networking, secrets handling, access control, auditability
Nice-to-Have (Strong Plus)
- Experience with event-driven or distributed systems
- Familiarity with Python (enough to support infra tooling and collaborate on deployments)
- Experience supporting Java services in containerized / Kubernetes environments
- Strong AWS cost optimization skills (rightsizing, autoscaling, storage lifecycle)
Engagement Details
- Level: Senior
- Domain: Pharma / Life Sciences (research-focused product)
- Work format: Remote
- Recruitment: On behalf of the client (details shared later)
Required Languages
- English: C2 (Proficient)