Site Reliability Engineer (SRE)
Responsibilities
- Ongoing support and maintenance of customer cloud environments
- Act as main point of contact (POC) for assigned customers
- Handle monitoring, alerts, incidents, and escalations
- Execute planned activities: DR drills, restore drills, upgrades, migrations
- Support EKS clusters: updates, migrations, dependency handling
- Assist during critical launches and production incidents
- Work hands-on with AWS infrastructure using Terraform
- Use Bash and Python for automation, troubleshooting, and tooling
- Identify improvement opportunities and share insights with internal teams
- Maintain and update internal knowledge base and documentation
- Promote a strong customer experience and build long-term trust
Requirements
- 3 years of experience
- Background in a Cloud MSP / AWS Partner environment (strong advantage)
- Hands-on experience with:
  - AWS
  - Terraform
  - EKS
  - Bash and/or Python
- Experience in support / operations / maintenance roles
- Strong analytical and troubleshooting skills
- Customer-facing mindset: a clear communicator who promotes a strong customer experience
- Comfortable managing multiple customers simultaneously
- Fast learner, detail-oriented
- Proven ability to work in a fast-paced support environment
- Good written and verbal communication skills in English
Required skills and experience
| AWS | 3 years |
| Terraform | 2 years |
| EKS | 2 years |
Required languages
| English | C1 - Advanced |
Published 20 March