Site Reliability Engineer (SRE)
Intetics
Top Employer
We are looking for a skilled Site Reliability Engineer (SRE) to join our team. In this role, you will be responsible for maintaining the reliability, scalability, and performance of our production systems. You will work closely with software engineers and infrastructure teams to build and operate resilient distributed systems.
Key Responsibilities
- Design, build, and maintain scalable and reliable infrastructure
- Monitor system performance and troubleshoot issues in production environments
- Automate operational tasks and improve system efficiency
- Implement and manage CI/CD pipelines
- Ensure high availability, fault tolerance, and disaster recovery
- Collaborate with development teams to improve system design and reliability
- Participate in on-call rotation and incident response
- Define and track SLIs, SLOs, and SLAs
Requirements
- Bachelor’s degree in Computer Science or a related field (or equivalent experience)
- 3+ years of experience in SRE, DevOps, or similar roles
- Strong knowledge of Linux/Unix systems
- Experience with cloud platforms (AWS, GCP, or Azure)
- Proficiency in at least one programming language (Python, Go, Java, etc.)
- Experience with containerization and orchestration (Docker, Kubernetes)
- Familiarity with monitoring tools (Prometheus, Grafana, ELK stack, etc.)
- Understanding of networking concepts and distributed systems
Nice to Have
- Experience with Infrastructure as Code (Terraform, Ansible, etc.)
- Knowledge of microservices architecture
- Experience with incident management and postmortems
- Knowledge of security best practices
Required languages
| English | B2 - Upper Intermediate |
Published 20 March