Site Reliability Engineer (SRE)

Intetics Top Employer

We are looking for a skilled Site Reliability Engineer (SRE) to join our team. In this role, you will be responsible for maintaining the reliability, scalability, and performance of our production systems. You will work closely with software engineers and infrastructure teams to build and operate resilient distributed systems.

 

Key Responsibilities

  • Design, build, and maintain scalable and reliable infrastructure
  • Monitor system performance and troubleshoot issues in production environments
  • Automate operational tasks and improve system efficiency
  • Implement and manage CI/CD pipelines
  • Ensure high availability, fault tolerance, and disaster recovery
  • Collaborate with development teams to improve system design and reliability
  • Participate in on-call rotation and incident response
  • Define and track SLIs, SLOs, and SLAs

     

Requirements

  • Bachelor’s degree in Computer Science or related field (or equivalent experience)
  • 3+ years of experience in SRE, DevOps, or similar roles
  • Strong knowledge of Linux/Unix systems
  • Experience with cloud platforms (AWS, GCP, or Azure)
  • Proficiency in at least one programming language (Python, Go, Java, etc.)
  • Experience with containerization and orchestration (Docker, Kubernetes)
  • Familiarity with monitoring tools (Prometheus, Grafana, ELK stack, etc.)
  • Understanding of networking concepts and distributed systems

     

Nice to Have

  • Experience with Infrastructure as Code (Terraform, Ansible, etc.)
  • Knowledge of microservices architecture
  • Experience with incident management and postmortems
  • Security best practices knowledge

Required languages

English B2 - Upper Intermediate
Published 20 March
35 views
·
5 applications
To apply for this and other jobs on Djinni login or signup.
Loading...