Senior Site Reliability Engineer (DevOps) Offline

Our customer:

Our customer is a leading provider of security and intelligence for unmanaged networks.

 

Responsibilities:

β€” Ensure the reliability, availability, and performance of critical systems;

β€” Develop and maintain automation scripts and tools to streamline operations;

β€” Develop and maintain monitoring dashboards & alerts;

β€” Lead incident response efforts and post-mortem analysis to prevent future occurrences;

β€” Optimize system performance and scalability;

β€” Implement and maintain security best practices;

β€” Create and maintain comprehensive documentation for systems and processes;

β€” Participate in on-call rotations to provide support for critical systems.

 

Required experience and skills:

β€” At least 5+ years of experience as a Site Reliability Engineer;

β€” Experience in software engineering and systems administration;

β€” Proficiency in one or more programming languages, such as Python, Go;

β€” Experience with AWS cloud platform;

β€” Hands-on experience with tools like Terraform, Ansible, or CloudFormation;

β€” Expertise in Docker and Kubernetes;

β€” Proficiency with monitoring tools like Prometheus, Grafana;

β€” Proficiency with logging tools like ELK stack or Loki stack;

β€” Experience with continuous integration and continuous deployment tools such as Jenkins, GitLab CI, or CircleCI;

β€” Strong understanding of networking concepts, protocols, and security;

β€” Bachelor’s degree in computer science, Engineering, or a related field. Advanced degrees are a plus;

β€” English β€” Upper-Intermediate+.

 

Working conditions:

  • 5-day working week, 8-hour working day, flexible schedule;
  • All public holidays are days off;
  • Vacation and sick leave are covered by the company;
  • Remote work.

The job ad is no longer active

Look at the current jobs DevOps β†’