Senior Site Reliability Engineer (DevOps)
Our customer:
Our customer is a leading provider of security and intelligence for unmanaged networks.
Responsibilities:
- Ensure the reliability, availability, and performance of critical systems;
- Develop and maintain automation scripts and tools to streamline operations;
- Develop and maintain monitoring dashboards and alerts;
- Lead incident response efforts and post-mortem analysis to prevent future occurrences;
- Optimize system performance and scalability;
- Implement and maintain security best practices;
- Create and maintain comprehensive documentation for systems and processes;
- Participate in on-call rotations to provide support for critical systems.
Required experience and skills:
- At least 5 years of experience as a Site Reliability Engineer;
- Experience in software engineering and systems administration;
- Proficiency in one or more programming languages, such as Python or Go;
- Experience with the AWS cloud platform;
- Hands-on experience with tools like Terraform, Ansible, or CloudFormation;
- Expertise in Docker and Kubernetes;
- Proficiency with monitoring tools such as Prometheus and Grafana;
- Proficiency with logging tools such as the ELK stack or the Loki stack;
- Experience with continuous integration and continuous deployment tools such as Jenkins, GitLab CI, or CircleCI;
- Strong understanding of networking concepts, protocols, and security;
- Bachelor's degree in Computer Science, Engineering, or a related field (advanced degrees are a plus);
- English: Upper-Intermediate+.
Working conditions:
- 5-day working week, 8-hour working day, flexible schedule;
- All public holidays are days off;
- Vacation and sick leave are covered by the company;
- Remote work.
The job ad is no longer active