Site Reliability Engineer
What makes you a great fit:
- 3+ years of experience in an SRE, DevOps, or similar role in a SaaS/cloud-native environment.
- Strong experience with Kubernetes, and cloud-based distributed systems.
- Hands-on experience building or maintaining monitoring stacks such as Prometheus, Grafana, ELK, etc.
- Proficiency in Python, Bash, or similar scripting languages.
- Familiarity with CI/CD tools (e.g., GitHub Actions, Jenkins, ArgoCD).
- Solid analytical and problem-solving skills with a passion for operational excellence.
- Exposure to AI-based tooling (e.g., OpenAI API, LLM-based bots) to automate operations or enhance incident response processes.
- Upper-intermediate English level.
Will be a plus:
- Proficiency with AWS (EC2, S3, Lambda, Streaming, EMR, EKS).
- Experience with Infrastructure as Code tools (Terraform, Helm, etc.).
- Experience with incident management platforms (e.g., PagerDuty).
- Security-minded mindset and experience in the cybersecurity industry.
- Experience with service mesh, zero-downtime deployments, or chaos engineering.
- Contributions to AI-assisted SRE initiatives or platform operations & monitoring automation.
Your day-to-day in this position: - Design, implement, and maintain monitoring and alerting systems (e.g., Prometheus, Grafana) to detect and prevent reliability issues.
- Develop tools and automation (Python, Bash, etc.) for improving infrastructure reliability and operational efficiency.
- Collaborate with R&D and Product teams to embed reliability-first principles into every stage of the development process.
- Participate in and improve incident response processes, including running blameless postmortems and implementing preventive measures.
- Enhance our Infrastructure-as-Code (IaC) and CI/CD practices to streamline deployments and reduce risk.
- Maintain and extend internal AI-driven tools, such as bots that support SRE workflows (on-call management, triaging, etc.).
Document infrastructure, playbooks, and operational procedures to facilitate onboarding and knowledge sharing.
What makes this project exciting:Are you ready to join an innovative force in cybersecurity backed by one of the industry’s biggest names? Our client, recently acquired by Check Point for $200 million, is on a mission to transform external risk management. Imagine being part of a team that uses cutting-edge technology to protect businesses from the most dangerous cyber threats out there — monitoring the dark web, pinpointing vulnerabilities, and preventing data breaches.
This is more than just a job; it’s an opportunity to make a real impact in the world of cybersecurity. The pace is fast, the challenges are thrilling, and the solutions are AI-driven, putting you at the forefront of real-time threat detection. What’s more, with the support of a global powerhouse like Check Point, you’ll have the stability, resources, and career growth opportunities that only come with being part of a leader in the cybersecurity field!
Why work with us?- People-oriented management without bureaucracy
- The friendly climate inside the company is confirmed by the frequent comeback of previous employees
- Flexible working schedule
- 29 paid time off (18 working days per year, plus 11 days — all national holidays)
- 10 sick leave days
- Full financial and legal support for private entrepreneurs
- Free English classes with native speakers or with Ukrainian teachers (for your choice)
Dedicated HR
Our next steps:✅ Intro call with a Recruiter — ✅ Client intro interview — ✅ Tech interview — ✅ HR client interview — ✅Reference check — ✅ Offer
Required languages
English | B2 - Upper Intermediate |