Site Reliability Engineer (on-call, night shifts)

πŸ“ Location: Kyiv, Warsaw (Poland)
πŸ’Ό Employment type: Full-time
🧭 Work format: Office-based. On-call rotation 24/7


We are looking for an SRE Engineer to participate in an on-call rotation and join our tech team that powers thousands of customers around the globe. Our infrastructure spans GCP and Scaleway and includes various cloud-independent solutions, ensuring an exciting and diverse technical environment.


About us:

Stape β€” is a global product-driven IT company and the #1 leader in the server-side tracking market. We’re building a powerful, technically complex product that simplifies server-side tracking for marketers and website owners. Our platform processes over 10 billion requests daily, helping improve tracking accuracy and data privacy for more than 100, 000 clients worldwide. We work closely with top partners like Meta and Snapchat to provide advanced tracking capabilities.

 

Key tasks:

  • On-call rotation for 24/7 support of the main products and services.
  • Document issues and remediation steps.
  • At least 2+ years of experience as a CloudOps/SRE Engineer.
  • Uphold SLAs and SLOs by applying SRE best practices, including incident response, post-mortem analysis, and the creation of operational playbooks.
  • Prioritize customer focus in planning deployments/updates, ensuring minimal impact.
  • Enhance infrastructure health by implementing checks and scripts to address known issues.
  • Integrate new 3rd-party tools into our Cloud Infrastructure. (GCP, Scaleway, AWS)
  • Proactively create monitors within the GKE/K8s ecosystem.
  • Deploy new GKE/K8s clusters using Terraform and Helm/ArgoCD.


Your background:

  • 1+ years of extensive experience with Kubernetes (deployment, scaling, networking, troubleshooting).
  • Experience with monitoring tools like Prometheus, Grafana, and logging solutions like Grafana Loki Stack/Promtail or analogs.
  • Practical experience in supporting business applications (like NodeJS, PHP, GoLang, Python).
  • Strong understanding of networking concepts, protocols, and microservices architecture.
  • Proficiency in Git & Github or other version control.
  • Experience with issue processing (RCA, Postmortems).
  • Familiarity with incident response and management tools like BetterStack, PagerDuty, or others.


Will be a plus:

  • Familiarity with GCP/Scaleway, Terraform, Docker, Linux, CI/CD, PostgreSQL/MySQL.
  • Proficiency in at least one scripting language (e.g., Bash, Python).
  • Solid spoken and written English skills (ideally Upper-Intermediate level or higher).
  • Hands-on experience with high-load applications in production will be a big plus.
  • Familiarity with Cloudflare (Workers, WAF, managing DNS, and Cloudflare for SaaS).


We offer:

  • Innovative product: Make a meaningful difference by contributing to a globally recognized solution that shapes the future of the server-side tracking market.
  • Collaborative culture: Thrive in a friendly and open team environment that encourages initiative, creativity, and collaboration.
  • Cozy office in Kyiv: Join us at our office in the heart of the city near the Zoloty Vorota metro station, with up to 10 free taxi rides to ensure a smooth and hassle-free commute.
  • Career growth support: The company provides a dedicated budget for  your professional development.
  • Paid parental leave: Paid parental leave is available to support employees during key life moments, helping to maintain a healthy balance between work and family life.
  • Work-Life Harmony: Unlimited sick leave, 20 paid vacation days, and official Ukrainian holidays to help you stay healthy and recharge.


Excited to join us? Submit your CV and let’s get started!

Published 29 May
83 views
Β·
6 applications
100% read
Β·
100% responded
Last responded 3 days ago
To apply for this and other jobs on Djinni login or signup.