Senior SRE/DevOps (Platform Tribe)

$$$$
Product

About the Role

 

Playson is a globally recognised iGaming supplier delivering a high-performance, microservices-based platform designed to process billions of financial transactions every day. Operating at high scale (5–7k RPS), our cross-regional infrastructure is built to deliver near-zero latency and a seamless player experience under constant load.

 

We’re looking for a Senior SRE / DevOps Engineer to join our Platform Tribe - a lean & senior team where ownership is high and expectations are even higher. This is a deeply hands-on role at the core of a high-traffic system, where you’ll be directly responsible for maintaining reliability, performance, and stability in a fast-paced environment.

 

You’ll be working on real-time production challenges, handling incidents, managing alerts, and being part of a critical on-call rotation. This role requires resilience, strong decision-making under pressure, and a proactive mindset to continuously improve systems operating at scale.

 

If you thrive in high-load environments, enjoy solving complex production issues, and want to have a direct impact on systems used by millions - this is the place for you.

 

Key Responsibilities

 

  • Own system reliability by actively monitoring platform health, managing alerts, and responding to incidents in real time
  • Participate in 24/7 on-call rotations, taking full ownership of production stability in a high-traffic (5–7k RPS) environment
  • Investigate incidents, perform root cause analysis, and implement long-term fixes to prevent recurrence
  • Build and continuously improve monitoring, alerting, and observability across the Kubernetes (EKS) ecosystem
  • Deploy, manage, and optimise infrastructure using Terraform, Helm, and GitOps tools (Flux/ArgoCD)
  • Drive automation and proactively improve system resilience, reducing manual intervention and recurring issues
  • Maintain and evolve CI/CD pipelines and infrastructure-as-code practices
  • Collaborate closely with engineering teams to support deployments and minimise user impact in a live environment
  • Introduce and integrate new tools and technologies to enhance scalability, reliability, and performance
  • Handle environment-specific requests and ensure smooth day-to-day platform operations under constant load

 

Requirements

 

  • Strong hands-on experience with Kubernetes (deployment, scaling, troubleshooting) in high-load environments
  • Experience with GitOps tools such as FluxCD or ArgoCD
  • Proven experience in incident response, root cause analysis, and postmortems in production systems
  • Solid experience with AWS, Terraform, Docker, and CI/CD pipelines
  • Experience with monitoring and observability tools such as Datadog, Prometheus, Grafana, and logging stacks like ELK or CloudWatch
  • Strong understanding of networking concepts and protocols
  • Proficiency in at least one scripting language (e.g. Python, Go, Node.js)
  • Experience working with version control systems (Git)
  • Familiarity with incident management tools like PagerDuty, Opsgenie, or similar
  • Ability to operate effectively in a fast-paced, high-pressure environment with strong ownership and accountability
  • Proactive, resilient mindset with a focus on continuous improvement and system stability

 

What We Offer

 

  • Competitive Salary: We offer a competitive salary, subject to annual performance reviews
  • Quarterly Bonuses: Benefit from a transparent and systematic quarterly bonus system
  • Unlimited Paid Vacation: Enjoy unlimited paid vacation leave, including Ukrainian bank holidays
  • Unlimited Paid Sick Leave: Take unlimited paid sick leave whenever necessary
  • Flexible Schedule: We offer a flexible work schedule to accommodate your needs
  • Remote Work: Choose to work remotely, providing greater flexibility and comfort
  • Medical Insurance: Receive comprehensive medical insurance for both you and a significant other
  • Financial Support for Life Events: We provide financial support during special life events
  • Professional Development: Get reimbursement for professional development courses and training
  • International exposure: Attend industry expos, team gatherings & global meet-ups
  • B2B contracts

 

Recruitment Process

 

1. HR Interview (30-45 min)

2. Interview with a Product Owner (60 min)

3. Technical interview (90 min)

4. Final Interview with C-level (60 min)

Required skills experience

Kubernetes 4 years
AWS 4 years
24 / 7 Operations 4 years
Highload 3 years
Kafka 2 years
Terraform 2 years

Required languages

Ukrainian Native
English B1 - Intermediate
Published 25 September 2025 Β· Updated 7 May
89 views
Β·
6 applications
Response activity: Medium
Last responded more than a month ago
See stats of candidates who applied for this job πŸ‘€
To apply for this and other jobs on Djinni login or signup.
Loading...