Senior SRE/DevOps


Project description

We are looking for a Senior Platform / SRE Engineer who brings strong real‑world engineering experience, understands how to operate high‑availability (four‑nines, 99.99%) business‑critical systems, and can drive best‑practice engineering across the team.
You will design, build, optimise, and support reliable, scalable platform components, working closely with engineering teams to ensure smooth delivery, observability, and robust production readiness.
You will provide pragmatic technical leadership, suggest improvements based on experience, and prioritise quality, maintainability, and automation, in line with the enterprise‑grade standards used company‑wide.

Responsibilities

Design and implement scalable, reliable platform and infrastructure solutions (cloud‑native, microservices‑based).

Build reusable, high‑quality engineering components and automation (CI/CD, IaC, deployment tooling).

Own production readiness: monitoring, logging, performance, SLO/SLA thinking, resilience engineering.

Support production systems, troubleshoot incidents, perform root‑cause analysis, and improve reliability.

Weigh risk against benefit when choosing between tactical and long‑term solutions, and recommend best‑practice approaches.

Lead and contribute to system design discussions; guide teams on architecture choices.

Ensure releases meet standards for a 99.99% availability environment.

Mentor engineers and raise engineering standards across the team.

Skills

Must have

Strong experience with Kubernetes (operations, scaling, networking).

Couchbase or other distributed NoSQL databases (replication, high availability).

NGINX (reverse proxying, load balancing).

Azure Cloud (networking, compute, security, managed services).

ArgoCD / GitOps.

Terraform / IaC.

CI/CD pipeline engineering (GitHub Actions / GitLab / Azure DevOps / Jenkins).

Strong understanding of production reliability, incident management, monitoring & instrumentation.

High‑quality engineering: clean code, reusable components, automation‑first mindset.

Ability to suggest improvements based on prior real‑world experience.

Nice to have

Go / Python scripting

Service mesh (Istio/Linkerd)

Kafka (streaming pipelines, reliability patterns)

Observability stack: Prometheus, Grafana, Loki, ELK

Security best practices (secret management, IAM, zero trust)

Languages

English: C1 Advanced
Ukrainian: Native
Published 20 April