Site Reliability Engineer (offline)

Must haves

5+ years of experience in Site Reliability engineering and/or DevOps.

Strong understanding of Kubernetes.

Experience with Infrastructure-as-a-Code, Terraform.

Understanding of Linux network stack, REST, HTTP, and TCP/IP protocols.

Experience with Google Cloud Platform.

Deep knowledge and practical experience with Docker.

Excellent troubleshooting skills, root cause analysis, and technical decision reasoning.

Good communication skills in written and verbal English.

Work across multiple work streams and projects, independently with the minimum supervision.

 

As a plus

Experience with Ruby scripting.

Experience with JVM stack.

Experience with Fastlane and mobile CI/CD automation experience.

Prior experience with GitLab, setup and scaling of CI/CD build runners.

 

Key responsibilities

Scale the infrastructure up to stay resilient to higher loads.

Help product teams with migrating to microservices.

Help product teams with improving production monitoring and alerting.

Contribute to software architecture and key technical decisions.

Optimize continuous delivery pipelines.

Improve overall developer's experience from our current CI/CD workflows.

Create new and improve existing standards and documentation to make things more predictable and accessible.

Similar jobs

Europe except Ukraine