Lead Site Reliability Engineer (offline)

As a company, we want to grow our customer base. We expect that the number of transactions will significantly increase. Therefore we want to run a new team within the company that will focus on continuous product improvement based on metrics, SLO and error budget.

In this role, you will:

Build SRE team from scratch
Introduce and drive principles of how to create scalable and highly reliable software systems inside the organization
Contribute to the product’s strategy from a SRE perspective
Set levels of SRE team engagement over the course of a service’s life
Remove SRE team’s obstacles and make sure the team has everything they need to be successful
Contribute to setting reasonable SLO targets
Make sure company has gone through all the necessary stages of SRE practices implementation
Work closely with Product engineering teams to build the best of breed platforms to run our services
Be in tight communication with the Emergency team
Help Product engineering teams to investigate production issues

About you:
2+ years of team leadership experience
1+ years of experience of SRE practices usage in production
Track record in infrastructure automation, cloud services, and tooling (Docker, K8S, Ansible, Terraform)
5+ years of experience in Linux administration
4+ years of experience with cloud providers (AWS preferable)
Hands-on experience with software solutions’ architecture design
Experience with logging/monitoring/alerting solutions (ELK, TICK, Prometheus)
Comfortable with messaging systems (RabbitMQ, Kafka)
Experience with container orchestration (Kubernetes)
Development skills in scripting/programming language (Python, Java, Bash)
Experience and interest in infrastructure as a code approach
Experience working with CI/CD pipelines (Jenkins, Gitlab)
Solid knowledge and hands-on experience in Microservices architecture
Curiosity for technology and the ability to balance the trade-offs between engineering and business outcomes

Benefits:
An honest and open feedback culture and individual development opportunities
An opportunity to work from anywhere — our team is distributed worldwide, from Minsk to Manila, from Florida to California
Personal, yearly budget for educational courses, conferences, etc.
Competitive salary
Medical insurance
And much more!

About PandaDoc

PandaDoc is an all-in-one document automation solution with advanced capabilities, but simple and easy to use for teams of all sizes

Company website:
pandadoc.com

The job ad is no longer active
Job unpublished on 1 April 2021

Look at the current jobs DevOps Kyiv→