DevOps/Site Reliability Engineer (offline)

Job description

CloudSimple is an international product company backed by leading investors (RedPoint, Mayfield, and Microsoft Ventures) and headquartered in the San Francisco Bay Area. We are building a large-scale distributed platform that manages private software-defined data centers fully integrated with world-leading public clouds.

As a member of the DevOps/SRE team, the candidate will face challenges that arise from building and evolving a large SaaS/PaaS system, including but not limited to:
• Automating the deployment of our control plane and monitoring stack into public clouds
• Setting up CI/CD pipelines and filling in their missing pieces


Key responsibilities of the Site Reliability Engineer include:

• You will be responsible for deployment automation, operations, and monitoring of our infrastructure, including the design and development of infrastructure automation
• You will get your hands dirty writing code and troubleshooting infrastructure and architectural challenges using your existing knowledge and toolkits
• You will use your advanced system architecture and administration skills to collaborate with engineering, product management, and test teams to architect and develop strategic and tactical solutions
• You will help develop requirements for customer onboarding processes, target environment sizing, and migration automation

Candidate Background

• 5+ years of experience producing state-of-the-art automation for Azure, GCP, or AWS public cloud platforms
• Practical programming experience with Python
• Deep technical roots in data center technologies:
• Large-scale Linux production environments, preferably as part of a cloud service provider environment
• Understanding of data center networking fabric topologies and commonly deployed architectures
• Deep understanding of cluster management systems such as Kubernetes and of Docker-based container deployments is required
• Practical experience working with the ELK stack, Cassandra, Kafka, Consul, and Vault is a plus
• Experience with Terraform and Jenkins is a plus
• Experience with CI/CD pipelines
• Knowledge of web services (REST API) and/or SDK integrations
• Knowledge of core infrastructure components such as LDAP, DNS, DHCP, etc.
• Basic knowledge of security tools and best practices
• Prior successful experience working in an innovative, fast-paced startup with a high rate of flux. The candidate must demonstrate strong entrepreneurial spirit and vigor
• Demonstrated proficiency in creating detailed technical design documents, facilitating design reviews, and executing design implementation projects
• Strong written and verbal English communication skills to work with the CloudSimple global team
• BS/MS degree in Computer Science or equivalent experience

What we offer:

• International environment with great people to work with
• A unique project with modern technologies
• Opportunities to make a difference and grow professionally
• Competitive compensation
• Long-term employment with paid vacation
• Sports and healthcare package (medical insurance, paid gym membership)

About CloudSimple

CloudSimple is an international product company backed by leading investors (RedPoint, Mayfield, and Microsoft Ventures) and headquartered in the San Francisco Bay Area. We are building a large-scale distributed platform that manages private software-defined data centers fully integrated with world-leading public clouds.

Our main tech stack:

- Kubernetes
- Docker
- Cassandra
- Elasticsearch
- Consul
- Kafka
- Prometheus
- Grafana
- Terraform
- Ansible
- Golang
- Python

Company website:
https://www.cloudsimple.com/

DOU company page:
https://jobs.dou.ua/companies/cloudsimple/

The job ad is no longer active
