SRE/DevOps Engineer (offline)

We are looking for a DevOps/Site Reliability Engineer with some cloud platform experience. This individual will be responsible for operating and maintaining production clusters and will collaborate with other team members to develop automation strategies.

Responsibilities:

- Define and maintain SLOs and SLIs.
- Maintain and develop the platform’s real-time monitoring and alerting.
- Support the development team’s reliability efforts and build a solid reliability culture within the development lifecycle.
- Drive incident management, problem management, and root cause analysis.
- Develop, monitor, and respond to all alerts for production systems.
- Develop troubleshooting documentation for production support resources.
- Support and expand our AWS- and Azure-hosted infrastructure. You will monitor, proactively plan, and configure aspects of the infrastructure, and liaise with internal and external parties to improve performance and reliability.

Desired skills and experience:

- Bachelor's Degree or MS in Engineering or equivalent;
- Experience in operating a container orchestration cluster (Kubernetes, Docker Swarm);
- Experience developing or maintaining software for production services at scale;
- Experience with AWS (Azure will be considered a plus);
- Experience with Continuous Integration tools (GitLab CI, Jenkins);
- Strong scripting skills (Bash, Python or Go);
- Excellent communication skills;
- Experience with Terraform;
- Thinking outside the box and anticipating challenges - It is imperative that we are not simply reactive. We must expect challenges and question the technologies, procedures, and thinking already in place, and you will be expected to constantly review and challenge at all levels.
- Versatility - We work with agile/lean methods. We'd much rather iterate and learn than assume we know all the answers.
- Engineering excellence - Constantly looking for new ways to improve our infrastructure and leverage the capabilities AWS, Azure (and others if needed) provide us with.
- Being a team player - You don't (always) work in isolation and are excited by the thought of working with your team whilst involving product, experience design, engineering and more in the process.

Will be considered a plus:
- Telephony knowledge (SIP, VoIP);
- Experience in Linux Administration (RedHat, CentOS);
- Working knowledge in Configuration Management tools (Ansible);
- RDBMS knowledge (MySQL, Postgres);
- NoSQL knowledge (Redis);
- Experience with ELK;
- Experience with Cloud Security Services.

We offer:

- Fixed compensation;
- Long-term employment with 24 working days vacation;
- Professional development (courses, training, etc.);
- Being part of successful cutting-edge technology products that are making a global impact in the service industry;
- Proficient and fun-to-work-with colleagues;
- Apple gear.

About QuickStarter AI

QuickStarter AI is a committed, goal-oriented team of industry professionals, united by the drive to deliver the best possible results. We reward the performance and loyalty of our professionals. Our developers have already found their best job. QuickStarter AI stands for complete openness and transparency towards our customers and partners.

We provide you with a team of senior developers who need minimal time to deliver the best possible result. We are convinced that real value comes from experience and specialization. If you agree, we can succeed together.

QuickStarter AI:

- provides high-quality software development services
- provides AI familiarization services
- creates, trains and manages multifunctional teams of technical specialists
- offers a flexible collaboration structure and integrates seamlessly into our customers' IT environments in a scalable and cost-effective manner

The job ad is no longer active

  • Category: Other
  • English: Upper-Intermediate
  • 5 years of experience
  • Full Remote