Senior Site Reliability Engineer / Senior DevOps Engineer

We are seeking a Senior DevOps Engineer to join the Release Management team. Release Management is the backbone of product delivery, responsible for the design, installation, upgrade, and L3/L4 support of our entire product line, including Amelia (RPM & Cloud/K8s) and Autonomics.

In this role, you will not just be a body in a seat; we are looking for "brilliant brains" to help us scale. You will adopt our "1Click" philosophy: if a task needs to be done more than twice, you automate it.

Your future tasks:

Infrastructure & Cloud Management:
- Manage and support installations across hybrid environments, including DSaaS (Dedicated SaaS), On-Premise, and Public Cloud (AWS, GCP, Azure, OCI).
- Administer and maintain Kubernetes clusters (EKS, GKE, AKS) and Docker-based deployments.
- Perform L3/L4 System Administration on Linux environments (Scientific Linux, RHEL 7/8/9), ensuring OS patching, security, and upgrades.
Automation & CI/CD:
- Develop and maintain Ansible playbooks and Terraform scripts to automate the spin-up of test infrastructure and product installation.
- Manage CI/CD pipelines using Bamboo and Bitbucket to execute automated "1Click" upgrades and installations.
- Script and automate release management processes, ensuring code upgrades are passed smoothly from R&D to production.
Database & Application Support:
- Manage and support backend technologies including Percona (MySQL v8), Redis, OpenSearch, RabbitMQ, and HAProxy.
- Oversee the deployment and maintenance of monitoring stacks, specifically ELK (Elasticsearch, Logstash, Kibana), Grafana, Prometheus, and Zabbix.
- Support specialized telephony infrastructure components such as Jambonz (open-source voice platform) and FreeSWITCH.
Release Management & Reliability:
- Execute Release Management (RM) processes, creating client-specific git repositories for inventory configurations, certificates, and overrides.
- Oversee automated backup and restore procedures (using S3, MinIO, etc.) and ensure Disaster Recovery readiness.
- Monitor upgrade success/failure rates via Jira and Slack integrations, intervening immediately to remediate exceptions.
Client Success & Documentation:
- Provide expert-level "White Glove" support during partner installs and upgrades, offering real-time troubleshooting.
- Create and maintain easily consumable documentation in Confluence for both internal teams and external partners.

What we expect from you:

Linux Expertise: Expert-level knowledge (L3/L4) of Linux administration (RHEL/CentOS family).
Automation Skills: Proven experience with Ansible (playbooks) and Terraform for Infrastructure as Code.
Container Orchestration: Strong experience with Kubernetes (K8s) and Docker in production environments.
CI/CD Tools: Proficiency with Bamboo, Git, and Bitbucket for version control and deployment pipelines.
Database Management: Experience supporting MySQL (Percona XtraDB Cluster), Redis, and familiarity with replication strategies.
Web & Proxy: Experience configuring and managing Nginx, Apache, and HAProxy.
Scripting: Proficiency in Shell scripting (Bash) and familiarity with Python or Java.

Preferred qualifications:

Experience with Voice/Telephony technologies (SIP, FreeSWITCH, Jambonz).
Familiarity with ELK Stack and Zabbix for monitoring and logging.
Experience in a "Hybrid" software environment (supporting both SaaS and On-Premise installations).
A mindset of "Don't break my stuff"—prioritizing stability and proactive testing (Eddie load testing) before deployment.
You believe that "Today's latest-and-greatest is often tomorrow's floppy disk," and you are constantly re-evaluating technology stacks (e.g., migrating from CentOS to RHEL 9).
You communicate effectively and can work with Delivery teams, R&D, and external Partners.

We offer:

Remote-first work environment;
Collaborative and motivated team;
Impactful work improving patient treatment workflows;
Professional growth with modern technologies;
Autonomy and ownership of your work;
Competitive compensation;
Opportunity to contribute to future product phases.

Required skills and experience

Linux	5 years
RHEL	5 years
DevOps	5 years
Kubernetes	4 years
Docker	4 years
Terraform	3 years
CI/CD	4 years
On-Premise Infrastructure	4 years

Required languages

English	C1 - Advanced
Ukrainian	Native

The job ad is no longer active


Experience: 5 years or more
Full Remote
Countries where we consider candidates: United States

Employment: Full-time
Domain: Other
Outsource
