Senior Site Reliability Engineer / Senior DevOps Engineer
We are seeking a Senior DevOps Engineer to join the Release Management team. Release Management is the backbone of product delivery, responsible for the design, installation, upgrade, and L3/L4 support of our entire product line, including Amelia (RPM & Cloud/K8s) and Autonomics.
In this role, you will not just be a body in a seat; we are looking for "brilliant brains" to help us scale. You will adopt our "1Click" philosophy—if a task needs to be done more than twice, you will automate it.
Your future tasks:
- Infrastructure & Cloud Management:
  - Manage and support installations across hybrid environments, including DSaaS (Dedicated SaaS), On-Premise, and Public Cloud (AWS, GCP, Azure, OCI).
  - Administer and maintain Kubernetes clusters (EKS, GKE, AKS) and Docker-based deployments.
  - Perform L3/L4 System Administration on Linux environments (Scientific Linux, RHEL 7/8/9), ensuring OS patching, security, and upgrades.
- Automation & CI/CD:
  - Develop and maintain Ansible playbooks and Terraform scripts to automate the spin-up of test infrastructure and product installation.
  - Manage CI/CD pipelines using Bamboo and Bitbucket to execute automated "1Click" upgrades and installations.
  - Script and automate release management processes, ensuring code upgrades pass smoothly from R&D to production.
- Database & Application Support:
  - Manage and support backend technologies including Percona (MySQL v8), Redis, OpenSearch, RabbitMQ, and HAProxy.
  - Oversee the deployment and maintenance of monitoring stacks, specifically ELK (Elasticsearch, Logstash, Kibana), Grafana, Prometheus, and Zabbix.
  - Support specialized telephony infrastructure components like Jambonz (open-source voice platform) and FreeSWITCH.
- Release Management & Reliability:
  - Execute Release Management (RM) processes, creating client-specific git repositories for inventory configurations, certificates, and overrides.
  - Oversee automated backup and restore procedures (using S3, MinIO, etc.) and ensure Disaster Recovery readiness.
  - Monitor upgrade success/failure rates via Jira and Slack integrations, intervening immediately to remediate exceptions.
- Client Success & Documentation:
  - Provide expert-level "White Glove" support during partner installs and upgrades, offering real-time troubleshooting.
  - Create and maintain easily consumable documentation in Confluence for both internal teams and external partners.
What we expect from you:
- Linux Expertise: Expert-level knowledge (L3/L4) of Linux administration (RHEL/CentOS family).
- Automation Skills: Proven experience with Ansible (playbooks) and Terraform for Infrastructure as Code.
- Container Orchestration: Strong experience with Kubernetes (K8s) and Docker in production environments.
- CI/CD Tools: Proficiency with Bamboo, Git, and Bitbucket for version control and deployment pipelines.
- Database Management: Experience supporting MySQL (Percona XtraDB Cluster) and Redis, plus familiarity with replication strategies.
- Web & Proxy: Experience configuring and managing Nginx, Apache, and HAProxy.
- Scripting: Proficiency in Shell scripting (Bash) and familiarity with Python or Java.
Preferred qualifications:
- Experience with Voice/Telephony technologies (SIP, FreeSWITCH, Jambonz).
- Familiarity with ELK Stack and Zabbix for monitoring and logging.
- Experience in a "Hybrid" software environment (supporting both SaaS and On-Premise installations).
- A mindset of "Don't break my stuff"—prioritizing stability and proactive testing (Eddie load testing) before deployment.
- You believe that "Today's latest-and-greatest is often tomorrow's floppy disk," and you are constantly re-evaluating technology stacks (e.g., migrating from CentOS to RHEL 9).
- You communicate effectively, capable of working with Delivery teams, R&D, and external Partners.
We offer:
- Remote-first work environment;
- Collaborative and motivated team;
- Impactful work improving patient treatment workflows;
- Professional growth with modern technologies;
- Autonomy and ownership of your work;
- Competitive compensation;
- Opportunity to contribute to future product phases.
Required skills and experience
| Linux | 5 years |
| RHEL | 5 years |
| DevOps | 5 years |
| Kubernetes | 4 years |
| Docker | 4 years |
| Terraform | 3 years |
| CI/CD | 4 years |
| On-Premise Infrastructure | 4 years |
Required languages
| English | C1 - Advanced |
| Ukrainian | Native |
Published 6 February