Senior Site Reliability Engineer / Senior DevOps Engineer

We are seeking a Senior DevOps Engineer to join the Release Management team. Release Management is the backbone of the product delivery, responsible for the design, installation, upgrade, and L3/L4 support of our entire product line, including Amelia (RPM & Cloud/K8s) and Autonomics.

In this role, you will not just be a body in a seat; we are looking for "brilliant brains" to help us scale. You will adopt our "1Click" philosophy—if a task needs to be done more than twice, you will automate it.
 

Your future tasks:

  • Infrastructure & Cloud Management:
    • Manage and support installations across hybrid environments, including DSaaS (Dedicated SaaS), On-Premise, and Public Cloud (AWS, GCP, Azure, OCI).
    • Administer and maintain Kubernetes clusters (EKS, GKE, AKS) and Docker-based deployments.
    • Perform L3/L4 System Administration on Linux environments (Scientific Linux, RHEL 7/8/9), ensuring OS patching, security, and upgrades.
  • Automation & CI/CD:
    • Develop and maintain Ansible playbooks and Terraform scripts to automate the spin-up of test infrastructure and product installation.
    • Manage CI/CD pipelines using Bamboo and Bitbucket to execute automated "1Click" upgrades and installations.
    • Script and automate release management processes, ensuring code upgrades are passed smoothly from R&D to production.
  • Database & Application Support:
    • Manage and support backend technologies including Percona (MySQL v8), Redis, OpenSearch, RabbitMQ, and HAProxy.
    • Oversee the deployment and maintenance of monitoring stacks, specifically ELK (Elasticsearch, Logstash, Kibana), Grafana, Prometheus, and Zabbix.
    • Support specialized telephony infrastructure components like Jambonz (open-source voice platform) and Freeswitch.
  • Release Management & Reliability:
    • Execute Release Management (RM) processes, creating client-specific git repositories for inventory configurations, certificates, and overrides.
    • Oversee automated backup and restore procedures (using S3, Minio, etc.) and ensure Disaster Recovery readiness.
    • Monitor upgrade success/failure rates via Jira and Slack integrations, intervening immediately to remediate exceptions.
  • Client Success & Documentation:
    • Provide expert-level "White Glove" support during partner installs and upgrades, offering real-time troubleshooting.
    • Create and maintain easily consumable documentation in Confluence for both internal teams and external partners.


What we expect from you:

  • Linux Expertise: Expert-level knowledge (L3/L4) of Linux administration (RHEL/CentOS family).
  • Automation Skills: Proven experience with Ansible (playbooks) and Terraform for Infrastructure as Code.
  • Container Orchestration: Strong experience with Kubernetes (K8s) and Docker in production environments.
  • CI/CD Tools: Proficiency with Bamboo, Git, and Bitbucket for version control and deployment pipelines.
  • Database Management: Experience supporting MySQL (Percona XtraDB Cluster), Redis, and familiarity with replication strategies.
  • Web & Proxy: Experience configuring and managing Nginx, Apache, and HAProxy.
  • Scripting: Proficiency in Shell scripting (Bash) and familiarity with Python or Java.


Prefered qualifications:

  • Experience with Voice/Telephony technologies (SIP, Freeswitch, Jambonz).
  • Familiarity with ELK Stack and Zabbix for monitoring and logging.
  • Experience in a "Hybrid" software environment (supporting both SaaS and On-Premise installations).
  • A mindset of "Don't break my stuff"—prioritizing stability and proactive testing (Eddie load testing) before deployment.
  • You believe that "Today's latest-and-greatest is often tomorrow's floppy disk," and you are constantly re-evaluating technology stacks (e.g., migrating from CentOS to RHEL 9).
  • You communicate effectively, capable of working with Delivery teams, R&D, and external Partners.


We offer:

  • Remote-first work environment;
  • Collaborative and motivated team;
  • Impactful work improving patient treatment workflows;
  • Professional growth with modern technologies;
  • Autonomy and ownership of your work;
  • Competitive compensation;
  • Opportunity to contribute to future product phases.

Required skills experience

Linux 5 years
RHEL 5 years
DevOps 5 years
Kubernetes 4 years
Docker 4 years
Terraform 3 years
CI/CD 4 years
On-Premise Infrastructure 4 years

Required languages

English C1 - Advanced
Ukrainian Native
Published 6 February
16 views
·
1 application
To apply for this and other jobs on Djinni login or signup.
Loading...