Senior DevOps Engineer
This role is part of the Keyword.com team — one of the growing brands within saas.group.
Keyword.com is a keyword rank tracking and SERP & AI Visibility analytics platform that helps SEO teams track rankings and uncover opportunities that matter.
Profile Overview
As our Senior DevOps Engineer, you will own the infrastructure domain end-to-end and be a true game changer for our platform. You’ll take a complex, high-scale environment and turn it into a simpler, more reliable, and more developer-friendly system.
You’ll work closely with engineers and the business, enabling teams to ship faster and safer through better platform tooling, automation, and clear infrastructure standards. A major part of the role is leading a strategic migration toward a single-cloud setup (provider to be decided), while ensuring we keep the lights on and keep improving reliability.
Some of our stack: ArgoCD, GitHub Actions, Kubernetes (AWS & Hetzner), microservices, Terraform, Ansible, RabbitMQ, Redis, Kafka Connect, Debezium (MariaDB → ClickHouse CDC), Infisical.
Your immediate impact in the first 3–6 months will be:
- Establishing clear ownership and standards across a currently scattered infrastructure landscape (across multiple repos and patterns)
- Improving GitOps + CI/CD reliability and developer experience (faster, safer deployments; smoother rollbacks; better visibility)
- Creating and executing a pragmatic roadmap toward cloud consolidation, reducing operational risk while keeping delivery speed high
- Strengthening reliability for data-heavy systems and high-volume worker workloads (capacity, performance, cost, and stability)
Your responsibilities
- Own the platform: Take responsibility for our Kubernetes and cloud infrastructure, ensuring reliability, scalability, and predictable operations
- Lead cloud consolidation: Build the migration strategy, align stakeholders, and execute safely (incremental moves, clear milestones, minimal downtime)
- Raise the bar on delivery: Improve GitOps and CI/CD to make deployments safer, faster, and easier to debug
- Improve developer experience: Provide “golden paths” — templates, tooling, docs, and strong environment parity so teams can move quickly without breaking things
- Infrastructure-as-Code leadership: Build and maintain Terraform modules and automation patterns; reduce drift and eliminate manual steps
- Reliability & incident response: Improve observability and operational readiness (alerts, runbooks, postmortems, prevention work) and lead by example during incidents
- Reduce technical debt: Simplify architecture, unify patterns, and steadily remove risky/legacy infrastructure decisions
What you bring to the table:
- Strong hands-on experience running production Kubernetes, including real troubleshooting and operational ownership (managed and/or self-hosted)
- Proven experience with GitOps (ArgoCD or similar) and modern CI/CD (GitHub Actions or similar)
- Deep knowledge of Terraform (modules, state management, safe rollouts) and a practical automation mindset (Ansible, scripting)
- Solid understanding of cloud primitives (networking, IAM/security, storage, compute) and distributed system failure modes
- Experience improving reliability for high-scale systems (data-heavy workloads, large storage footprints, worker fleets, microservices)
- You’re proactive and collaborative — you like being close to developers and the business, and you translate platform work into real product impact
- Excellent written and verbal English communication (we collaborate async via Slack/GitHub)
Nice to have:
- Experience leading a cloud migration / consolidation (multi-cloud → single-cloud, or on-prem → cloud)
- Strong security posture experience (least privilege, secret handling, cluster hardening)
- Experience building internal platforms and self-service tooling for engineering teams
Note on the use of AI:
We’re AI enthusiasts and use it daily (Claude Team Premium, ChatGPT Enterprise, Gemini Enterprise). But we’re not “vibe coders”. We use AI thoughtfully — and we take everything that lands in our repositories seriously. Regardless of where code comes from, the engineer is responsible for it.
What’s in it for you:
- Ultimate flexibility: We’re remote-first
- Freedom and autonomy: High trust, high ownership, and the space to do your best work
- Real impact: You’ll own a mission-critical area and directly shape the product’s future
- Small, senior team: Work closely with developers and stakeholders; move quickly and ship meaningful improvements
Required skills and experience
| ArgoCD | 2 years |
| Kubernetes | 3 years |
| Microservices | 1 year |
| Terraform | 3 years |
| RabbitMQ | 3 years |
| ClickHouse | 6 months |
| AWS | 3 years |
Required languages
| English | C2 - Proficient |