Devops Engineer to $1800
QOVES is a cutting-edge biomedical startup dedicated to transforming cosmetic consultations through innovative, AI-powered software. We're VC funded and are looking for only the most ambitious people to join our small, efficient team.
We develop tools that predict post-surgery appearances and assist in clinical decision-making, aiming to streamline and enhance the patient experience in cosmetic clinics.
Why Qoves?
- Join a fast-growing, VC-backed startup where your work on infrastructure directly enables life-changing AI applications in the biomedical and aesthetics space.
- Own and scale critical production systems β your expertise will keep our AI models, APIs, and global infrastructure reliable, secure, and performant.
- Work with an ambitious, international engineering team solving tough problems across cloud, bare metal, and distributed systems.
- Gain hands-on experience with cutting-edge DevOps practices β from CI/CD pipelines and self-hosted deployments to observability and compliance.
- Access real opportunities for career progression and leadership in a high-impact environment, not just maintaining systems but shaping how they evolve.
- Thrive in a culture that values technical excellence, creative problem-solving, and adaptability, where infrastructure is treated as a core product pillar.
Role Description
We are seeking a highly capable DevOps Engineer who can take ownership of Qovesβ infrastructure. This role is critical: you will ensure uptime, security, and scalability across our systems while supporting the engineering team with reliable deployments.
Responsibility
Infrastructure Uptime & On-Call
- Manage and be on-call (with high availability) to ensure production systems never go down (avoiding lost sales and unhappy customers).
- Respond immediately to outages.
- Set up monitoring to be notified, or delegate to another team member when unavailable.
- Take full responsibility for infrastructure uptime.
- Perform root cause analysis (RCA) after incidents and write incident reports/postmortems to prevent recurrence.
Server & Cloud Management
- Set up and secure bare metal servers.
- Configure and manage AWS architecture (EC2, Networking, Lambdas, S3).
- Deploy dockerized assets such as AI models, backend/frontend applications, and microservices.
- Set up distributed infrastructure (e.g., deployment servers or images) across regions to optimize reliability and performance.
- Systemize and expand non-production environments to improve performance, reliability, and developer velocity.
CI/CD & Automation
- Design, implement, and manage CI/CD pipelines for applications and infrastructure (e.g., Jenkins, GitHub Actions, GitLab CI/CD, CircleCI).
- Automate deployment, monitoring, and management processes to reduce manual effort.
Security & Best Practices
- Apply cybersecurity best practices for securing infrastructure.
- Be comfortable with securing infrastructure at all levels.
- Implement and follow best practices in observability and monitoring.
Collaboration & Advisory
- Work with the engineering team to provide risk assessments and scale production assets cost-effectively.
- Advise on DevOps engineering tradeoffs and infrastructure decisions.
- Collaborate with developers to ensure smooth integration of new features into production.
Automation & Self-Hosting
- Set up bash scripts and Infrastructure as Code (IaC) (e.g., Terraform, Ansible, CloudFormation).
- Self-host applications as needed, with experience in modern deployment/management platforms (e.g., Dokploy, Coolify, Dokku, CapRover).
Observability, Monitoring & Compliance
- Implement strong logging and monitoring for compliance and reliability (e.g., Prometheus, Grafana, ELK/EFK, Datadog).
- Ensure proactive alerting and performance tracking across production and non-production environments.
- Stay up to date with emerging tools, practices, and technologies in DevOps and cloud engineering.
Requirements
- 5+ years of experience in DevOps/SRE/Infrastructure roles.
- Expertise with AWS, Docker, and distributed systems.
- Experience with containers and orchestration (Docker, Kubernetes).
- Strong scripting/programming skills (e.g., Bash, Python; Go or Ruby a plus).
- Hands-on experience with CI/CD tools (e.g., Jenkins, GitHub Actions, GitLab CI/CD, CircleCI).
- Hands-on experience with monitoring & logging tools (e.g., Prometheus, Grafana, ELK/EFK, Datadog).
- Proficiency with Git and branching strategies (e.g., GitFlow, trunk-based development).
- Solid understanding of networking, security principles, and Linux system administration.
- Experience with Infrastructure as Code (Terraform, Ansible, CloudFormation, etc.).
- Ability to set up and secure bare metal servers.
- Experience with self-hosting and deployment management platforms (e.g., Dokploy, Coolify, Dokku, CapRover).
- Familiarity with JavaScript is a plus.
- Strong analytical and problem-solving skills (candidates may be asked to complete an IQ or logical reasoning test).
- Self-driven, reliable, and capable of working independently in a remote environment.
- Startup experience and adaptability in fast-paced environments is a plus.
Our Culture & Benefits
- Impact: Play a pivotal role in solving real-world healthcare challenges with advanced technology.
- Innovation: Work in a dynamic, meritocratic environment where the best ideas win.
- Growth: Join a well-funded startup with a clear vision for success and significant opportunities for professional development.
Benefits Include:
- Performance and Quarterly Bonuses
- Flexible Work Arrangements (Hybrid or Remote)
- Professional Development Opportunities
Application Process
Selected candidates will be invited to submit a 30-second video introduction. These will serve as key criteria in our initial screening process, preceding the final interview.
Location & Timeline
- Remote: Strong candidates are welcome to work remotely.
- Application Deadline: 25/09/2025
- Expected Start Date: 5/10/2025
Required languages
English | C2 - Proficient |