Lumnix

Lumnix

Joined in 2025
87% answers
Lumnix is a boutique technology talent agency that connects businesses with exceptional technology professionals from around the world. We specialize in sourcing, vetting, and managing top-tier remote talent to help startups and enterprises build high-performing, distributed technology teams.
  • · 39 views · 8 applications · 1d

    Senior Site Reliability Engineer (SRE)

    Full Remote · Countries of Europe or Ukraine · 7 years of experience · B2 - Upper Intermediate
    About Corvex Corvex delivers unparalleled cloud-based AI infrastructure, featuring cutting-edge NVIDIA GPUs that combine exceptional reliability, security, performance, and value. We're building a world-class experience for developers and data scientists...

    About Corvex

    Corvex delivers unparalleled cloud-based AI infrastructure, featuring cutting-edge NVIDIA GPUs that combine exceptional reliability, security, performance, and value. We're building a world-class experience for developers and data scientists across enterprise and AI-native organizations-empowering professionals to focus exclusively on training, fine-tuning, and inference of their AI models, while we manage the nuts and bolts of our premium infrastructure.
     

    Company and Position Description

    Corvex is seeking a Senior Site Reliability Engineer to help design, build, and operate our next-generation AI infrastructure platform. You will work across infrastructure-as-code, automation, Kubernetes, and private cloud environments. This role requires strong technical judgment, the ability to troubleshoot complex distributed systems, and the written communication skills necessary for clear, professional client interaction.

    This position requires 5-6 hours of overlap with US Eastern Time and participation in a rotating on-call schedule (1 week every 6 weeks), which should be considered in compensation expectations.
     

    What You’ll Do

    • Lead the design, deployment, and maintenance of infrastructure using Terraform and Ansible
    • Build, operate, and optimize Kubernetes clusters
    • Troubleshoot production issues across systems, networks, and platforms
    • Work with private cloud platforms (OpenStack strongly preferred)
    • Drive automation and improvements to CI/CD pipelines and operational tooling
    • Collaborate with engineering teams on reliability, scalability, and architectural decisions
    • Generate clear, high-quality technical documentation and client-facing communication
    • Participate in on-call rotation and refine incident response processes
       

    What We’re Looking For

    • 7+ years of experience in SRE, DevOps, or Systems Engineering roles
    • Strong experience with Terraform, Ansible, and Kubernetes
    • Excellent troubleshooting skills across Linux, networking, and distributed systems
    • Experience with OpenStack or other private cloud environments (commercial or non-commercial)
    • Excellent written English; able to communicate professionally with clients
    • Ability to work effectively with US-East time zone overlap
    • Willingness to participate in on-call rotation (1 week every 6 weeks)
       

    What We Offer

    • Competitive salary
    • A chance to help define a new category of AI infrastructure
    • Greenfield architecture - build the product you’ve always wanted to use
    • High trust and autonomy, with deep impact on platform direction
    • Remote-first culture with the option to collaborate in person as we scale
    • Small, highly skilled team and zero bureaucracy.
    More
  • · 78 views · 19 applications · 15h

    Mid-Level Site Reliability Engineer (SRE)

    Full Remote · Countries of Europe or Ukraine · 3 years of experience · B1 - Intermediate
    About Corvex Corvex delivers unparalleled cloud-based AI infrastructure, featuring cutting-edge NVIDIA GPUs that combine exceptional reliability, security, performance, and value. We're building a world-class experience for developers and data scientists...

    About Corvex

    Corvex delivers unparalleled cloud-based AI infrastructure, featuring cutting-edge NVIDIA GPUs that combine exceptional reliability, security, performance, and value. We're building a world-class experience for developers and data scientists across enterprise and AI-native organizations-empowering professionals to focus exclusively on training, fine-tuning, and inference of their AI models, while we manage the nuts and bolts of our premium infrastructure.
     

    Company and Position Description

    Corvex is hiring a Mid-Level Site Reliability Engineer with 3-5 years of experience to support the development and operation of our AI-focused cloud platform. You will work across automation, infrastructure-as-code, Kubernetes, and private cloud environments, while helping ensure high reliability and performance. Strong written communication is essential, as you will interact with distributed engineering teams and occasionally with clients.

    This role requires 5-6 hours of overlap with US Eastern Time and participation in a rotating on-call schedule (1 week every 6 weeks).
     

    What You’ll Do

    • Support infrastructure-as-code workflows using Terraform and Ansible
    • Assist in building and operating Kubernetes clusters
    • Troubleshoot system, network, and infrastructure issues
    • Contribute to automation, monitoring, and CI/CD pipeline enhancements
    • Work with private cloud environments (OpenStack strongly preferred)
    • Produce clear, professional internal and client-facing written communication
    • Participate in the on-call rotation and help improve incident response
       

    What We’re Looking For

    • 3-5 years of experience in SRE, DevOps, Systems Engineering, or similar roles
    • Hands-on experience with Terraform, Ansible, and Kubernetes
    • Solid troubleshooting skills across Linux and networks
    • Familiarity with OpenStack or other private cloud technologies (non-commercial experience welcome)
    • Strong written English and professional communication skills
    • Ability to maintain required overlap with US Eastern Time
    • Willingness to participate in on-call rotation (1 week every 6 weeks)
       

    What We Offer

    • Competitive salary
    • A chance to help define a new category of AI infrastructure
    • Greenfield architecture - build the product you’ve always wanted to use
    • High trust and autonomy, with deep impact on platform direction
    • Remote-first culture with the option to collaborate in person as we scale
    • Small, highly skilled team and zero bureaucracy

     

    More
Log In or Sign Up to see all posted jobs