Staff Distributed Systems Engineer
About Corvex
Corvex delivers unparalleled cloud-based AI infrastructure, featuring cutting-edge NVIDIA GPUs that combine exceptional reliability, security, performance, and value. We're ready to build a world-class experience for developers and data scientists across enterprise and AI-native organizations that will enable professionals to focus exclusively on training, fine-tuning, and inference of their AI models, while we manage the nuts and bolts of our premium infrastructure.
Position Description
We’re looking for experienced engineers who have built scalable, distributed systems and strong Kubernetes expertise to join our team. You’ll help build the orchestration and middleware layer that unleashes the power of our world-class inference engine across a fleet of high-performance compute.
What You’ll Do
- Design and build distributed backend systems using Go, with optional contributions in Python, Rust, or Java
- Develop and maintain Kubernetes controllers and operators that automate lifecycle management for model inference jobs
- Build orchestration logic to route workloads efficiently across GPU fleets, leveraging custom scheduling and service registration patterns
- Collaborate on the early design of our public APIs
- Collaborate with our research and development teams as we refine the engine and public-facing endpoints for our
- Participate in architecture discussions, code reviews, and shared design documents
What We’re Looking For
- 10+ years of experience in backend or systems development
- Strong experience with Kubernetes, including writing or maintaining custom operators/controllers
- Hands-on experience with distributed systems, job scheduling, or workflow orchestration
- Familiarity with monitoring and alerting frameworks with an emphasis on Prometheus and Grafana
- Comfort working in Go and experience debugging real-world infrastructure issues
- Ability to own complex systems end-to-end - from API design to deployment
- Strong communication skills and a self-directed mindset suited to fast-moving startup environments
What We Offer
- Competitive salary with meaningful equity
- A chance to help define a new category of AI infrastructure
- Greenfield architecture - build the product you’ve always wanted to use
- High trust and autonomy, with deep impact on platform direction
- Remote-first culture with the option to collaborate in person as we scale
- Small, highly skilled team and zero bureaucracy
Required languages
English | C1 - Advanced |