Senior Software Engineer
Next-generation autonomous AI systems are being built to move far beyond simple interaction โ toward software capable of executing complex, real-world operations at scale. Backed by $5.1M from leading founders and venture investors, the company operates at the intersection of autonomous systems, distributed infrastructure, and applied AI, laying the foundation for truly self-operating digital entities.
At this pivotal stage of growth, we are looking for a Senior Software Engineer (Core Infrastructure) to play a central role in architecting and scaling the distributed systems that orchestrate thousands of intelligent AI agents in dynamic, real-world environments. This is an opportunity to shape the backbone of a platform designed for massive scale from day one โ where your infrastructure decisions directly impact how autonomous systems perform in production.
This is an on-site role based in San Francisco.
Your responsibilities will include:
Build a high-performance, fault-tolerant infrastructure layer that:
- Orchestrates and coordinates thousands of autonomous AI agents in real time;
- Scales reliably from early-stage deployments to large-scale production environments;
- Maintains strict data isolation, integrity, and security across system boundaries;
- Ensures resilience through monitoring, tracing, and automated recovery mechanisms;
- Provides deep observability into system performance, agent behavior, and operational health;
- Design and scale distributed backend systems using Python-based architecture;
- Develop robust asynchronous workflows and services supporting complex agent lifecycles;
- Optimize API rate limits and integrations across multiple model and data providers;
- Design and implement GPU-enabled infrastructure to improve cost efficiency and throughput;
- Build internal observability tooling and dashboards for real-time performance tracking;
- Architect systems with privacy, security, and confidentiality as first principles;
- Collaborate cross-functionally while protecting sensitive infrastructure and proprietary systems;
- Take ownership of critical infrastructure components as we move from stealth to scaled deployment.
What we expect from you:
- Proven experience scaling products or companies from 0โ1;
- Strong expertise in Python and distributed systems engineering;
- Deep understanding of scalability, fault tolerance, and data isolation principles;
- Hands-on experience with PostgreSQL, Redis, and asynchronous job queue systems;
- Ability to balance trade-offs between performance, cost, and reliability;
- High ownership mindset and comfort operating in a small, high-trust team;
- Commitment to maintaining security and discretion in sensitive production environments;
Preferred:
- Experience with GPU infrastructure or orchestration frameworks;
- Background in multi-tenant or agent-based architectures;
- Experience with observability, tracing, and distributed monitoring systems;
We offer:
- The opportunity to join at a critical inflection point as we scale beyond stealth;
- Ownership over core infrastructure powering large-scale autonomous systems;
- A technically ambitious environment designing for exponential growth from day one;
- Close collaboration with a small, highly skilled engineering team;
- Direct impact on systems operating in live production environments;
- A role where technical excellence, autonomy, and long-term thinking are highly valued.
Required languages
| English | C1 - Advanced |