Senior Software Engineer

Next-generation autonomous AI systems are being built to move far beyond simple interaction — toward software capable of executing complex, real-world operations at scale. Backed by $5.1M from leading founders and venture investors, the company operates at the intersection of autonomous systems, distributed infrastructure, and applied AI, laying the foundation for truly self-operating digital entities.

At this pivotal stage of growth, we are looking for a Senior Software Engineer (Core Infrastructure) to play a central role in architecting and scaling the distributed systems that orchestrate thousands of intelligent AI agents in dynamic, real-world environments. This is an opportunity to shape the backbone of a platform designed for massive scale from day one — where your infrastructure decisions directly impact how autonomous systems perform in production.

This is an on-site role based in San Francisco.

Your responsibilities will include:

Build a high-performance, fault-tolerant infrastructure layer that:

Orchestrates and coordinates thousands of autonomous AI agents in real time;
Scales reliably from early-stage deployments to large-scale production environments;
Maintains strict data isolation, integrity, and security across system boundaries;
Ensures resilience through monitoring, tracing, and automated recovery mechanisms;
Provides deep observability into system performance, agent behavior, and operational health;
Design and scale distributed backend systems using Python-based architecture;
Develop robust asynchronous workflows and services supporting complex agent lifecycles;
Optimize API rate limits and integrations across multiple model and data providers;
Design and implement GPU-enabled infrastructure to improve cost efficiency and throughput;
Build internal observability tooling and dashboards for real-time performance tracking;
Architect systems with privacy, security, and confidentiality as first principles;
Collaborate cross-functionally while protecting sensitive infrastructure and proprietary systems;
Take ownership of critical infrastructure components as we move from stealth to scaled deployment.

What we expect from you:

Proven experience scaling products or companies from 0→1;
Strong expertise in Python and distributed systems engineering;
Deep understanding of scalability, fault tolerance, and data isolation principles;
Hands-on experience with PostgreSQL, Redis, and asynchronous job queue systems;
Ability to balance trade-offs between performance, cost, and reliability;
High ownership mindset and comfort operating in a small, high-trust team;
Commitment to maintaining security and discretion in sensitive production environments;

Preferred:

Experience with GPU infrastructure or orchestration frameworks;
Background in multi-tenant or agent-based architectures;
Experience with observability, tracing, and distributed monitoring systems;

We offer:

The opportunity to join at a critical inflection point as we scale beyond stealth;
Ownership over core infrastructure powering large-scale autonomous systems;
A technically ambitious environment designing for exponential growth from day one;
Close collaboration with a small, highly skilled engineering team;
Direct impact on systems operating in live production environments;
A role where technical excellence, autonomy, and long-term thinking are highly valued.

Required languages

English

C1 - Advanced

Published 18 February

66 views

8 applications

To apply for this and other jobs on Djinni login or signup.

Only from 3 years of experience
Office Work
Worldwide
Countries where we consider candidates
- English C1 - Advanced

Python

Employment: Fulltime
Domain: Other
Agency
Office: United States

Apply for the job

📊 Average salary range of similar jobs in analytics →