Join Aranya.tech to build ClusterdOS, a GitOps-native distributed operating system making Kubernetes accessible for the inference era. You will architect and maintain production-grade clusters across bare metal and cloud environments, bridging the gap between complex distributed systems and user-friendly computing. This is a high-impact role at a technical, founder-led startup.
Kubernetes DevOps Engineer at Aranya.tech — Seed-stage distributed OS startup
Aranya.tech is reinventing the distributed operating system for the AI era. We're looking for a Kubernetes DevOps Engineer in San Francisco to join our MIT-founded team and build the infrastructure powering the next generation of GPU inference. If you have deep experience building clusters from scratch on bare metal and want to shape a GitOps-native platform from the seed stage, this is your chance to make a massive impact. Join us in making distributed computing as accessible as the PC and help scale ClusterdOS to production fleets worldwide.
Overview
Why this role stands out
Company
Aranya.tech - Seed-stage MIT-founded AI infrastructure startup
Responsibilities
What you will do
- Build and orchestrate production Kubernetes clusters from scratch across bare metal, cloud, and hybrid environments with a focus on high-performance GPU workloads.
- Design and implement robust GitOps workflows using ArgoCD and Ansible to manage the full lifecycle of distributed infrastructure for global customers.
- Debug complex distributed systems issues under pressure, and maintain an observability stack built on Loki, Grafana, Tempo, and Mimir (LGTM) to support zero-downtime upgrades.
Candidate profile
Who this is a fit for
- Possesses deep Kubernetes expertise, including experience building clusters from the ground up rather than simply deploying to existing managed services.
- Has a strong foundation in Infrastructure-as-Code and GitOps methodologies, specifically with tools like ArgoCD, Ansible, and GitLab CI.
- Is comfortable managing bare metal infrastructure and distributed storage systems like Ceph, ideally with experience in early-stage startup environments.
What makes it remarkable
Why this role is remarkable
- Work on the cutting edge of AI infrastructure by building a distributed OS designed specifically for the next generation of GPU inference companies.
- Join a high-caliber technical team founded by MIT alumni at a seed-stage startup with early production traction and strong venture backing.
- Influence the core architecture of a platform aiming to make distributed computing as accessible as personal computers were in the previous era.
Jack & Jill
How Jack & Jill work together
Meet Jack
Jack gets to know what you are great at, what you want next, and makes sure Jill considers you for the right opportunities.
How does this work?
Jack's an AI agent for job searching and career coaching. He works for you.
Jill is the AI recruiter working for the company. She recruits from Jack's network.
If it's a match and the company wants to meet you, they'll make the intro. In the meantime, if you'd like, Jack will send you excellent alternatives.