As a core member of the technical staff, you will architect and optimize high-throughput inference systems for large-scale generative models. You will tackle deep technical challenges in distributed systems and hardware-software co-design, directly impacting the latency and scalability of production-grade AI services for a global developer ecosystem.
Member of Technical Staff: AI Systems Engineer at well-funded AI infrastructure startup
Are you passionate about the low-level mechanics that power the world's most advanced AI? This well-funded London startup is looking for an AI Systems Engineer to join their core technical staff. You'll build high-throughput inference engines and solve complex distributed systems challenges at the very edge of what's possible with generative models. If you thrive on GPU optimization and high-performance engineering, this is your chance to shape the future of AI infrastructure from the ground up.
About this role
Role overview
About the company
About the company
Well-funded AI infrastructure startup
What you'll do
What you will do
- Design and implement low-level optimizations for model inference to maximize GPU utilization and minimize token latency.
- Build robust, distributed systems capable of serving frontier models with high reliability and cost-efficiency.
- Collaborate with research teams to integrate novel architectures into production-ready inference engines and serving stacks.
Who you are
Who this is a fit for
- Demonstrates deep expertise in systems programming and optimizing performance-critical software in C++ or Rust.
- Has a proven track record of working with deep learning frameworks and low-level GPU acceleration libraries.
- Possesses a strong understanding of distributed systems and the mechanics of modern large language model architectures.
Why this role
Why this role is remarkable
- Work at the intersection of systems engineering and cutting-edge machine learning research to define the future of model deployment.
- Join an elite technical team backed by top-tier venture capital firms during a period of rapid infrastructure scaling.
- Influence the foundational layer of AI applications by building systems that make massive models commercially viable and performant.
Jack & Jill
How Jack & Jill work together
Meet Jack
Jack gets to know what you're great at and what you want next, then searches 15 million jobs daily and helps you discover roles at companies like this.
How does this work?
Jack’s an AI agent for job searching and career coaching. He works for you.
Jill is the AI recruiter working for the company. She recruits from Jack’s network.
If it’s a match and the company wants to meet you, they’ll make the intro. In the meantime, if you’d like, Jack will send you excellent alternatives.