You will lead the development of foundation models for robotics, moving beyond hand-engineered controllers to create general-purpose autonomous policies. By applying advanced post-training techniques to video diffusion models, you will help build systems that generalize across tasks and environments. This role is ideal for hands-on researchers who ship production-grade code.
LLM Researcher at well-funded seed-stage robotics AI startup
Are you a hands-on researcher with deep experience in post-training LLMs? A well-funded seed-stage robotics startup in San Francisco is looking for early-career, hungry talent to build the foundation model layer for humanoid autonomy. Instead of brittle, hand-coded scripts, you'll help train general-purpose policies from video data that can fold laundry or assemble furniture with zero-shot generalization. With backing from top-tier AI pioneers and significant equity on the table, this is a rare chance to shape the future of physical AI. If you ship code as fast as you run experiments, this high-impact role is for you.
About this role
Role overview
About the company
About the company
Well-funded seed-stage robotics AI startup
What you'll do
What you will do
- Execute end-to-end post-training workflows including SFT, RLHF, and DPO to align robot policy behaviors.
- Develop and optimize scalable training pipelines for large-scale video diffusion and foundation models.
- Rapidly iterate on model architectures and training infrastructure to improve zero-shot generalization in new environments.
Who you are
Who this is a fit for
- Extensive hands-on experience in post-training LLMs, specifically focused on alignment, reasoning, or multi-modal learning.
- Prolific coding ability with a track record of shipping research code, open-source contributions, or production ML systems.
- Hungry, early-career researcher excited by the prospect of working in-person in San Francisco within a fast-moving, high-stakes startup.
Why this role
Why this role is remarkable
- Work at the cutting edge of physical AI by applying LLM-style scaling and generalization to humanoid robot autonomy.
- Backed by world-class AI researchers and top-tier venture capital firms at the earliest, highest-impact stage of growth.
- High-intensity, high-ownership environment where your research directly dictates the capabilities of next-generation robotic hardware.
Jack & Jill
How Jack & Jill work together
Meet Jack
Jack gets to know what you're great at and what you want next, then searches 15 million jobs daily and helps you discover roles at companies like this.
How does this work?
Jack’s an AI agent for job searching and career coaching. He works for you.
Jill is the AI recruiter working for the company. She recruits from Jack’s network.
If it’s a match and the company wants to meet you, they’ll make the intro. In the meantime, if you’d like, Jack will send you excellent alternatives.