You will join a 4-person founding team to build high-speed AI agents that automate legacy enterprise workflows. Moving beyond slow screenshot-to-VLM loops, you will develop agents that pre-train on interface navigation to achieve 5x faster execution. This role blends cutting-edge research in agentic orchestration with rapid production deployment.
AI/ML Research Engineer at YC W26 enterprise AI startup
Join a 4-person founding team at a YC W26 startup in San Francisco building the world's fastest computer-use agents. While others rely on slow screenshot loops, this team is pre-training agents to understand interfaces upfront, resulting in 5x faster automation for enterprise legacy systems. If you're a PhD or Master's grad with deep experience in VLMs and agentic orchestration who wants to ship production code rather than just papers, this $150k-$350k role offers a massive equity stake and the chance to define the future of AI-driven work.
Overview
Role overview
Company
Generalcatalyst
YC W26 enterprise AI startup building the world's fastest computer-use agents
Responsibilities
What you will do
- Research and implement novel agentic architectures for GUI automation using multi-agent coordination, memory, and context management.
- Build and evaluate reasoning pipelines—including chain-of-thought and reflexion loops—that maintain reliability under distribution shifts in enterprise environments.
- Develop interface pre-training methods and VLM-based screen understanding to enable deterministic execution and self-healing for automated enterprise agents.
Candidate profile
Who this is a fit for
- Early-career researcher (0-4 years) with a Master's or PhD in CS/AI from a top-tier program or a track record at a premier research lab.
- Strong engineering skills in Python, PyTorch, and agentic frameworks like LangGraph or AutoGen, with the ability to move from paper to prototype rapidly.
- Deep curiosity for computer-use agents and GUI understanding, evidenced by top-tier publications (NeurIPS, ICLR, CVPR) or significant production-grade AI projects.
What makes it remarkable
Why this role is remarkable
- Join a Y Combinator W26 company at the ground floor, working directly with founders on the core technology that defines the product's intelligence.
- Solve a massive enterprise bottleneck by building deterministic, self-healing agents that operate complex legacy software without APIs or structured data interfaces.
- High-impact environment where your research in reasoning models and vision-language architectures is shipped to production for real enterprise customers immediately.
Jack & Jill
How Jack & Jill work together
Meet Jack
Jack gets to know what you're great at and what you want next, then searches 14 million jobs daily and introduces you directly to hiring managers.
How does this work?
Jack's an AI agent for job searching and career coaching. He works for you.
Jill is the AI recruiter working for the company. She recruits from Jack's network.
If it's a match and the company wants to meet you, they'll make the intro. In the meantime, if you'd like, Jack will send you excellent alternatives.