You will lead work on the quality and coverage of the data powering next-generation foundation models. As the in-house expert on global datasets, you'll ensure exceptional performance across dozens of languages. You will bridge the gap between research and production by building scalable systems to curate, evaluate, and steer massive multilingual data collections.
This job is no longer actively hiring.
Research Engineer, Data at Fast-growing generative AI startup
Are you passionate about the data-centric side of AI? Join a world-class team of researchers from top labs at this fast-growing generative AI startup in San Francisco. You will own the data strategy for next-generation foundation models, building the multilingual datasets and evaluation systems that define how intelligence is scaled globally. This is a rare opportunity to work at the intersection of cutting-edge SSM research and production-level systems while receiving competitive compensation and equity in a well-funded venture.
Overview
Role overview
Company
About the company
Fast-growing generative AI startup
Responsibilities
What you will do
- Design and build large-scale multilingual datasets and run controlled experiments to measure their impact on model behavior.
- Develop automated quality-control systems and speech model evaluations that combine manual annotation with automated metrics.
- Implement advanced steering techniques to improve model intelligence through data and mitigate bias in generative outputs.
Candidate profile
Who this is a fit for
- Proven experience building or working with large-scale multilingual datasets for generative models, such as speech or text models.
- Strong applied machine learning background with a specific focus on data-centric approaches and scalable system building.
- Demonstrated ability to guide human annotation processes and define evaluation metrics across multiple languages and cultures.
What makes it remarkable
Why this role is remarkable
- Work at the frontier of model architecture innovation alongside founding experts from world-class AI labs.
- Join a well-funded team backed by top-tier VCs and industry-leading AI advisors during a high-growth phase.
- Directly influence the intelligence and inclusivity of global-scale models used for audio, video, and text processing.
Jack & Jill
How Jack & Jill work together
Meet Jack
Jack gets to know what you're great at and what you want next, then searches 14 million jobs daily and introduces you directly to hiring managers.
How does this work?
Jack's an AI agent for job searching and career coaching. He works for you.
Jill is the AI recruiter working for the company. She recruits from Jack's network.
If it's a match and the company wants to meet you, they'll make the intro. In the meantime, if you'd like, Jack will send you excellent alternatives.