As a Machine Learning Engineer at CTGT, Inc., you will work deep within the model stack to make generative AI deterministic and reliable. You’ll apply mechanistic interpretability techniques to model weights and activations, building the policy engine that enforces real-time governance for high-stakes enterprise applications at the edge of AI research.
Machine Learning Engineer: LLM Interpretability & Systems at CTGT, Inc.
Join CTGT, Inc., a Stanford-born startup backed by Gradient Ventures and Y Combinator that is building the deterministic governance layer for enterprise AI. As a Machine Learning Engineer, you will dive deep into model internals using mechanistic interpretability and activation patching to make LLMs reliable for the Fortune 500. This is a rare opportunity to bridge frontier AI research with real-world production systems, working alongside a world-class team in San Francisco to solve the industry’s most pressing non-deterministic risk challenges.
About this role
Role overview
About the company
Despite massive investment in commercial AI, organizations often find that demonstrated value is elusive, primarily due to the non-deterministic risk inherent to generative models. CTGT is the deterministic governance layer that enables the most important global institutions to deploy AI workflows with confidence.
Born out of Stanford University research, we provide the control plane that makes it possible. A lightweight, model-agnostic system that enforces policy, prevents drift, and produces auditable decisions in real time. When benchmarked on HaluEval, the CTGT Policy Engine (paired with GPT-120B OSS) outperformed frontier models (Gemini 3 Pro Preview, Claude 4.5 Opus and 4.5 Sonnet) at drastically lower compute cost.
While we sit on the edge of AI research, CTGT brings frontier intelligence into real-world environments. We apply cutting-edge theory directly in production to make large language models more reliable, controllable, and performant in practice.
Our mission is to bring models to the level of performance and accountability required by the Fortune 500. By bridging the gap between LLM capabilities and domain-specific requirements, we unlock the true potential of generative AI to solve the most pressing problems in our world today.
What you'll do
What you will do
- Operationalize mechanistic interpretability research into production-ready code that improves model behavior through direct internal interventions.
- Design and optimize feature-level intervention systems that enable real-time, auditable policy enforcement during model inference.
- Build robust evaluation and deployment loops to ensure model changes are reliably shipped into complex enterprise VPC environments.
Who you are
Who this is a fit for
- Possesses deep expertise in Transformer architectures, PyTorch internals, and the mathematical foundations required for advanced deep learning optimization.
- Demonstrates a proven track record of training or fine-tuning models beyond simple augmentation, specifically probing the mechanics of model cognition.
- Exhibits strong technical ownership with the ability to translate academic papers into high-performance Python, Rust, or TypeScript implementations.
Why this role
Why this role is remarkable
- Bridge the gap between frontier AI research and real-world production by implementing mechanistic interpretability techniques like activation patching and control vectors.
- Join a high-growth team backed by elite investors including Google’s Gradient Ventures, General Catalyst, and Y Combinator with significant equity upside.
- Develop the core deterministic governance layer required by the world's largest institutions to safely deploy generative AI at scale.
Jack & Jill
How Jack & Jill work together
Meet Jack
Jack gets to know what you're great at and what you want next, then searches 15 million jobs daily and helps you discover roles at companies like this.
How does this work?
Jack’s an AI agent for job searching and career coaching. He works for you.
Jill is the AI recruiter working for the company. She recruits from Jack’s network.
If it’s a match and the company wants to meet you, they’ll make the intro. In the meantime, if you’d like, Jack will send you excellent alternatives.