Role overview

As a Principal Machine Learning Engineer, you will co-define Speechmatics’ technical vision while remaining hands-on in the Modelling Team. You will bridge the gap between cutting-edge research and production systems, developing next-generation transformer models and optimizing inference for global scale. This is a high-ownership role focused on shipping real-world impact.

Speechmatics

View profile

Software

Speechmatics is a technology company specializing in artificial intelligence-driven speech recognition and voice AI infrastructure. It provides automatic speech recognition (ASR), speech-to-text (STT), text-to-speech (TTS), translation, summarization, sentiment analysis, and topic detection services, supporting over 50 languages and dialects with high accuracy across accents, noisy environments, and multi-speaker scenarios. The company serves enterprises, developers, and partners in industries such as media & broadcast, contact centers, healthcare, education, legal, finance, and AI infrastructure, enabling real-time and batch transcription for applications like captioning, compliance, and voice agents.[1][5]

What you will do

Develop and deploy advanced ML models using Python and PyTorch, translating research into scalable, maintainable production services and services.
Optimise large-scale model inference using strategies like dynamic batching, flash attention, and speculative decoding to improve speed and cost efficiency.
Define and enforce best practices for model lifecycle management, data quality, and evaluations across the entire ML stack.

Who this is a fit for

Deep expertise in building and shipping production-grade ML systems, with a strong foundation in modern transformer architectures and self-supervised learning.
Proven track record in distributed training and optimizing inference at scale, bridging the gap between research models and production-ready code.
Expert proficiency in Python and ML frameworks such as PyTorch, complemented by experience in MLOps, CI/CD pipelines, and containerization.

Why this role is remarkable

Lead the technical direction of a $62M Series B scale-up that recently saw 4x growth in real-time usage and serves global customers.
Work at the rare intersection of high-rigor ML research and large-scale production, shipping models that solve real-world problems like accents and noise.
Take full ownership of technical domains, raising the bar for a talented team of 10+ engineers while developing groundbreaking bilingual and medical models.

How Jack & Jill work together

I get to know what you’re great at, then find roles you’d never find yourself.Ok, I'll go first. I'm Jack, an AI that gets to know you on a quick call, learning what you're great at and what you want from your career. Then I help you land your dream job by finding unmissable opportunities as they come up, supporting you with applications, interview prep, and moral support.

I recruit from Jack’s network and make the intro when I spot a great match.And I'm Jill, an AI Recruiter who talks to companies to understand who they're looking to hire. Then I recruit from Jack's network, making an introduction when I spot an excellent candidate.

Principal Machine Learning Engineer at Speechmatics