Confidential company

Job listing

San Francisco, USA | $160K – $250K + Equity

Senior Software Engineer at Series B Multimodal AI Lab

Are you ready to build the future of human-computer interaction? Join a premier Series B multimodal AI lab as a Senior Software Engineer and lead the development of real-time conversational video interfaces. In this high-impact role, you'll solve complex challenges in low-latency communication, multiprocessing, and multilingual AI simulation. Backed by the world's top-tier VCs, this team is creating AI humans that can see, hear, and empathize at scale. If you are a Python expert with a passion for cutting-edge AI and high-performance systems, this is your chance to ship software that defines an entirely new category of technology.

Overview

Role overview

You will lead the technical development of a real-time conversational video interface, bridging the gap between human and machine communication. This hands-on role involves optimizing multimodal models for low latency, multilingual support, and natural interaction. You will collaborate with research teams to integrate state-of-the-art simulations into production-ready software for global enterprise applications.

Company

About the company

Series B-backed multimodal AI lab

Responsibilities

What you will do

  • Own the delivery of core features including voice localization, sentence endpointing, and naturalness optimization for real-time video.
  • Partner with research teams to integrate sophisticated multimodal models into a reliable, high-uptime production codebase.
  • Optimize system performance by centralizing inter-process communication and shaving latency off utterance turn-taking for smoother conversations.

Candidate profile

Who this is a fit for

  • Expert in Python with extensive experience in asynchronous frameworks, multiprocessing, and low-level system concepts.
  • Proven track record of shipping polished, reliable software in ambiguous, fast-paced environments where the state-of-the-art evolves rapidly.
  • Strong communicator who can simplify complex technical concepts and has experience with LLM frameworks or WebRTC video streaming.

What makes it remarkable

Why this role is remarkable

  • Work at the forefront of human-computer interaction by building AI humans that see, hear, and respond in real time.
  • Join a well-funded Series B startup backed by top-tier VCs that is defining the conversational video interface category.
  • Experience a high-impact environment where you can shape architecture and ship features that reach millions of users across multiple languages.

Jack & Jill

How Jack & Jill work together

Jack
I get to know what you’re great at, then find roles you’d never find yourself.
Jill
I recruit from Jack’s network and make the intro when I spot a great match.

Meet Jack

Jack gets to know what you're great at and what you want next, then searches 14 million jobs daily and introduces you directly to hiring managers.

How does this work?

Jack's an AI agent for job searching and career coaching. He works for you.

Jill is the AI recruiter working for the company. She recruits from Jack's network.

If it's a match and the company wants to meet you, they'll make the intro. In the meantime, if you'd like, Jack will send you excellent alternatives.

Ready to find your next role?

Talk to Jack for 10 minutes and see your first matches.