As the sole Site Reliability Engineer, you will own the infrastructure and observability for a platform experiencing exponential growth. You will manage a sophisticated AWS/Pulumi stack, optimizing database performance for billion-row datasets and ensuring high availability for real-time chat and streaming features. This is a high-impact role providing full platform autonomy.
Site Reliability Engineer at high-growth social music network
Own the infrastructure behind the world's largest social music network! This high-growth platform connects over a million users through real-time chat and streaming, and they need a sole SRE to take full command of their AWS and Pulumi-based stack. With a £90k+ base and a massive £100k+ equity package, this is a rare opportunity to move into a high-leverage role where you manage everything from PostgreSQL 17 performance to global data pipelines. If you thrive on autonomy and want to scale a product seeing exponential monthly growth, this London-based role is your next big career move.
Role overview
Company
EQUALS
Rapidly scaling social music network with over 1 million users
Responsibilities
What you will do
- Manage and evolve AWS infrastructure using Pulumi (TypeScript), overseeing ECS/Fargate, RDS PostgreSQL 17, and ElastiCache Redis clusters at scale.
- Optimize data pipelines and warehouse performance, handling music catalog ingestion of over 1 billion rows and managing Airbyte and RudderStack integrations.
- Lead incident response and observability, using Datadog APM to tune alerting, reduce noise, and keep systems healthy around the clock for a global user base.
Candidate profile
Who this is a fit for
- Proven experience managing AWS environments (ECS, RDS, S3) using Infrastructure-as-Code tools like Pulumi, Terraform, or AWS CDK.
- Deep technical expertise in PostgreSQL performance tuning, indexing strategies, and Redis scaling for high-concurrency applications and message queues.
- Demonstrated ability to act as a sole infrastructure owner, comfortably managing CI/CD pipelines, production failovers, and complex cloud networking via Cloudflare.
Why this role is remarkable
- Take full ownership of the entire infrastructure stack for the world's largest social music network, moving beyond feature work to core platform engineering.
- Scale a high-traffic consumer product serving over a million users with real-time requirements, including chatrooms, music streaming, and complex recommendation engines.
- Enjoy significant financial upside with a generous £100k+ equity package and a competitive £90k+ base salary in a high-growth environment.