
Site Reliability Engineer
San Francisco
Full Time
USD 150,000 – 200,000
Mid
Site Reliability Engineer
Mar 4, 2026
The role
Summary
LiteLLM is seeking a Site Reliability Engineer (SRE) to ensure the reliability and performance of their AI Gateway platform used by major companies like Adobe and Netflix. The ideal candidate will own critical production infrastructure, addressing complex system challenges in a high-impact, open-source environment.
What you'll do
What we look for
Technical
Education
Experience
Skills
Required skills
Nice to have
Compensation & benefits
USD 150,000 – 200,000 (annual)
Available
Benefits
Open Source Contribution
Opportunity to work on and contribute to popular open-source projects
Cutting-edge AI Technology
Work with advanced AI infrastructure used by leading global companies
Direct Impact
Work closely with CEO and CTO on critical technical challenges
Interview process
- 1Initial Screening — Phone or video call with recruiting team to assess basic qualifications
- 2Technical Interview — In-depth discussion of system reliability, debugging experiences, and technical problem-solving
- 3Systems Design Challenge — Evaluate candidate's approach to complex infrastructure and reliability challenges
- 4Team Fit Interview — Meeting with current engineering team to assess collaboration and cultural alignment
You'll be redirected to the company's application page

LiteLLM
View all jobs
LiteLLM is a platform that provides a unified interface for accessing multiple large language models (LLMs) from different providers. The company offers tools and infrastructure that enable developers and organizations to seamlessly integrate various AI models into their applications while managing costs, performance, and deployment complexity. LiteLLM operates in the artificial intelligence and developer tools market, serving businesses that need flexible access to different language models without being locked into a single provider's ecosystem.