Tabs

Staff Site Reliability Engineer

Location

New York City, NY

Type

Full Time

Salary

USD 200,000 – 250,000

Level

Staff

Role

Site Reliability Engineer

Posted

Feb 24, 2026

The role

Summary

Tabs is seeking a Staff Site Reliability Engineer to lead infrastructure evolution and platform reliability for its AI-native revenue platform. The ideal candidate is a senior technical leader who can design scalable systems, improve observability, and drive operational excellence at a high-growth technology startup.

What you'll do

Infrastructure Management: Lead AWS infrastructure direction and platform evolution, including migration from ECS/Fargate to more scalable runtime environments
DevOps Excellence: Enhance CI/CD systems with a focus on developer experience, safety, and advanced automation
Reliability Engineering: Define and evolve reliability standards, including SLIs, SLOs, and error budget management
Incident Response: Manage high-severity incidents, conduct thorough postmortems, and drive actionable improvements
System Design: Partner with engineering teams to design resilient, scalable, and observable distributed systems

What we look for

Technical

Cloud Infrastructure: Extensive experience managing production systems on AWS with platform-level change leadership
Programming Languages: Strong software engineering skills in modern programming languages
Distributed Systems: Proven expertise operating distributed systems at production scale

Education

Technical Degree: Bachelor's or Master's degree in Computer Science, Software Engineering, or related technical field preferred

Experience

Professional Experience: 10+ years in SRE, infrastructure, or backend engineering roles
Systems Thinking: Demonstrated ability to analyze systems holistically, considering risk, rollback strategies, and feedback loops

Skills

Required skills

AWS: Advanced cloud infrastructure management and architecture skills
CI/CD: Expertise in continuous integration and deployment systems
Observability: Deep understanding of monitoring, logging, and tracing technologies

Nice to have

Incident Management: Experience with structured incident response and postmortem methodologies
System Design: Strong skills in designing scalable, resilient distributed systems

Compensation & benefits

Salary

USD 200,000 – 250,000 (annual)

Stock options

Available

Benefits

Healthcare

100% employer-covered monthly healthcare premium including medical, dental, and vision

Equity

Competitive compensation package with stock options

Time Off

Unlimited PTO and up to 12 weeks of parental leave

Retirement

401(k) retirement savings plan

Wellness

Free One Medical Membership and Employee Assistance Program


Interview process

  1. Initial Screening: Phone or video call with recruiter to discuss background and role fit
  2. Technical Assessment: Comprehensive technical evaluation of SRE and systems design skills
  3. Onsite/Virtual Interviews: Multiple interview rounds covering technical expertise, system design, and cultural alignment
  4. Final Interview: Meeting with engineering leadership to discuss role expectations and team integration
