UiPath

Principal Site Reliability Engineer

UiPath1 weeks ago
Location

Bangalore - Engineering

Type

Full Time

Salary

USD 250,000 – 350,000

Level

Principal

Role

Site Reliability Engineer

Posted

Apr 1, 2026

Full TimePrincipal

The role

Summary

UiPath is seeking a Principal Site Reliability Engineer to revolutionize reliability engineering through AI-driven approaches. The role focuses on building intelligent reliability platforms that leverage AI/ML to improve service reliability, reduce operational overhead, and accelerate incident response in large-scale, cloud-native environments.

What you'll do

Intelligent Automation: Design and implement self-healing mechanisms including automated remediation workflows and intelligent retry and fallback strategies for cloud-native systems.
Reliability Platform Development: Build internal systems that enable engineering teams to debug faster using AI-assisted tooling and proactively identify and mitigate reliability risks.
Reliability Strategy: Define and evolve reliability strategy using predictive reliability models including capacity planning, failure forecasting, and reliability scoring across engineering teams.
AI-Powered Incident Management: Develop AI-powered systems that determine incident impact and use historical data to improve detection and response mechanisms over time.
Technical Leadership: Influence standards for AI-driven tooling, mentor engineers, and elevate reliability focus across the organization.

What we look for

Technical

Cloud InfrastructureHands-on experience with major cloud providers (Azure, AWS, GCP), including networking, deployments, and scaling expertise.
Programming LanguagesProficiency in at least one programming language such as Python, Go, or equivalent.
Infrastructure as CodeExperience with tools like Terraform, Pulumi, and container orchestration platforms like Kubernetes.

Education

Advanced DegreeBachelor's or Master's degree in Computer Science, Software Engineering, or related technical field preferred.

Experience

Site Reliability Engineering7+ years of experience in SRE, Platform, or Cloud infrastructure engineering roles with a proven track record of building internal reliability tooling.
Distributed SystemsStrong conceptual understanding of distributed systems, performance bottlenecks, failure modes, and system trade-offs.

Skills

Required skills

AI/ML OperationsExperience building applications using LLMs to automate complex workflows and AI-driven operational tools.
ObservabilityProven experience with monitoring and observability stacks including metrics, logs, and distributed tracing.
Incident ResponseExpertise in conducting blameless postmortems and implementing systemic reliability improvements.

Nice to have

ML FrameworksExperience with PyTorch, vLLM, or equivalent ML frameworks in production environments.
Predictive ReliabilityBackground in developing predictive reliability models and intelligent system design.

Compensation & benefits

Salary

USD 250,000 – 350,000 (annual)

Stock options

Available

Benefits

Health Insurance

Comprehensive medical, dental, and vision coverage

Retirement Planning

401(k) with company matching

Professional Development

Annual learning and conference budget, internal training programs

Work Flexibility

Hybrid work model with options for remote and office-based work

Stock Options

Equity compensation package for eligible employees


Interview process

  1. 1
    Initial Screening Phone or video call with recruitment team to assess basic qualifications and background
  2. 2
    Technical Phone Interview Detailed discussion of technical experience, SRE background, and AI/ML approach to reliability engineering
  3. 3
    Technical Challenge Take-home or live coding exercise focusing on distributed systems, infrastructure automation, and AI-driven reliability solutions
  4. 4
    Onsite/Virtual Interviews Multiple rounds with SRE leadership, technical teams, and system design discussions covering reliability architecture and AI integration
  5. 5
    Final Leadership Interview Discussion with senior technical leadership to assess organizational impact and strategic thinking

Apply for this position

You'll be redirected to the company's application page