Snowflake

Senior Software Engineer - Production Engineering

Snowflake3 weeks ago
Location

PL-Warsaw

Type

Full Time

Level

Senior

Role

Site Reliability Engineer

Posted

Feb 20, 2026

Full TimeSenior

The role

Summary

Snowflake is seeking a Senior Production Engineer to drive reliability tools and processes for their cloud data platform. This role involves establishing SLOs across engineering teams, building infrastructure for rapid issue detection, and ensuring system health through incident response and automation. The position requires expertise in distributed systems, modern programming languages (preferably Golang), and production engineering practices including on-call responsibilities.

What you'll do

Service Lifecycle Management: Improve the whole lifecycle of services from inception and design through deployment, operation, and refinement
System Scaling & Automation: Scale systems sustainably through automation and participate in changes that improve reliability and velocity
Incident Response Leadership: Establish and practice low noise incident response rotations and conduct blameless postmortems to prevent problem recurrence
Code Development & Review: Write and review code, develop documentation and capacity plans, and debug complex problems on large distributed systems
SLO Collaboration: Collaborate with software engineers to establish, maintain, and optimize functional and performance Service Level Objectives
On-Call Responsibilities: Participate in 24x7 on-call rotation to ensure continuous system reliability and rapid issue resolution
Infrastructure Monitoring: Build and maintain infrastructure necessary for rapid detection of reliability issues across production systems
Release Verification: Deeply engage in system health verification after releases to ensure deployment success and system stability

What we look for

Technical

Programming ProficiencyProficient in at least one modern programming language, preferably Golang
Large Scale Systems6+ years industry experience building and supporting large scale systems in production
Observability ToolsExperience with modern observability tools and production monitoring practices
Container OrchestrationExperience with containers and container orchestration systems such as Kubernetes
Infrastructure ManagementExperience deploying, managing, and operating scalable and fault tolerant infrastructure
Cloud PlatformsHands-on experience with one or more public cloud providers (AWS, Azure, or GCP)

Education

Bachelor's DegreeBachelor's degree in Computer Science, related technical field involving software engineering, or equivalent practical experience

Experience

Production Engineering Experience6+ years industry experience building and supporting large scale systems in production environment
System Problem SolvingSystematic problem-solving methods and effective communication skills for complex technical challenges

Skills

Required skills

Golang ProgrammingProficiency in Golang for production engineering and system development
Distributed SystemsDeep understanding of large-scale distributed systems architecture and operations
Problem SolvingSystematic problem-solving methods for complex technical challenges
CommunicationEffective communication skills for cross-team collaboration and incident response
Incident ResponseExperience with on-call rotations and incident management processes

Nice to have

Production MonitoringExperience with modern observability tools and production monitoring practices
KubernetesContainer orchestration experience for managing distributed applications
Cloud PlatformsHands-on experience with AWS, Azure, or Google Cloud Platform
Infrastructure as CodeExperience with tools like Terraform for scalable infrastructure management
SLO ManagementExperience establishing and maintaining Service Level Objectives across engineering teams

Compensation & benefits

Benefits

Career Growth

Opportunity to build career at a fast-growing cloud computing company with innovative technology

Impact-Driven Culture

Work in a culture focused on impact, innovation, and collaboration

Professional Development

Opportunity to work with cutting-edge technology and advance skills in production engineering

Global Reach

Work on systems that serve enterprise customers worldwide with top-tier reliability


Interview process

  1. 1
    Initial Screening Phone or video call with recruiter to discuss background and role fit
  2. 2
    Technical Phone Screen Technical discussion covering systems design, programming concepts, and production engineering experience
  3. 3
    System Design Interview Deep dive into distributed systems architecture, scalability, and reliability patterns
  4. 4
    Coding Interview Programming assessment focusing on algorithms, data structures, and Golang proficiency
  5. 5
    Production Engineering Deep Dive Detailed discussion of incident response, monitoring, SLOs, and production operations experience
  6. 6
    Team Fit Interview Cultural fit assessment and discussion of collaboration style with engineering teams
  7. 7
    Final Interview Leadership discussion covering career goals, technical vision, and alignment with Snowflake values

Apply for this position

You'll be redirected to the company's application page