Snowflake

Staff Software Engineer - Production Engineering

Snowflake, 3 weeks ago
Location

PL-Warsaw

Type

Full Time

Level

Staff

Role

Staff Engineer

Posted

Feb 18, 2026

The role

Summary

Snowflake seeks a Staff Software Engineer for its Production Engineering team in Warsaw to lead reliability initiatives across large-scale distributed systems. The role focuses on building tools and processes that ensure exceptional customer experiences through SLO management, automation, and incident response. The position requires 10+ years of experience with production systems, Golang proficiency, and expertise in cloud infrastructure and monitoring tools.

What you'll do

Service Lifecycle Leadership: Lead the improvement of the whole lifecycle of services from inception and design through deployment, operation, and refinement
Automation and Scaling: Scale systems sustainably through automation and implement changes that improve reliability and velocity
Incident Response Management: Establish and practice low-noise incident response rotations and conduct blameless postmortems to prevent problem recurrence
Code Development and Review: Write and review code, develop documentation and capacity plans, and debug the hardest problems on large distributed systems
SLO Collaboration: Collaborate with software engineers to establish, maintain, and optimize functional and performance Service Level Objectives
On-Call Operations: Participate in a 24x7 on-call rotation to ensure continuous system availability and rapid incident response

What we look for

Technical

Programming Proficiency: Proficient in at least one modern programming language, preferably Golang
Problem-Solving Skills: Systematic problem-solving methods and effective communication skills
Large-Scale Systems: 10+ years of industry experience designing, building, and supporting large-scale systems in production
Observability Tools: Experience with modern observability tools and production monitoring practices
Performance Testing: Experience with capacity and load testing of distributed applications

Education

Bachelor's Degree: Bachelor's degree in Computer Science, a related technical field involving software engineering, or equivalent practical experience

Experience

Production Systems: 10+ years of industry experience designing, building, and supporting large-scale systems in production
Container Orchestration: Experience with containers and container orchestration systems such as Kubernetes
Linux Infrastructure: Experience deploying, managing, and operating scalable and fault-tolerant Linux infrastructure
SLO Management: Experience with SLO-driven reliability management processes
Cloud Platforms: Hands-on experience with one or more public cloud providers (AWS, Azure, or GCP)

Skills

Required skills

Golang Programming: Proficiency in Golang for building production engineering systems
Distributed Systems: Experience with large-scale distributed systems architecture and operations
Problem Solving: Systematic approach to troubleshooting and resolving complex technical issues
Communication Skills: Effective verbal and written communication for cross-team collaboration

Nice to have

Kubernetes: Container orchestration and management in production environments
Cloud Infrastructure: Multi-cloud experience with AWS, Azure, or Google Cloud Platform
Monitoring Tools: Modern observability and monitoring tools like Prometheus, Grafana, or Datadog
SLO Management: Service Level Objective definition, measurement, and optimization
Linux Administration: Production Linux infrastructure deployment and management
Load Testing: Capacity planning and performance testing of distributed applications

Compensation & benefits

Benefits

Professional Growth

Opportunity to work at a fast-growing cloud computing company with significant career advancement potential

Innovation Culture

Culture focused on impact, innovation, and collaboration with cutting-edge technology

Learning Opportunities

Access to the latest production engineering tools and practices in a cloud-first environment

Global Impact

Work on systems that serve enterprise customers worldwide with critical data infrastructure


Interview process

  1. Initial Screening: Phone or video call with recruiter to discuss background and role fit
  2. Technical Phone Screen: 45-60 minute technical discussion covering system design and production engineering concepts
  3. System Design Interview: Deep dive into distributed systems architecture and reliability patterns
  4. Coding Assessment: Live coding session focusing on Golang and problem-solving approach
  5. Production Engineering Deep Dive: Discussion of SLOs, monitoring, incident response, and real-world production scenarios
  6. Cultural Fit Interview: Behavioral interview assessing collaboration, leadership, and alignment with Snowflake values
  7. Final Round: Panel interview with senior engineering leaders and potential team members
