Zip

Software Engineer, Core Infrastructure (Mid-Senior level)

Zip3 days ago
Location

Toronto

Type

Full Time

Salary

CAD 160,000 – 220,000

Level

Mid

Role

Infrastructure Engineer

Posted

Jun 26, 2026

Full TimeMid

The role

Summary

Join Zip's Core Infrastructure team as a Software Engineer to lead the design, build, and operation of global multi-region, highly scalable infrastructure systems. This mid-senior level role offers end-to-end ownership of critical infrastructure components including Kubernetes platforms, AI infrastructure, observability, and deployment pipelines while collaborating with world-class engineers from Apple, Airbnb, and Meta to support enterprise procurement innovation at scale.

What you'll do

Multi-Region Architecture Development: Design, build, and operate global multi-cell architectures that enable Zip's enterprise platform to scale across regions while maintaining performance, reliability, and data sovereignty requirements for Fortune 500 clients.
Infrastructure Component Ownership: Assume end-to-end ownership of one or more core infrastructure systems including Kubernetes platform management, AI infrastructure provisioning, observability stacks, deployment pipelines, and cost optimization initiatives that directly impact platform reliability and operational efficiency.
Scalable System Design and Implementation: Champion technical excellence by leading architectural design, implementation, and operational excellence of highly scalable systems that support Zip's $500+ billion annual spend processing across thousands of enterprise customers.
Cross-Functional Engineering Collaboration: Partner strategically with product engineering, platform, and data teams across multiple time zones to design infrastructure solutions that accelerate Zip's product roadmap and enable rapid feature deployment without compromising system stability.
Platform Reliability and Operations: Establish and maintain SLOs, incident response procedures, and operational runbooks for critical infrastructure components; conduct post-incident reviews and implement preventative measures to continuously improve platform availability and performance.
Infrastructure Modernization and Optimization: Evaluate, adopt, and optimize emerging infrastructure technologies and patterns; lead initiatives to improve deployment velocity, reduce operational overhead, and optimize cloud infrastructure costs across the organization.

What we look for

Technical

Large-Scale Distributed SystemsProven experience designing, building, and operating distributed systems at scale, with deep understanding of consistency models, fault tolerance, load balancing, and multi-region deployment patterns.
Container Orchestration and KubernetesAdvanced proficiency in Kubernetes administration, including cluster provisioning, networking, persistent storage, security policies, and operational management at production scale.
Cloud Infrastructure PlatformsHands-on expertise with major cloud providers (AWS preferred based on company tech stack) including infrastructure-as-code, networking, compute optimization, and cost management strategies.
Infrastructure Automation and DevOpsStrong experience with infrastructure-as-code tools (Terraform, CloudFormation), CI/CD pipeline design, deployment automation, and configuration management in production environments.
Observability and MonitoringDeep expertise with observability platforms (Datadog preferred), including metrics collection, log aggregation, distributed tracing, alerting strategies, and performance optimization.
Database Systems and Data InfrastructureStrong understanding of relational and non-relational databases, data pipelines, and query optimization; experience with migration strategies and multi-region data replication patterns.

Education

Bachelor's Degree in Computer Science or Related FieldBachelor's degree in Computer Science, Software Engineering, Physics, Mathematics, or equivalent technical discipline; advanced degree beneficial but not required.

Experience

4+ Years Software Engineering ExperienceMinimum 4 years of professional software engineering experience with demonstrated progression in scope and complexity, preferably including infrastructure, backend systems, or platform engineering focus.
Large-Scale Platform OperationsProven track record of independently building, deploying, and operating large-scale platforms or infrastructure systems in production environments supporting millions of users or processing significant data volumes.
Multi-Team Collaboration and CommunicationDemonstrated ability to communicate technical concepts effectively across diverse stakeholder groups, including non-technical audiences; proven success collaborating with distributed teams across multiple time zones.
Architectural Decision MakingExperience leading architectural decisions, evaluating technology trade-offs, and making sound technical judgments that balance performance, scalability, maintainability, and business requirements.

Skills

Required skills

Kubernetes and Container OrchestrationProduction-level expertise in Kubernetes cluster design, management, and optimization including networking, storage, security, and RBAC policies.
AWS Cloud ServicesStrong proficiency with AWS services including EC2, RDS, S3, VPC, IAM, CloudFormation, and other core infrastructure services used in enterprise deployments.
Infrastructure-as-Code (IaC)Advanced experience with Terraform, CloudFormation, or similar IaC tools for automating infrastructure provisioning, versioning, and management.
Distributed Systems DesignStrong grasp of distributed system principles including consensus algorithms, replication strategies, failure modes, and patterns for building resilient infrastructure.
Systems Programming and Performance OptimizationProficiency with systems-level programming, performance profiling, bottleneck identification, and optimization techniques for high-throughput systems.
Backend Programming LanguagesStrong proficiency in Python, Go, Java, or similar languages commonly used in infrastructure and backend systems development.

Nice to have

Datadog Observability PlatformHands-on experience implementing and optimizing Datadog for metrics collection, log aggregation, distributed tracing, and complex alerting in production environments.
Message Queue and Stream ProcessingExperience with distributed message systems like Celery, Apache Kafka, RabbitMQ, or Redis; understanding of event-driven architecture patterns and async processing.
AI/ML InfrastructureExposure to AI infrastructure requirements including GPU cluster management, model serving frameworks, training pipeline orchestration, and cost optimization for ML workloads.
Multi-Region and High-Availability ArchitectureDemonstrated success designing and operating multi-region deployments, cross-region failover mechanisms, and disaster recovery strategies for mission-critical systems.
Open Source ContributionActive involvement in open source projects, particularly infrastructure or DevOps tools, demonstrating commitment to community and technical depth.
Financial or Enterprise SaaS SystemsPrior experience building or operating infrastructure for financial technology, enterprise software, or high-compliance environments where security and reliability are paramount.
Cost Optimization and FinOpsTrack record of optimizing cloud infrastructure costs, implementing resource scheduling, and driving FinOps practices across engineering organizations.

Compensation & benefits

Salary

CAD 160,000 – 220,000 (annual)

Stock options

Available

Benefits

Equity Compensation

Competitive startup equity package providing long-term value participation as Zip scales its enterprise platform globally.

Comprehensive Health Coverage

100% coverage options for health, vision, and dental insurance with multiple plan choices and family coverage options.

On-Campus Meals

Catered breakfast, lunch, and dinner daily at Zip's offices to support employee wellness and team collaboration.

Flexible PTO Policy

Unlimited flexible paid time off allowing employees to balance work and personal commitments without rigid accrual limits.

Wellness Benefits

ClassPass membership providing access to thousands of fitness, wellness, and mental health activities and studios.

Commuter and Transportation Benefits

Monthly commuter benefits supporting sustainable and convenient transportation options to and from the office.

Team Culture and Events

Regular team building events, happy hours, and social gatherings fostering collaboration and company culture.

Remote Work Support

Home office stipend for equipment and setup, plus phone and internet reimbursement to support distributed work quality.

Hybrid Work Flexibility

Hybrid work model with 5 flexible remote days per quarter, allowing balanced in-office collaboration and remote productivity.

Family Support Benefits

Paid parental leave and fertility benefits supporting employees at all life stages and family planning needs.

Employee Assistance Program (EAP)

Comprehensive employee assistance program providing confidential counseling, mental health support, and personal resources.

AI Tool Access

Unlimited AI token usage providing access to cutting-edge AI capabilities and tools for productivity and innovation.


Interview process

  1. 1
    Initial Screening Brief conversation with Zip's recruiting team to discuss your background, infrastructure engineering experience, and interest in scaling enterprise procurement systems.
  2. 2
    Technical Interview - Systems Design Deep dive into distributed systems architecture and infrastructure design challenges. Expect discussions on multi-region deployment patterns, Kubernetes optimization, and scalability trade-offs relevant to processing $500+ billion in enterprise procurement spend.
  3. 3
    Technical Interview - Infrastructure Implementation Practical assessment of your infrastructure-as-code, automation, and hands-on experience with AWS, Kubernetes, or related technologies. May involve reviewing past projects or whiteboarding infrastructure solutions.
  4. 4
    Infrastructure Leadership and Collaboration Conversation with Core Infrastructure team members and potential collaborators focused on your approach to technical decision-making, cross-functional communication, and building resilient systems in distributed teams.
  5. 5
    Team and Cultural Fit Discussion Final conversation with engineering leadership exploring your alignment with Zip's values of ownership, open communication, underdog mindset, and commitment to driving innovation in enterprise software.

Apply for this position

You'll be redirected to the company's application page


Zip

Zip

View all jobs

Zip provides an intake-to-pay platform designed to streamline procurement processes, automate approvals, and improve visibility and control for organizations.

San Francisco, CA, USAFounded 2019ziphq.com

Tech Stack

Languages
PythonGoSQLBash/Shell
Frameworks
KubernetesCeleryDBOS
Databases
PostgreSQLRedisElasticsearch
Tools
TerraformDatadogDockerJenkins or GitHub ActionsArgoCDPrometheus
Other
AWS Cloud PlatformDoclingGit Version ControlHelm Package ManagerLinux System Administration
Apply Now