Airwallex

Senior Site Reliability Engineer, Spend Foundations

Airwallex4 months ago
Location

SG - Singapore

Type

Full Time

Level

Senior

Role

Site Reliability Engineer

Posted

Nov 13, 2025

Full TimeSenior

The role

Summary

This Senior Site Reliability Engineer role at Airwallex focuses on the Spend Foundations team, responsible for delivering scalable cloud infrastructure and maintaining high-availability systems. The position involves architecting AWS/GCP infrastructure, leading incident response, and embedding with development teams to drive reliability best practices for financial services at global scale.

What you'll do

Cloud Infrastructure Architecture: Architect and implement scalable cloud infrastructure for new services and product roadmap initiatives
Development Team Collaboration: Embed with development teams to drive reliability, performance optimization, and operational readiness
Incident Response Leadership: Lead incident response efforts, manage observability systems, and implement automation across critical production systems
SLO Management: Own team-level Service Level Objectives, maintain comprehensive runbooks, and track DevOps performance metrics
Cross-functional Collaboration: Collaborate with central DevOps and security teams to ensure compliance, resilience, and adherence to best practices
Infrastructure Modernization: Execute complex, high-risk projects including global data center migrations and data pipeline modernization
Capacity Planning: Monitor system performance and plan capacity requirements for global services scaling
Automation Implementation: Develop and maintain automated solutions for deployment, monitoring, and system maintenance

What we look for

Technical

Cloud Platforms ExpertiseDeep expertise in AWS and/or Google Cloud Platform with hands-on experience in service deployment and management
Container OrchestrationAdvanced proficiency in Kubernetes for container management and orchestration
Observability SystemsStrong experience with monitoring, logging, and alerting systems for production environments
Incident ResponseProven track record in incident response, troubleshooting, and system recovery procedures
Infrastructure as CodeExperience with Infrastructure as Code tools like Terraform, CloudFormation, or similar
High Availability SystemsExperience supporting production systems with stringent availability and compliance requirements

Education

Bachelor's DegreeDegree in Computer Science, Engineering, or related technical field

Experience

Senior-Level Experience6+ years in Site Reliability Engineering, DevOps, or infrastructure-focused engineering roles
Cross-functional LeadershipProven ability to lead SRE strategy for large-scale, cross-functional projects
Developer CollaborationStrong experience working closely with development teams and guiding reliability best practices
Fintech Experience (Preferred)Experience in fintech or similarly regulated industries with compliance requirements
Financial Systems (Preferred)Familiarity with data streaming, analytics pipelines, or financial data systems

Skills

Required skills

AWS/GCP ExpertiseDeep hands-on experience with major cloud platforms for infrastructure deployment and management
Kubernetes ProficiencyAdvanced skills in container orchestration and cloud-native application management
Observability ToolsExperience with monitoring, logging, and alerting systems like Prometheus, Grafana, ELK stack
Incident ManagementProven ability to lead incident response and implement effective recovery procedures
Infrastructure AutomationSkills in Infrastructure as Code, CI/CD pipelines, and system automation
System DesignAbility to architect scalable, reliable infrastructure for high-availability systems
Programming SkillsProficiency in languages like Python, Go, or similar for automation and tooling

Nice to have

Fintech Domain KnowledgeUnderstanding of financial services, compliance requirements, and regulatory frameworks
Data Pipeline ExperienceFamiliarity with data streaming technologies, analytics pipelines, and financial data processing
Multi-cloud ArchitectureExperience designing and managing infrastructure across multiple cloud providers
Security Best PracticesKnowledge of security frameworks and compliance requirements in regulated industries
Performance OptimizationAdvanced skills in system performance tuning and capacity planning at scale

Compensation & benefits

Benefits

Global Impact Opportunity

Work on systems serving 200,000+ businesses worldwide including major brands like Brex, Rippling, and SHEIN

Career Growth

Accelerated learning and true ownership opportunities in a fast-growing fintech unicorn valued at $8 billion

Flexible Location

Role can be based in either Singapore or Melbourne with opportunities for global collaboration

Cutting-edge Technology

Work with latest cloud technologies, AI tools, and innovative financial infrastructure

Diverse Team Environment

Join a team of 2,000+ innovative professionals across 26 global offices

Equal Opportunity Employer

Commitment to diversity, inclusion, and accommodation for disabilities or special needs


Interview process

  1. 1
    Initial Screening Phone or video call with recruiter to discuss background, experience, and role alignment
  2. 2
    Technical Assessment Technical interview focusing on cloud architecture, system design, and SRE practices
  3. 3
    System Design Interview Deep dive into designing scalable infrastructure and discussing real-world scenarios
  4. 4
    Behavioral Interview Assessment of cultural fit, leadership experience, and alignment with Airwallex values
  5. 5
    Final Interview Meeting with senior engineering leadership to discuss strategic thinking and team collaboration

Apply for this position

You'll be redirected to the company's application page


Airwallex

Airwallex

View all jobs

Airwallex is a Singapore-based financial technology company specializing in cross-border payments and financial services for businesses.

SingaporeFounded 2015airwallex.com

Tech Stack

Languages
PythonGoBash/Shell
Frameworks
KubernetesTerraformAnsible
Databases
PostgreSQLRedisInfluxDB
Tools
DockerHelmGitLab CI/CDPrometheusGrafanaELK Stack
Other
AWSGoogle Cloud PlatformDatadogPagerDuty
Apply Now