Articul8

Senior Software Development Engineer in Test (SDET), Chaos Engineering Specialist - (Dublin, CA)

Location

Dublin, CA (HQ)

Type

Full Time

Salary

USD 160,000 – 220,000

Level

Senior

Role

Senior Software Development Engineer in Test (SDET)

Posted

Jan 5, 2026

The role

Summary

Articul8 AI is seeking a Senior Software Development Engineer in Test (SDET) specializing in Chaos Engineering to strengthen its quality engineering capabilities. The ideal candidate will design advanced test automation frameworks, conduct chaos experiments, and implement robust monitoring solutions to ensure system resilience and reliability in AI-driven distributed environments.

What you'll do

Test Automation Framework Development: Design, develop, and maintain advanced test automation frameworks incorporating chaos engineering principles
Chaos Engineering Experiments: Create and execute comprehensive chaos experiments simulating various failure modes and edge cases in distributed systems
Monitoring and Observability: Implement sophisticated monitoring solutions to track system performance, resilience, and failure recovery with actionable insights
Resilience Integration: Collaborate with development teams to embed resilience into applications and infrastructure from the ground up
Performance Visualization: Develop metrics and dashboards to visualize system reliability and quantify the impact of chaos experiments
Continuous Improvement: Conduct post-mortem analyses to identify and address system weaknesses discovered through chaos testing

What we look for

Technical

Programming Languages: Proficiency in Python, Go, and/or Rust for test automation and chaos engineering implementation
Chaos Engineering Tools: Experience with tools like Chaos Monkey, Gremlin, or equivalent chaos engineering frameworks
Monitoring Systems: Expertise in monitoring platforms such as Prometheus, Grafana, ELK Stack, or similar observability tools
Cloud Platforms: Hands-on experience with AWS, GCP, or Azure cloud platforms and their native monitoring capabilities

Education

Bachelor's Degree: Bachelor's degree in Computer Science, Engineering, or a related technical field
Advanced Degree: Master's degree preferred but not mandatory

Experience

Testing Experience: Minimum 5 years in software testing and quality assurance, with at least 2 years specializing in chaos engineering
System Resilience: Proven experience implementing observability practices in distributed systems
DevOps Practices: Strong background in SRE practices, CI/CD pipeline integration, and container orchestration (Kubernetes)

Skills

Required skills

Chaos Engineering: Deep understanding of chaos engineering principles and implementation strategies
Test Automation: Advanced skills in designing and maintaining comprehensive test automation frameworks
Distributed Systems: Expertise in testing and ensuring reliability of complex, distributed system architectures

Nice to have

AI/ML Testing: Experience with testing challenges specific to AI and machine learning systems
Open Source Contributions: Active contributions to testing or chaos engineering open-source projects
Statistical Analysis: Knowledge of statistical methods for evaluating test results and system performance

Compensation & benefits

Salary

USD 160,000 – 220,000 (annual)

Stock options

Available

Benefits

Health Insurance

Comprehensive medical, dental, and vision coverage

Equity Compensation

Stock options in a growing AI technology company

Professional Development

Annual learning and conference budget, mentorship programs

Flexible Work Arrangements

Remote work options and flexible scheduling

Retirement Planning

401(k) with company matching


Interview process

  1. Initial Screening: Phone or video call with the recruiting team to assess background and initial fit
  2. Technical Assessment: Comprehensive coding and chaos engineering challenge to evaluate technical skills
  3. Technical Interviews: Multiple rounds of interviews with the senior engineering team, focusing on system design and chaos engineering expertise
  4. System Resilience Presentation: Candidate presents a case study or approach to system resilience testing
  5. Final Leadership Interview: Discussion with engineering leadership about long-term vision and potential contributions
