Articul8

Machine Learning Engineer - Data Pipeline

Articul82 weeks ago
Location

Dublin, CA (HQ)

Type

Full Time

Salary

USD 150,000 – 225,000

Level

Senior

Role

Machine Learning Engineer

Posted

Jun 16, 2026

Full TimeSenior

The role

Summary

Articul8 is seeking a Machine Learning Engineer to design and develop sophisticated data processing pipelines for AI model training. The ideal candidate will be responsible for end-to-end data acquisition, processing, and quality improvement, working closely with research and engineering teams to power next-generation domain-specific AI models.

What you'll do

Data Pipeline Development: Design and develop comprehensive data processing pipelines including extraction, filtering, and labeling of diverse data sources
Machine Learning Model Implementation: Develop and implement ML models to enhance data quality and diversity, including quality classifiers and verification models
Data Acquisition Engineering: Lead engineering projects focused on web crawling, data ingestion, and large-scale data processing
Distributed Systems Architecture: Develop and deploy highly scalable distributed systems capable of handling terabytes of data with robust indexing and search capabilities
Cross-Team Collaboration: Work closely with Applied Research, Technology, and Architecture teams to ensure seamless data flow and system operability
Infrastructure Management: Deploy solutions in Kubernetes Infrastructure-as-Code environment and perform routine system maintenance and checks

What we look for

Technical

Deep Learning FrameworksProficiency in at least one deep learning framework, such as PyTorch
Programming LanguagesAdvanced proficiency in Python with ability to write clean, maintainable code
Distributed SystemsStrong expertise in large stateful distributed systems and data processing technologies
Data Processing ToolsFamiliarity with distributed workload technologies like multiprocessing, Ray, Docker, and Kubernetes

Education

Advanced DegreeBS/MS/PhD in Computer Science, Machine Learning, or related technical field

Experience

Machine Learning Project ExperienceProven experience in machine learning projects, particularly in text or vision domains
Data Pipeline DevelopmentDemonstrated ability to build large-scale data processing pipelines and datasets

Skills

Required skills

Python ProgrammingStrong programming skills in Python for machine learning and data processing
Data Pipeline EngineeringExpertise in designing and implementing complex data acquisition and processing systems
Machine Learning Model TrainingExperience in training machine learning models to solve specific problems

Nice to have

GitHub ContributionsActive open-source contributions and public code repositories
Data Crawling ToolsExperience with tools like Scrapy, Selenium, Hadoop, and Datasketch
Multilingual SkillsProficiency in multiple languages to support diverse data collection

Compensation & benefits

Salary

USD 150,000 – 225,000 (annual)

Stock options

Available

Benefits

Health Insurance

Comprehensive medical, dental, and vision coverage

Equity Compensation

Stock options with potential for significant growth in AI startup

Professional Development

Continuous learning opportunities, conference attendance, and skill development programs

Flexible Work Environment

Supportive culture emphasizing diversity, creativity, and personal growth


Interview process

  1. 1
    Initial Screening Resume and background review by recruiting team
  2. 2
    Technical Phone Screen Discussion of machine learning and data engineering experience with senior engineer
  3. 3
    Coding Challenge Take-home project involving data pipeline design and ML model implementation
  4. 4
    Onsite Technical Interviews Multiple rounds covering system design, machine learning concepts, and coding skills
  5. 5
    Final Leadership Interview Meeting with engineering leadership to assess cultural fit and long-term potential

Apply for this position

You'll be redirected to the company's application page