Machine Learning Engineer - AI & ML Evaluation Frameworks

Cupertino, CA, US • Posted 30+ days ago • Updated 8 hours ago

Full Time

On-site

Job Details

Skills

Analytics
SAFE
Software Architecture
Innovation
Reasoning
Algorithms
Sensors
Fusion
Management
Deep Learning
Python
Continuous Integration
Continuous Delivery
Git
Failure Analysis
Communication
Articulate
Computer Science
Statistics
Evaluation
Prompt Engineering
Data Processing
Apache Spark
Kubernetes
Privacy
Machine Learning (ML)
Artificial Intelligence
Testing

Summary

The Health Sensing Machine Learning Interpretability & Analytics (MLIA) team ensures clinical rigor and contextual trust are at the foundation of Apple's health sensing features. We are looking for an exceptional ML Engineer to help us build the next generation of scalable evaluation infrastructure and lead rigorous investigations into model performance. You will develop cutting-edge tools, synthetic data pipelines, and automated frameworks that ensure our health features are mathematically sound, demographically equitable, and clinically safe. If you are passionate about AI safety, robust software architecture, and pushing the boundaries of ML innovation, come join us!

Description

In this role, you will architect and build large-scale evaluation frameworks to interrogate unimodal ML systems and multi-modal foundation models. Beyond infrastructure, you will lead deep-dive ML evaluations, performing failure analysis to uncover performance gaps, reasoning flaws, and edge cases. You will translate findings into actionable insights and work directly with algorithm teams to improve the safety and reliability of our health features. Your work will empower teams across Apple to rapidly evaluate multi-modal sensor fusion while upholding Apple's privacy standards.

Minimum Qualifications

BS in Computer Science, Machine Learning, Statistics, or related field

3+ years of experience in ML Engineering or Applied ML

Strong experience evaluating supervised and unsupervised models, LLMs, and deep learning models

Proficiency in Python with the ability to write production-grade code (OOP, CI/CD, Git)

Hands-on experience with failure analysis, LLM evaluation, and driving subsequent model improvements

Experience building data pipelines, inference frameworks, and automated evaluation systems

Strong communication skills, with the ability to explain complex technical concepts to both technical and non-technical audiences

Preferred Qualifications

MS/PhD in Computer Science, Machine Learning, Statistics, or related field

Experience evaluating LLMs or agentic systems (e.g., LLM-as-a-judge, RAG evaluation)

Experience with synthetic data generation and prompt engineering

Experience in parallel data processing (Spark, Kubernetes, Airflow) or privacy-preserving ML (Federated Learning)

Background in AI Safety, model interpretability, or adversarial testing

Interest in digital health and clinical rigor

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 90733111
Position Id: f08bb4bac53b468f3a53a3956e047717