Apply Now

Senior Applied Scientist - AI Evaluation & Quality Systems

Washington, WA, US • Posted 30+ days ago • Updated 3 hours ago

Full Time

On-site

Fitment

Dice Job Match Score™

👾 Reticulating splines...

Job Details

Skills

Quality Management
Quality Control
Quality Assurance
Fluency
Research
Science
Large Language Models (LLMs)
Prompt Engineering
Use Cases
Generative Artificial Intelligence (AI)
Dynamics
Data Quality
Python
NMS
Computer Science
Machine Learning (ML)
Statistics
Evaluation
Distribution
Communication
Technical Direction
Artificial Intelligence

Summary

Apple Services Engineering (ASE) powers the AI and LLM features behind experiences that hundreds of millions of users love every day. As these systems increasingly rely on human-in-the-loop evaluation, the quality of our products is directly constrained by the quality of our evaluation systems. We believe that to build exceptional AI, you need exceptional mechanisms to validate the signals used to train and evaluate them.

The Human-centered AI, Data Quality Operations team is looking for a Senior Applied Scientist to join our growing team. We are building the systems and methodologies that make AI evaluation trustworthy, and scalable - directly shaping how Apple develops and validates AI across products and services. In this role, you will develop novel, scalable quality control solutions, working closely with cross-functional teams to ensure the data powering our AI/ML systems meets the highest standards of accuracy, consistency, and relevance.\n\nYour work will span two connected problem spaces. The first is the methodology and tooling that generates reliable ground truth and detects quality failures across human annotation and automated evaluation pipelines. The second is the autonomous QA agents that make those methodologies generalizable across teams and use cases. This role demands fluency across research thinking and engineering execution - you will prototype, validate, and ship. A strong point of view on when not to use a model or agent is as valued here as the ability to build one.\n

5+ years of industry experience in applied science or machine learning with demonstrated impact on shipped systems\nStrong hands-on experience with Large Language Models including prompt engineering and applied use cases such as grading, validation, or classification\nStrong working knowledge of evaluation methodology for generative AI, including LLM-as-a-judge design, meta-evaluation, and failure mode analysis\nFamiliarity with human-in-the-loop evaluation systems and the operational dynamics that affect data quality at scale\nHands-on experience designing ground truth generation pipelines across varied task types and annotation modalities\nProficiency in Python and relevant ML frameworks, with production experience building, deploying, and monitoring LLM-based pipelines and agents\nMS or PhD in Computer Science, Machine Learning, Statistics, or a related quantitative field, or equivalent practical experience

PhD in Computer Science, Machine Learning, Statistics, or a related field\nExperience designing agent architectures that are configurable and extensible by practitioners who did not build them\nHands-on experience building anomaly detection systems for evaluation quality, including drift detection, distribution analysis, and systematic bias identification\nStrong communication skills with the ability to influence technical direction across cross-functional teams\nDemonstrated passion for leveraging AI to improve work efficiency and scale

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 90733111
Position Id: 9c34204ab6b4d59cfa79e9a5950d160a
Posted 30+ days ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Washington

•

Today

Apple Services Engineering (ASE) powers AI and LLM features across App Store, Music, Video, and more. As these systems increasingly rely on LLM Judges and automated evaluators to score model performance at scale, the trustworthiness of those evaluation signals becomes mission-critical. We believe that to build exceptional LLMs, you need exceptional mechanisms to validate the signals used to train and evaluate them.\\n\\n As a Principal Applied Scientist on the Human Centered AI team, you will b

Full-time

Measurement Scientist, AI Evaluation Platform

Washington

•

Today

Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or App Store experience we deliver is the result of us making each other's ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it with the world, changing lives for the better. It's the diversity of our people and their thinking that inspires the innovation that runs through everyth

Full-time

ML Research Engineer, AI Evaluation Platform

Washington

•

Today

AI systems are only as trustworthy as the methods used to evaluate them. At Apple, where AI powers experiences for billions of people, getting evaluation right is not a support function-it is a foundational science. Our team, part of Apple Services Engineering, is building that scientific foundation: rigorous, scalable evaluation methodology for LLMs, agentic systems, and human-AI interaction.\\n\\nWhat makes this team unusual is its interdisciplinary core. You will work alongside measurement sc

Full-time

Evaluation & Insights Machine Learning Engineer

Washington

•

Today

Imagine what you could do here. At Apple, great new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish! Are you passionate about music, movies, and the world of Artificial Intelligence and Machine Learning? So are we! Join our Human-Centered AI team for Apple Products. In this role, you'll represent the user perspective on new features, review and analyze d

Full-time

Search all similar jobs

More jobs at Apple, Inc. in Washington, WA

Senior Applied Scientist - AI Evaluation & Quality Systems

Dice Job Match Score™

Job Details

Skills

Summary

Similar Jobs