Sr Machine Learning Engineer, Tech Lead - Autograder Systems, Evaluation

Cupertino, CA, US • Posted 30+ days ago • Updated 6 hours ago

Full Time

On-site

Job Details

Skills

Generative Artificial Intelligence (AI)
Art
Systems Engineering
Roadmaps
Research
Optimization
Failure Analysis
Mentorship
Design Review
Modeling
Conflict Resolution
Problem Solving
Collaboration
Leadership
Computer Science
Artificial Intelligence
FOCUS
Estimating
Training
Python
PyTorch
Machine Learning (ML)
Data Quality
Evaluation

Summary

We are looking for a Senior MLE Tech Lead to join a centralized evaluation organization and define the next generation of autograder quality across 20+ of Apple's most visible generative AI features. You will own the end-to-end technical vision for how we evaluate model outputs at scale - pioneering state-of-the-art methods, raising the technical bar, and leading a team of talented MLEs to build a robust autograder training and hillclimbing system from the ground up.

This is a high-impact, hands-on leadership role at the intersection of model evaluation, data quality, and ML systems engineering. You will work closely with model developers, data teams, and product partners to ensure our autograders are fast, accurate, and continuously improving - directly shaping the quality of AI experiences used by hundreds of millions of people.

In this role you will focus on:

Technical Leadership

* Define and drive the technical roadmap for autograder quality - researching and introducing novel methods such as reward modeling, LLM-as-judge, preference learning, and calibration techniques to measurably improve evaluation accuracy.
* Architect and lead the build-out of a scalable autograder training pipeline encompassing data curation, model fine-tuning, evaluation harnesses, and versioning.
* Design and own the hillclimbing system that iteratively improves autograder performance through systematic prompt and model optimization loops.
* Establish quality benchmarks, confidence metrics, and failure analysis frameworks that enable the team to track, trust, and act on autograder outputs.

People & Collaboration

* Mentor and technically guide a team of MLEs through design reviews, modeling standards, and hands-on problem-solving - fostering a culture of rigor and continuous learning.
* Partner with data annotation teams to define labeling guidelines that feed autograder training.
* Collaborate with feature engineers to align autograder signals with broader training and product objectives.
* Translate complex technical trade-offs into clear narratives for engineering, product, and leadership audiences.

Master's or PhD in Computer Science, Machine Learning, Artificial Intelligence, or a related field.
5+ years of industry experience in machine learning, with a strong focus on LLM or VLM systems.
Deep expertise in prompt-tuning and fine-tuning techniques (SFT, RLHF, DPO, or equivalent), with proven experience of model calibration and uncertainty estimation.
Familiarity with data flywheel design - leveraging model outputs to continuously improve future training data.
Proficiency in Python and ML frameworks (PyTorch preferred).

Strong ML systems instincts - you care deeply about data quality, reproducibility, latency, and scale.
Background in human-in-the-loop annotation pipelines and inter-annotator agreement analysis.
Prior experience on an evaluation infrastructure or model quality team.

Dice Id: 90733111
Position Id: 1bba464969ba59460c9242efe4ec8329
