AIML - Machine Learning Engineer, Foundation Models

Cupertino, CA, US • Posted 1 day ago • Updated 2 hours ago
Full Time
On-site

Job Details

Skills

  • Machine Learning (ML)
  • Modeling
  • Spectrum
  • Workflow
  • Research
  • Use Cases
  • Natural Language Processing
  • Training
  • Python
  • Deep Learning
  • JAX
  • PyTorch
  • TensorFlow
  • Computer Science
  • Information Retrieval
  • Computer Hardware
  • Privacy

Summary

We build frontier foundation models that power intelligent experiences at Apple. Our team works across the full training lifecycle, including pre-training foundation models and developing mid-training approaches that bridge general capability and task-specific performance. What makes our work distinct is that we engineer models specifically for Apple silicon and optimize them for experiences that are private, personal, and deeply integrated into the OS. We're solving frontier problems: building reward models that resist reward hacking, handling sparse and delayed rewards in agentic settings, and aligning models reliably across the spectrum from open-ended creative tasks to precise, action-taking workflows. If you're drawn to hard problems where the research and the product are inseparable, this is the team.

We believe that the most interesting problems in deep learning research arise when we try to apply learning to real-world use cases, and this is also where the most important breakthroughs come from. You will work with a close-knit and fast-growing team of world-class engineers and scientists to tackle some of the most challenging problems in foundation models and deep learning, including natural language processing, multi-modal understanding, and combining learning with knowledge.

Further, you will have opportunities to identify and develop novel applications of deep learning in Apple products. You will see your ideas not only published in papers, but also improving the experience of millions of users.

  • Proven track record in training or deploying large models, or in building large-scale distributed systems.
  • Proficiency in Python and in one of the deep learning toolkits, such as JAX, PyTorch, or TensorFlow.
  • Ability to work in a collaborative environment.
  • PhD, or equivalent practical experience, in Computer Science or a related technical field.

  • Web-scale information retrieval
  • Human-like conversation agent
  • Multi-modal perception for existing products and future hardware platforms
  • On-device intelligence and learning with strong privacy protections
  • Dice Id: 90733111
  • Position Id: 61037901844d1fb31ba757922c204570