Sr. Machine Learning Engineer, ASR Infrastructure and Tools, Siri Speech

Cupertino, CA, US • Posted 60+ days ago • Updated 9 hours ago
Full Time
On-site

Job Details

Skills

  • Artificial Intelligence
  • Research
  • Cross-functional Team
  • Machine Learning (ML)
  • Speech Recognition
  • Natural Language Processing
  • Management
  • PySpark
  • JAX
  • Open Source
  • Modeling
  • Unstructured Data
  • Data Processing
  • Apache Spark
  • Software Engineering
  • Python
  • Computer Science
  • Training
  • HPC
  • Data Engineering
  • Problem Solving
  • Conflict Resolution
  • Critical Thinking

Summary

Want to join the team pushing the boundaries of AI and building an intelligent assistant that helps millions of people get things done? Join the Siri team at Apple! To build the best speech recognition models, we need to use the latest technology in distributed training and the best available data. We combine those needs into one team and are focused on blurring the lines between traditional "data processing" and "model training". Efficiently training on petabytes of audio data pushes us to consider the entire training stack while developing new models to extract useful signals from unprecedented volumes of data.

By joining our team, you'll have the opportunity to work with large and diverse datasets, iterate with research and production teams, and deliver voice-based experiences to millions of users worldwide.

The Siri Speech team is looking for exceptional individuals to extend the core technology that lets Siri understand, learn, and remember. You will be part of a cross-functional team consisting of software engineers as well as data and machine learning engineers and scientists, and you will have a large impact on the Siri product. This is a rare opportunity to apply distributed data engineering techniques at the intersection of areas such as speech recognition, natural language processing, and dialogue management.

In this role you will:

  • Work with open source tools like PySpark, JAX, Ray, and others
  • Optimize how multi-modal data moves from various sources into complex model training pipelines
  • Use open source models to extract signals from large volumes of speech data to drive modeling improvements

  • Experience processing large, complex, unstructured data
  • Knowledge of distributed data processing frameworks (Beam, Spark, Dask, Ray)
  • Strong software engineering abilities, ideally Python
  • Strong interpersonal skills to work well with engineering teams

  • M.S. or Ph.D. degree in Computer Science or another technical discipline
  • Machine Learning experience a plus
  • Experience with optimizing and running large batch or training jobs on HPC-like clusters using GPUs or TPUs
  • Speech understanding or generation experience a plus
  • Strong data engineering background in the speech and/or language/text/dialogue processing field
  • Excellent problem-solving and critical-thinking skills
  • Ability to work in a fast-paced environment with rapidly changing priorities
  • Passionate about building extraordinary products and experiences for our users
  • Dice Id: 90733111
  • Position Id: 7e97a20ced185541b7303ad76722f4af