Machine Learning Engineer, Foundation Model Services

Santa Clara, CA, US • Posted 2 days ago • Updated 5 hours ago
Full Time
On-site
Fitment

Dice Job Match Score™

👤 Reviewing your profile...

Job Details

Skills

  • Servers
  • Music
  • Drawing
  • Computer Hardware
  • Art
  • Real-time
  • Research
  • Build Tools
  • Use Cases
  • Natural Language Processing
  • Information Retrieval
  • Statistics
  • Cloud Computing
  • Amazon Web Services
  • Microsoft Azure
  • Kubernetes
  • Docker
  • Golang
  • Python
  • Computer Science
  • Machine Learning (ML)
  • PyTorch
  • TensorFlow
  • Deep Learning

Summary

Do you feel you think differently, you are eager to break status quo, are bold and ambitious, aren't afraid to take risks and are passionate to build the best of class technology. If yes, what better place to be at and do this than Apple? At Apple, "we think different, we push the boundaries of computing and intelligence. We build products that bring smile to people's face". \\n\\nFoundation Model Services team, within Machine Learning Platform Technologies organization is the back-bone of Apple Intelligence. It builds frameworks, services and tools that power the largest Apple foundation models on servers. Our Infrastructure powers a wide gamut of services at Apple including Apple Search, Apple Music, AppleTV, AppStore, iMessages, Photos & Camera, Spotlight, Safari, Siri and upcoming ever exciting Apple products serving millions of queries every day with incredible low latencies, drawing every ounce of compute from our hardware. As part of this group, you will get a chance to bring Intelligence to billions of users across the world. You will have an opportunity to make a difference in life of people. You will have a chance to work on optimizing billions of parameter language and vision and speech models using state of the art technologies and make it run at scale of Apple.

* Work closely with product teams to build production grade solutions to launch models serving millions of customers in real time. \n\n* Work along side Foundation Model Research team to prototype and develop inference for cutting edge model architectures. \n\n* Build tools to understand bottlenecks in Inference for different hardwares and use cases.

5 year+ industry experience in ML technologies (LLMs, Machine Learning, NLP, Information Retrieval, Statistics).\nExperience with high throughput services particularly at supercomputing scale.\nProficient with running applications on Cloud (AWS / Azure or equivalent) using Kubernetes, Docker etc.\nProficient in building and maintaining systems written in modern languages (eg: Golang, python)\nBachelor's degree or higher in Computer Science or related technical field.

Familiar with one of the popular ML Frameworks like Pytorch, Tensorflow.\nFamiliar with fundamental Deep Learning architectures such as Transformers, Encoder/Decoder models.\nFamiliarity with Nvidia TensorRT-LLM, vLLLM, DeepSpeed, Nvidia Triton Server etc.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 90733111
  • Position Id: 3d98963cd2b8f85371be76f6fe062e09
  • Posted 2 days ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

San Jose, California

Today

Full-time

USD 151,800.00 - 265,350.00 per year

Cupertino, California

Today

Full-time

Sunnyvale, California

Today

Full-time

USD 185,500.00 - 270,000.00 per year

Sunnyvale, California

Today

Full-time

USD 155,420.00 - 205,900.00 per year

Search all similar jobs