Apply Now

Machine Learning Engineer, Foundation Model Services

Santa Clara, CA, US • Posted 30+ days ago • Updated 5 hours ago

Full Time

On-site

Fitment

Dice Job Match Score™

✨ Finding the perfect fit...

Job Details

Skills

Servers
Music
Drawing
Computer Hardware
Art
Real-time
Research
Build Tools
Use Cases
Natural Language Processing
Information Retrieval
Statistics
Cloud Computing
Amazon Web Services
Microsoft Azure
Kubernetes
Docker
Golang
Python
Computer Science
Machine Learning (ML)
PyTorch
TensorFlow
Deep Learning

Summary

Do you feel you think differently, you are eager to break status quo, are bold and ambitious, aren't afraid to take risks and are passionate to build the best of class technology. If yes, what better place to be at and do this than Apple? At Apple, "we think different, we push the boundaries of computing and intelligence. We build products that bring smile to people's face".

Foundation Model Services team, within Machine Learning Platform Technologies organization is the back-bone of Apple Intelligence. It builds frameworks, services and tools that power the largest Apple foundation models on servers. Our Infrastructure powers a wide gamut of services at Apple including Apple Search, Apple Music, AppleTV, AppStore, iMessages, Photos & Camera, Spotlight, Safari, Siri and upcoming ever exciting Apple products serving millions of queries every day with incredible low latencies, drawing every ounce of compute from our hardware. As part of this group, you will get a chance to bring Intelligence to billions of users across the world. You will have an opportunity to make a difference in life of people. You will have a chance to work on optimizing billions of parameter language and vision and speech models using state of the art technologies and make it run at scale of Apple.

Description

* Work closely with product teams to build production grade solutions to launch models serving millions of customers in real time.

* Work along side Foundation Model Research team to prototype and develop inference for cutting edge model architectures.

* Build tools to understand bottlenecks in Inference for different hardwares and use cases.

Minimum Qualifications

5 year+ industry experience in ML technologies (LLMs, Machine Learning, NLP, Information Retrieval, Statistics).

Experience with high throughput services particularly at supercomputing scale.

Proficient with running applications on Cloud (AWS / Azure or equivalent) using Kubernetes, Docker etc.

Proficient in building and maintaining systems written in modern languages (eg: Golang, python)

Bachelor's degree or higher in Computer Science or related technical field.

Preferred Qualifications

Familiar with one of the popular ML Frameworks like Pytorch, Tensorflow.

Familiar with fundamental Deep Learning architectures such as Transformers, Encoder/Decoder models.

Familiarity with Nvidia TensorRT-LLM, vLLLM, DeepSpeed, Nvidia Triton Server etc.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 90733111
Position Id: 3d98963cd2b8f85371be76f6fe062e09
Posted 30+ days ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Santa Clara, California

•

Today

We are looking for a ML Engineer to join our ML Compute team to help improve the efficiency, scalability, and reliability of model training and inference workloads in the cloud. In this role, you will lead the integration of large-scale ML workloads with cloud infrastructure, working cross-functionally with ML engineers, infrastructure engineers, and researchers to optimize performance, improve system efficiency, and drive high utilization of accelerator resources. Description We are a group o

Full-time

Senior Machine Learning Engineer, Agentic Systems - Moveworks

Mountain View, California

•

Today

Company Description Who we are Moveworks is the Agentic AI Assistant platform that empowers the entire workforce. Our platform enables employees to converse with all of their business systems through natural language to quickly find answers and automate tasks. Powered by the world's most advanced LLMs, our proprietary models, and a sophisticated Agentic AI platform, we're transforming how work gets done by allowing AI to take initiative, streamline complex workflows, and continuously learn and

Full-time

Senior ML Infrastructure Engineer - Embodied AI

Sunnyvale, California

•

Today

Job Description At General Motors, our product teams are redefining mobility. Through a human-centered design process, we create vehicles and experiences that are designed not just to be seen, but to be felt. We're turning today's impossible into tomorrow's standard -from breakthrough hardware and battery systems to intuitive design, intelligent software, and next-generation safety and entertainment features. Every day, our products move millions of people as we aim to make driving safer, smart

Full-time

USD 153,200.00 - 234,100.00 per year

Senior ML Infrastructure Engineer - Embodied AI Scaling Foundations

Sunnyvale, California

•

Today

Full-time

USD 153,200.00 per year

Search all similar jobs

More jobs at Apple, Inc. in Santa Clara, CA

Machine Learning Engineer, Foundation Model Services

Dice Job Match Score™

Job Details

Skills

Summary

Similar Jobs