Staff/Sr. Machine Learning Engineer, Foundation Models - AI, Search & Knowledge Platforms

San Francisco, CA, US • Posted 30+ days ago • Updated 6 hours ago

Full Time

On-site

Job Details

Skills

Servers
Music
Drawing
Computer Hardware
Art
Research
Real-time
Use Cases
GPU
Cloud Computing
Amazon Web Services
Microsoft Azure
Kubernetes
Docker
PyTorch
TensorFlow
Golang
Python
Deep Learning
Writing
CUDA
NMS
Computer Science
Artificial Intelligence
Machine Learning (ML)
Information Retrieval
Data Science

Summary

We are the Foundation Model Inference team, within the AI, Search & Knowledge Platform Technologies organization. Our team is responsible for building the inference stack that powers Apple Intelligence. We build frameworks, services, and tools that power the largest Apple foundation models on servers. Our infrastructure powers a wide range of services at Apple, including Apple Search, Apple Music, Apple TV, App Store, iMessage, Photos & Camera, Spotlight, Safari, Siri, and upcoming Apple products, serving millions of queries every day at very low latency while drawing every ounce of compute from our hardware. As part of this group, you will have the chance to bring intelligence to billions of users across the world and to make a difference in people's lives by empowering them with AI. You will optimize language, vision, and speech models with billions of parameters using state-of-the-art technologies, and run them at Apple scale.

Work alongside the Foundation Model Research team to optimize inference for cutting-edge model architectures.
Work closely with product teams to build production-grade solutions to launch models serving millions of customers in real time.
Build tools to understand inference bottlenecks across different hardware and use cases.
Mentor and guide engineers in the organization.

5+ years of experience leading and driving complex, ambiguous projects.
Experience with the LLM inference stack.
Familiarity with GPU programming concepts using CUDA.
Familiarity with one of the popular ML frameworks, such as PyTorch or TensorFlow.
Experience with high-throughput services, particularly at supercomputing scale.
Proficiency in running applications in the cloud (AWS, Azure, or equivalent) using Kubernetes, Docker, etc.
BS in Computer Science, Artificial Intelligence, Machine Learning, Information Retrieval, Data Science, or a related field.

Proficiency in building and maintaining systems written in modern languages (e.g., Golang, Python).
Familiarity with fundamental deep learning architectures such as Transformers and encoder/decoder models.
Familiarity with NVIDIA TensorRT-LLM, vLLM, DeepSpeed, NVIDIA Triton Server, etc.
Experience writing custom GPU kernels using CUDA or OpenAI Triton.
MS in Computer Science, Artificial Intelligence, Machine Learning, Information Retrieval, Data Science, or a related field.

Dice Id: 90733111
Position Id: de8a4992a2492a7602d31f28063c7a42