On-device ML Infrastructure Engineer, Compiler & Runtime, Graphics, Games & ML

Cupertino, CA, US • Posted 5 hours ago • Updated 5 hours ago
Full Time
On-site
Fitment

Dice Job Match Score™

⏳ Almost there, hang tight...

Job Details

Skills

  • Artificial Intelligence
  • Research
  • Product Development
  • Innovation
  • Onboarding
  • Data Compression
  • Benchmarking
  • Forms
  • Backbone.js
  • Workflow
  • Optimization
  • Debugging
  • FOCUS
  • Computer Hardware
  • Use Cases
  • Software Engineering
  • Computer Science
  • C++
  • Python
  • Swift
  • Operating Systems
  • Embedded Systems
  • Communication
  • Open Source
  • LLVM
  • PyTorch
  • TensorFlow
  • JAX
  • Machine Learning (ML)
  • GPU

Summary

Imagine being at the forefront of an evolution where modern AI meets the elegance of Apple silicon. The On-Device Machine Learning team transforms groundbreaking research into practical applications, enabling billions of Apple devices to run powerful AI models locally, privately, and efficiently. We stand at the unique intersection of research, software engineering, hardware engineering, and product development, making Apple the leading destination for machine learning innovation.\\n\\nOur team builds the essential infrastructure that enables machine learning at scale on Apple devices. This involves onboarding powerful architectures to embedded systems, developing optimization toolkits for model compression and acceleration, building ML compilers and runtimes for efficient execution, and creating comprehensive benchmarking and debugging toolchains. This infrastructure forms the backbone of Apple's machine learning workflows across Camera, Siri, Health, Vision, and other core experiences, contributing to the overall Apple Intelligence ecosystem.\\n\\nWe're building an end-to-end developer experience for machine learning development that brings to bear Apple's vertical integration. This allows developers to iterate on model authoring, optimization, transformation, execution, debugging, profiling, and analysis. If you are passionate about the technical challenges of running sophisticated ML models across all devices, from resource-constrained devices to powerful cluster, and eager to directly impact how machine learning operates across the Apple ecosystem, this role presents a great opportunity to work on the next generation of intelligent experiences on Apple platforms.\\n

\nWe are seeking an experienced ML Infrastructure Engineer with a specific focus on building the best execution engine and compilation toolchain that employs our compilers infrastructure and the world's most efficient, portable, and extensible runtime, and which is capable of optimizing and driving ML models efficiently on Apple products and services, current and future.\n\nThis is a senior role and functions as the glue between our compiler technology, the runtime components, the kernel libraries, and the low-level hardware compilers to enable the execution of ML across a wide variety of devices and use cases. The successful candidate will make critical decisions affecting project direction and outcome. \n\nWe're seeking a highly motivated software engineer who is creative, skilled, and passionate about machine learning, common compiler optimizations, and system software engineering in the fast-paced and dynamic field of machine learning

\nBachelors in Computer Science, Engineering, or related subject area and 5+ years of hands on experience.\n\nHighly proficient in C++. Familiarity with Python and Swift.\n\nFamiliarity with Operating Systems and Embedded Programming.\n\nSound understanding of ML fundamentals, including common architectures such as Transformers.\n\nGood communication skills, including ability to communicate with multi-functional audiences.

\nExperience with any on-device ML stack, such as TFLite, ONNX, ExecuTorch, etc.\n\nExperience with open source machine learning models (Mistral, Phi, Gemma, Huggingface, etc)\n\nExperience with any compiler stack (MLIR/LLVM/TVM/...).\n\nExperience with any ML authoring framework (PyTorch, TensorFlow, JAX, etc.).\n\nExperience with machine learning accelerators and GPU programming.\n
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 90733111
  • Position Id: b1a0c59df96938b7439a683d1223a569
  • Posted 5 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Cupertino, California

Today

Full-time

Cupertino, California

Today

Full-time

Cupertino, California

Today

Full-time

Cupertino, California

Today

Full-time

Search all similar jobs