1 - 20 of 187 Jobs

Senior ML Compiler Engineer

General Motors

Mountain View, California, USA

Full-time

Job Description Remote: This role is based remotely but if you live within a 50-mile radius of [Atlanta, Austin, Detroit, Warren, Milford or Mountain View], you are expected to report to that location three times a week, at minimum. Role: We are looking for a deep learning compiler engineer to build out our ML compiler for deploying machine learning models to a variety of ML hardware accelerators. You will develop and enhance GM's internal ML compiler for high performance, usability, and retar

Software Engineer, ML Inference Compiler & Deployment, GPU, CPU

Tesla Motors

Palo Alto, California, USA

Full-time

As a Software Engineer within our Autonomy teams, you will contribute to one of the most advanced and widely deployed AI Platforms in the world for Autopilot and our Humanoid Robot, Optimus. In this role, you will be responsible for the internal working of the AI inference stack and compiler running neural networks in millions of Tesla vehicles and Optimus. You will collaborate closely with the AI Engineers and Hardware Engineers to understand the full inference stack and design the compiler to

Software Engineer, ML Inference Compiler & Deployment, AI Frameworks

Tesla Motors

Palo Alto, California, USA

Full-time

As a Software Engineer within our Autonomy teams, you will contribute to one of the most advanced and widely deployed AI Platforms in the world, powering Autopilot and our Humanoid Robot, Optimus. In this role, you will be responsible for the internal working of the AI inference stack and compiler running neural networks in millions of Tesla vehicles and Optimus. You will collaborate closely with the AI Engineers and Hardware Engineers to understand the full inference stack and design the compi

Software Engineer, ML Inference Compiler & Deployment, AI Accelerator

Tesla Motors

Palo Alto, California, USA

Full-time

As a Software Engineer within our Autonomy teams, you will contribute to one of the most advanced and widely deployed AI Platforms in the world for Autopilot and our Humanoid Robot, Optimus. In this role, you will be responsible for the internal working of the AI inference stack and compiler running neural networks in millions of Tesla vehicles and Optimus. You will collaborate closely with the AI Engineers and Hardware Engineers to understand the full inference stack and design the compiler to

Machine Learning Kernel Performance Engineer, Dojo

Tesla Motors

Palo Alto, California, USA

Full-time

As a member of the Dojo Machine Learning software team, you will be responsible for developing and optimizing machine learning kernels to run on a massively parallel machine. The ideal candidate will have a strong background in kernel development and performance optimization for AI workloads, with a passion for delivering high-performance implementations, which are simple to use. Responsibilities Develop and validate datapath kernels on a massively parallel machineDesign and deliver implementat

Principal Machine Learning Engineer (NLP - AI Runtime Security)

PaloAlto Networks

Santa Clara, California, USA

Full-time

Company Description Our Mission At Palo Alto Networks everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we're looking for innovators who are as committed to shaping the future of cybersecurity as we are. Who We Are We take our mission of

Staff Software Engineer, ML Inference Compiler & Deployment, Optimus

Tesla Motors

Palo Alto, California, USA

Full-time

As a Software Engineer within our Autonomy teams, you will contribute to one of the most advanced and widely deployed AI Platforms in the world for Autopilot and our Humanoid Robot, Optimus. In this role, you will be responsible for the internal working of the AI inference stack running neural networks in Optimus and millions of Tesla vehicles. You will collaborate closely with the Optimus AI Engineers and AI Hardware Engineers to understand the full inference stack, co-design models to fit the

Principal AI/ML Engineer - Onboard Embodied AI

General Motors

Mountain View, California, USA

Full-time

Job Description Hybrid: This role is categorized as hybrid. This means the successful candidate is expected to report to the Mountain View Technical Center in the Bay Area three times per week, at minimum. Role: As a Technical Lead in Machine Learning within the Onboard Embodied AI organization, you will be a senior individual contributor driving cutting-edge end-to-end machine learning solutions directly impacting autonomous driving performance. Your role is pivotal in designing, architecting

Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Annapurna Labs (U.S.) Inc.

Cupertino, California, USA

Full-time

AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machinelearning accelerators and servers that use them. This role is for a software engineer in the Machine Learning Inference Model Enablement team for AWS Neuron at Annapurna Labs.This role is responsible for development, enablement and performance tuning of a wide variety of LLM model families, including massive scale large language models like the Llama family, DeepSeek and beyond, as well as stable dif

Senior Inference Technical Product Marketing Manager - Accelerated Computing

NVIDIA

Santa Clara, California, USA

Full-time

We are looking for a Senior Technical Product Marketing Manager. This role will be located in our rapidly growing data center business and pivotal in our inference marketing. You will be focused on working with engineering to understand the technical capabilities of our inference stack from GPUs, CPUs, networking, CUDA libraries, model architectures and deployment techniques (parallelisms, configurations, etc.). You will influence NVIDIA's entire technical marketing strategy to showcase our lead

Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Annapurna Labs (U.S.) Inc.

Cupertino, California, USA

Full-time

AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators and servers that use them. This role is for a software engineer in the Machine Learning Inference Model Enablement team for AWS Neuron at Annapurna Labs. This role is responsible for development, enablement and performance tuning of a wide variety of LLM model families, including massive scale large language models like the Llama family, DeepSeek and beyond, as well as stable

Sr. Software Engineer TS/SCI Polygraph

Leidos

Maryland, USA

Full-time

Description Leidos has an exciting opportunity for a Sr. Software Engineer! *Must have an active TS/SCI Polygraph up front. No exceptions.* You will perform software development lifecycle (SDLC) activities as both an individual and a member of our top-notch agile development team building a large complex enterprise system. Development includes the full range of turning Agile user stories into implementable concepts, through development, unit testing, integration and test, and deployment of the

Sr. Software Engineer TS/SCI Polygraph

Leidos

San Diego, California, USA

Full-time

Description Leidos has an exciting opportunity for a Sr. Software Engineer! *Must have an active TS/SCI Polygraph up front. No exceptions.* You will perform software development lifecycle (SDLC) activities as both an individual and a member of our top-notch agile development team building a large complex enterprise system. Development includes the full range of turning Agile user stories into implementable concepts, through development, unit testing, integration and test, and deployment of the

Senior Inference Technical Product Marketing Manager - Accelerated Computing

NVIDIA

Santa Clara, California, USA

Full-time

We are looking for a Senior Technical Product Marketing Manager. This role will be located in our rapidly growing data center business and pivotal in our inference marketing. You will be focused on working with engineering to understand the technical capabilities of our inference stack from GPUs, CPUs, networking, CUDA libraries, model architectures and deployment techniques (parallelisms, configurations, etc.). You will influence NVIDIA's entire technical marketing strategy to showcase our lead

Software Engineer TS/SCI Poly

Leidos

Alexandria, Virginia, USA

Full-time

Description Leidos has an exciting opportunity for a Software Engineer! *Must have an active TS/SCI Polygraph up front. No exceptions.* You will perform software development lifecycle (SDLC) activities as both an individual and a member of our top-notch agile development team building a large complex enterprise system. Development includes the full range of turning Agile user stories into implementable concepts, through development, unit testing, integration and test, and deployment of the new

Sr. Software Engineer TS/SCI Polygraph

Leidos

Franconia, Virginia, USA

Full-time

Description Leidos has an exciting opportunity for a Sr. Software Engineer! *Must have an active TS/SCI Polygraph up front. No exceptions.* You will perform software development lifecycle (SDLC) activities as both an individual and a member of our top-notch agile development team building a large complex enterprise system. Development includes the full range of turning Agile user stories into implementable concepts, through development, unit testing, integration and test, and deployment of the

Software Development Manager - Compiler Simulation , AWS Neuron, Annapurna Labs

Annapurna Labs (U.S.) Inc.

Seattle, Washington, USA

Full-time

AWS Machine Learning accelerators are at the forefront of AWS innovation. The Inferentia and Trainium chips, deliver industry-leading ML inference and training performance at the lowest cost in the cloud. This is all enabled by edge software stack, the AWS Neuron Software Development Kit (SDK), which includes an ML compiler and runtime and natively integrates into popular ML frameworks, such as PyTorch, JAX, TensorFlow and MxNet. AWS Neuron is is widely adopted by customers and partners like suc

Sr. Software Engineer- AI/ML, AWS Neuron Distributed Training - Multimodal

Annapurna Labs (U.S.) Inc.

Seattle, Washington, USA

Full-time

AWS Utility Computing (UC) provides product innovations that continue to set AWS's services and features apart in the industry. As a member of the UC organization, you'll support the development and management of Compute, Database, Storage, Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for their cloud services. Additionally, this role may involve exposure to and experience with Amazon's growing suite of generative AI s

Staff AI/ML Engineer - Onboard Embodied AI

General Motors

Mountain View, California, USA

Full-time

Job Description Hybrid: This role is categorized as hybrid. This means the successful candidate is expected to report to the Mountain View Technical Center in the Bay Area three times per week, at minimum. Role: As a Technical Lead in Machine Learning within the Onboard Embodied AI organization, you will be a senior individual contributor driving cutting-edge end-to-end machine learning solutions directly impacting autonomous driving performance. Your role is pivotal in designing, architecting,

Software Development Manager - Compiler Simulation , AWS Neuron, Annapurna Labs

Annapurna Labs (U.S.) Inc.

Seattle, Washington, USA

Full-time

AWS Machine Learning accelerators are at the forefront of AWS innovation. The Inferentia and Trainium chips, deliver industry-leading ML inference and training performance at the lowest cost in the cloud. This is all enabled by edge software stack, the AWS Neuron Software Development Kit (SDK), which includes an ML compiler and runtime and natively integrates into popular ML frameworks, such as PyTorch, JAX, TensorFlow and MxNet. AWS Neuron is is widely adopted by customers and partners like suc