JAX Jobs in California


Senior ML Compiler Engineer

General Motors

Mountain View, California, USA

Full-time

Job Description Remote: This role is based remotely but if you live within a 50-mile radius of Mountain View, you are expected to report to that location three times a week, at minimum. Role: We are looking for a deep learning compiler engineer to build out our ML compiler for deploying machine learning models to a variety of ML hardware accelerators. You will develop and enhance GM's internal ML compiler for high performance, usability, and retargetability by leveraging open-source technology

Software Engineer, ML Inference Compiler & Deployment, GPU, CPU

Tesla Motors

Palo Alto, California, USA

Full-time

As a Software Engineer within our Autonomy teams, you will contribute to one of the most advanced and widely deployed AI Platforms in the world for Autopilot and our Humanoid Robot, Optimus. In this role, you will be responsible for the internal working of the AI inference stack and compiler running neural networks in millions of Tesla vehicles and Optimus. You will collaborate closely with the AI Engineers and Hardware Engineers to understand the full inference stack and design the compiler to

Principal Machine Learning Engineer (NLP - AI Runtime Security)

Palo Alto Networks

Santa Clara, California, USA

Full-time

Company Description Our Mission At Palo Alto Networks everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we're looking for innovators who are as committed to shaping the future of cybersecurity as we are. Who We Are We take our mission of

Software Engineer, ML Inference Compiler & Deployment, AI Frameworks

Tesla Motors

Palo Alto, California, USA

Full-time

As a Software Engineer within our Autonomy teams, you will contribute to one of the most advanced and widely deployed AI Platforms in the world, powering Autopilot and our Humanoid Robot, Optimus. In this role, you will be responsible for the internal working of the AI inference stack and compiler running neural networks in millions of Tesla vehicles and Optimus. You will collaborate closely with the AI Engineers and Hardware Engineers to understand the full inference stack and design the compi

Machine Learning Kernel Performance Engineer, Dojo

Tesla Motors

Palo Alto, California, USA

Full-time

As a member of the Dojo Machine Learning software team, you will be responsible for developing and optimizing machine learning kernels to run on a massively parallel machine. The ideal candidate will have a strong background in kernel development and performance optimization for AI workloads, with a passion for delivering high-performance implementations that are simple to use. Responsibilities: Develop and validate datapath kernels on a massively parallel machine. Design and deliver implementat

Software Engineer, ML Inference Compiler & Deployment, AI Accelerator

Tesla Motors

Palo Alto, California, USA

Full-time

As a Software Engineer within our Autonomy teams, you will contribute to one of the most advanced and widely deployed AI Platforms in the world for Autopilot and our Humanoid Robot, Optimus. In this role, you will be responsible for the internal working of the AI inference stack and compiler running neural networks in millions of Tesla vehicles and Optimus. You will collaborate closely with the AI Engineers and Hardware Engineers to understand the full inference stack and design the compiler to

Principal AI Engineer, Enterprise AI Platform

Palo Alto Networks

Santa Clara, California, USA

Full-time

Company Description Our Mission At Palo Alto Networks everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we're looking for innovators who are as committed to shaping the future of cybersecurity as we are. Who We Are We take our mission of

Principal AI/ML Engineer - Onboard Embodied AI

General Motors

Mountain View, California, USA

Full-time

Job Description Hybrid: This role is categorized as hybrid. This means the successful candidate is expected to report to the Mountain View Technical Center in the Bay Area three times per week, at minimum. Role: As a Technical Lead in Machine Learning within the Onboard Embodied AI organization, you will be a senior individual contributor driving cutting-edge end-to-end machine learning solutions directly impacting autonomous driving performance. Your role is pivotal in designing, architecting

Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Annapurna Labs (U.S.) Inc.

Cupertino, California, USA

Full-time

AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators and servers that use them. This role is for a software engineer in the Machine Learning Inference Model Enablement team for AWS Neuron at Annapurna Labs. This role is responsible for development, enablement and performance tuning of a wide variety of LLM model families, including massive scale large language models like the Llama family, DeepSeek and beyond, as well as stable dif

Senior Software Development Engineer, AI/ML, AWS Neuron, Model Inference

Annapurna Labs (U.S.) Inc.

Cupertino, California, USA

Full-time

AWS Neuron is the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators and servers that use them. This role is for a software engineer in the Machine Learning Inference Model Enablement team for AWS Neuron at Annapurna Labs. This role is responsible for development, enablement and performance tuning of a wide variety of LLM model families, including massive scale large language models like the Llama family, DeepSeek and beyond, as well as stable

Senior Inference Technical Product Marketing Manager - Accelerated Computing

NVIDIA

Santa Clara, California, USA

Full-time

We are looking for a Senior Technical Product Marketing Manager. This role will sit in our rapidly growing data center business and will be pivotal to our inference marketing. You will focus on working with engineering to understand the technical capabilities of our inference stack, from GPUs, CPUs, networking, and CUDA libraries to model architectures and deployment techniques (parallelisms, configurations, etc.). You will influence NVIDIA's entire technical marketing strategy to showcase our lead

Staff AI/ML Engineer - Onboard Embodied AI

General Motors

Mountain View, California, USA

Full-time

Job Description Hybrid: This role is categorized as hybrid. This means the successful candidate is expected to report to the Mountain View Technical Center in the Bay Area three times per week, at minimum. Role: As a Technical Lead in Machine Learning within the Onboard Embodied AI organization, you will be a senior individual contributor driving cutting-edge end-to-end machine learning solutions directly impacting autonomous driving performance. Your role is pivotal in designing, architecting,

Deep Learning Vision Engineer, Optimus

Tesla Motors

Palo Alto, California, USA

Full-time

Tesla is on a path to build humanoid robots at scale to automate repetitive and boring tasks. The goal of our Machine Learning team is to build and demonstrate a general robot learning system that can leverage AI to perform complex physical tasks, ranging from full-body locomotion to precise manipulation. This specific role will do hands-on data analysis for Optimus's Machine Learning team, as well as build data pipelines, tools and applications to automate those analyses. This is a unique oppo

Senior Machine Learning Engineer

Randstad - Torc

Remote

Contract

We are looking for a Senior Machine Learning Engineer to join our team and drive the development of advanced AI-driven solutions. You will be responsible for designing, deploying, and optimizing machine learning models that power real-world applications. This role requires deep expertise in model architecture, data engineering, MLOps, and production-scale AI systems. What you'll do: Design, develop, and deploy scalable, production-ready machine learning models for real-world applications. Architec

Senior Site Reliability Engineer, AI Infrastructure

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you! NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for over 30 years. It's a unique legacy of innovation that's fueled by phenomenal technology and outstanding people. Today, we're tapping into the unlimited potential of

Staff Software Engineer, Machine Learning

Discord Inc.

Remote

Full-time

Discord is used by over 200 million people every month for many different reasons, but there's one thing that nearly everyone does on our platform: play video games. Over 90% of our users play games, spending a combined 1.5 billion hours playing thousands of unique titles on Discord each month. Discord plays a uniquely important role in the future of gaming. We are focused on making it easier and more fun for people to talk and hang out before, during, and after playing games. We're currently l

Developer Technologies Engineer, Robotics Reinforcement Learning

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

Today, NVIDIA is tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. As an NVIDIAN, you'll be immersed in a diverse, encouraging environment where everyone is inspired to do their best work. Come join the team and see how we can make a lasting impact on the world

Principal Software Engineer, TensorRT-LLM

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

We are now looking for a Principal Software Engineer, TensorRT-LLM! NVIDIA is hiring an experienced principal software engineer for its TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in AI, enabling breakthroughs in areas like content creation, code generation and information synthesis that have put AI on the precipice of an "iPhone moment". Join the team building the AI serving software which is foundational to product lines within NVIDIA

Senior GenAI Algorithms Engineer

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

We are now looking for a Senior Gen AI Algorithms Engineer! NVIDIA is seeking engineers to design, develop and optimize Artificial Intelligence solutions to diverse real-world problems. If you have a strong understanding of deep learning and in particular large language models and their multimodal variants, then this role may be a great fit for you! Collaborate and interact with internal partners, users, and members of the open source community to analyze, define and implement highly optimized A