GPU Design Engineer Memory Hierarchy Jobs in San Francisco, CA

Refine Results
1 - 20 of 133 Jobs

Staff AI Infra Engineer (data platform)

Genmo

San Francisco, California, USA

Full-time

Job DescriptionJob DescriptionWe are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the boundaries of what's possible in video generation. Role overview:We're seeking an experienced Senior/Staff AI Infra Engineer to design, build, and scale our petabyte-scale data infrastructure. You'll be responsible for creating robust, scalable systems that manage and proce

Staff AI Infra Engineer (serving API)

Genmo

San Francisco, California, USA

Full-time

Job DescriptionJob DescriptionWe are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the boundaries of what's possible in video generation. Role OverviewWe are looking for a senior/staff software engineer to join our inference team. In this role, you will be responsible for designing and scaling our inference systems as they grow to support over millions of use

Senior AI Performance Engineer

Genmo

San Francisco, California, USA

Full-time

Job DescriptionJob DescriptionWe are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Join us in shaping the future of AI and pushing the boundaries of what's possible in video generation. Role overview:As a Deep Learning Performance Engineer at Genmo, you will play a critical role in optimizing the performance of our large generative AI models. Your expertise will ensure that our models run efficiently on cl

Senior Compiler Engineer, LLVM

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

We are looking for an experienced LLVM Compiler Engineer for an exciting and fun role in our GPU Software organization. We deliver features and improvements to better realize the potential of NVIDIA GPUs for a growing range of computational workloads, ranging from deep learning, scientific computation, and self-driving cars to graphics workloads for game titles on gaming platforms. Our compiler organization makes its mark on every GPU NVIDIA produces. Would you like to add this to your accompli

Principal Engineer, GPU Platform

OpenAI

San Francisco, California, USA

Full-time

About the Team The Applied Engineering team works across research, engineering, product, and design to bring OpenAI's technology to consumers and businesses. You'll join the team responsible for running the infrastructure that supports the models backing ChatGPT and the API. The systems we support include inference kubernetes clusters, GPU health, Infiniband performance, node lifecycle, and more.We seek to learn from deployment and distribute the benefits of AI, while ensuring that this powerfu

Post-training - Model Fusion Research Engineer

OpenAI

San Francisco, California, USA

Full-time

About the Team Our team is responsible for the post-training phase of ChatGPT, where we transform a large pre-trained model into a powerful, safe, and user-friendly chatbot. We collaborate with various teams across the company to enhance ChatGPT's safety, speed, intelligence, utility, and overall capabilities. We integrate these improvements into the final models that power our production ChatGPT and API services, impacting millions of users worldwide. About the Role We are seeking an enginee

Senior GPU Compiler Development Engineer

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

We are looking for experienced Systems SW Compiler Engineers for an exciting role in our PTX (Parallel Thread Execution) Compiler Development team. Join the PTX Compiler team and help drive PTX language design and PTX compiler evolution. PTX enables all GPU Computing applications including HPC, Deep Learning and Autonomous Driving. PTX provides a stable programming model and portable instruction set Architecture (ISA) for NVIDIA GPUs and used by all Compute programming languages compiled to NVID

Director of Engineering - Cloud Compute

Crusoe

San Francisco, California, USA

Full-time

Job DescriptionJob DescriptionCrusoe Energy is on a mission to unlock value in stranded energy resources through the power of computation. We aim to align the long term interests of the climate with the future of global computing infrastructure. As data centers consume an exponentially growing power footprint to deliver technology to all connected devices, we are inspired by making sure that the energy meeting that demand is sourced in an environmentally responsible fashion. Crusoe co-locates m

Cloud Engineer - GPU Hosting

AMAX

Fremont, California, USA

Full-time

Job DescriptionJob Description*Salary range: $120,000-$170,000* AMAX is seeking a skilled Cloud Engineer with expertise in GPU workloads to join our team. In this role, you will be responsible for designing, deploying, and managing cloud infrastructure specifically tailored for GPU hosting. You will optimize GPU utilization and performance within cloud environments, ensuring that systems run efficiently and securely. Essential Functions Design, deploy, and manage cloud infrastructure for GPU hos

Senior HPC Systems Engineer

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology-and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.

Senior Performance Engineer

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology-and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.

Internship, Software Engineer, Autonomy Systems Foundations (Winter/Spring 2025)

Tesla Motors

Palo Alto, California, USA

Full-time

As a C++ Software Engineer within the Autonomy group, you will have the opportunity to apply your technical skills to a variety of system components & foundational code targeting higher performance of Self-Driving and Humanoid robot. The nature of the role means that the code you will write, debug, and maintain will almost always connect with a variety of other components. You will be building robust code foundations for the autonomy teams to write their applications on top of and evangelize bes

Analytics Engineer, Infrastructure Strategy

OpenAI

San Francisco, California, USA

Full-time

About the Role As a critical member of the Applied Engineering Analytics Data team, you will be instrumental in enhancing our understanding of our infrastructure, particularly our GPU fleet: its allocation, its utilization, its costs, and opportunities for optimization. This role is pivotal to ensuring we optimize on infrastructure investments, which is vital for our AI research and deployment activities. You will work on projects that develop key data sources and dashboards to provide actionab

Tech lead, senior software engineer,AR/VR

Hireio, Inc.

San Francisco, California, USA

Full-time

Job DescriptionJob DescriptionLanguage:bilingual Requirements - B.Sc/M.Sc/PhD in Computer Science or in a related field - 5+ years of industrial C/C++ experience, solid CS fundamentals (algorithms, data structures, OOD/DOD, TDD) and problem-solving skills - Solid algorithm system design and implementation experience, including but not limited to inference engines, machine learning compilers, deployment pipelines, CV/CG/NLP/AIGC SDKs. - Deep hands-on experience in one of the following areas: com

Senior/Staff Software Engineer, Managed AI

Crusoe

San Francisco, California, USA

Full-time

Job DescriptionJob DescriptionCrusoe Energy is on a mission to unlock value in stranded energy resources through the power of computation. Take a look at what we do! - https://.youtube.com/watch?v=Rlt8k71Quqw We aim to align the long term interests of the climate with the future of global computing infrastructure. As data centers consume an exponentially growing power footprint to deliver technology to all connected devices, we are inspired by making sure that the energy meeting that demand is

Senior Software Engineer, Distributed Task-based Runtimes

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing - with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. We're looking to grow our company, and form teams with the most inquisitive people in the world. Join us at the forefront of technological ad

Software Engineer, Tools GPU (Core)

The Walt Disney Company

Emeryville, California, USA

Full-time

Are you interested in advancing Pixar's in-house and open-source filmmaking software? Our Software R&D department is looking for a motivated and skilled engineer to help develop the studio's interactive rendering architecture. We work very closely with both artists and engineers to build innovative filmmaking tools that enable our film production and continuously extend artistic reach. As a Software Engineer on the GPU team, you will work on hardware-accelerated preview rendering in our content

Staff Software Engineer, Ads ML Infra

Pinterest, Inc.

San Francisco, California, USA

Full-time

About Pinterest: Millions of people across the world come to Pinterest to find new ideas every day. It's where they get inspiration, dream about new possibilities and plan for what matters most. Our mission is to help those people find their inspiration and create a life they love. In your role, you'll be challenged to take on work that upholds this mission and pushes Pinterest forward. You'll grow as a person and leader in your field, all the while helping Pinners make their lives better in the

Senior GPU Register Database and System Architect

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

Industry leading NVIDIA GPUs not only render breathtaking images but are also driving the AI and self-driving revolution. The GPU Host architecture team in the Nvidia GPU Architecture organization is seeking a technically strong individual with both hardware design and software development expertise. In this wide-ranging role the individual will help design and be responsible for a database of GPU internal registers and develop associated tools for updating and querying the database. As part of

Software Developer, AI Frameworks & Readiness Team

Oracle Corporation

San Francisco, California, USA

Full-time

Job Description Design, develop, troubleshoot and debug Artificial Intelligence (AI) Frameworks & Customer Readiness team identifies developer and operation friction areas with OCI's GPU adoption by incubating and developing AI software that enables customers to easily onboard run, monitor and managing AI models at scale with OCI. We collaborate with our OCI compute and hardware teams, hardware (OEM) partners to build the software stacks for AI accelerators. Team specializes in working with Op