HPC/AI - Kubernetes Engineer Jobs in San Jose, CA

1 - 20 of 376 Jobs

Senior AI-HPC Cluster Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is a "learning machine" that constantly evolves by adapting to new opportunities that are hard to solve, that only we can tackle, and that matter to the world. This is our life's work, to amplify human

Senior Systems Software Engineer, Containers and Kubernetes

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA is looking for a hardworking Sr. Systems Software Engineer to work on platform software based on open-source container runtimes and Kubernetes technologies. We expect you to have strong programming skills, a deep understanding of designing and building software, especially related to Go and C, experience with Systems Software and Distributed systems, as well as excellent communication and planning skills. We also welcome out-of-the-box problem solvers who can provide new ideas while stron

HPC Product Development Engineer (Kubernetes/Shell/Python)

KLA

Milpitas, California, USA

Full-time

Company Overview KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel d

Senior Engineer - HPC and AI

AMD (Advanced Micro Devices)

Santa Clara, California, USA

Full-time

WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellen

Software Engineer - HPC

KLA

Milpitas, California, USA

Full-time

Company Overview KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel d

Senior Math Libraries Engineer - AI and HPC

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA Math Libraries team is looking for an expert engineer to join our development efforts in the area of kernel generation for AI and HPC, specifically targeting matrix operations, JITing and fusions. Around the world, leading commercial and academic organizations are revolutionizing AI, scientific and engineering simulations, and data analytics, using data centers powered by GPUs. Applications of these technologies are in healthcare, NLP, VR, deep learning, autonomous vehicles and countless

Senior AI-HPC Storage Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is a "learning machine" that constantly evolves by adapting to new opportunities that are hard to solve, that only we can address, and that matter to the world. This is our life's work, to amplify huma

Solutions Architect, HPC Systems Engineer

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA is looking for an experienced GPU and network systems Solutions Architect & Engineer. Do you want to be part of a team that brings new Artificial Intelligence (AI) hardware and software technologies to production in customer data centers? As part of the NVIDIA SA organization, you will be driving deployment of our end-to-end technology solutions integration at some of NVIDIA's most strategic technology customers, as well as offering recommendations to business and engineering teams on our

Principal Machine Learning Engineer, AI (FULLY REMOTE IN USA)

Splunk Inc.

Remote or San Jose, California, USA

Full-time

Description Join us as we pursue our disruptive new vision to make machine data accessible, usable and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk, we're committed to our work, customers, having fun and most importantly to each other's success. Learn more about Splunk careers and how you can become a part of our journey! Principal Machine Learning Engineer (MLE), Artificial I

Artificial Intelligence Engineer

KLA

Milpitas, California, USA

Full-time

Company Overview KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel d

Senior Developer Technology Engineer - AI

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

We're currently seeking a Senior Developer Technology Engineer, Artificial Intelligence! Would you enjoy researching parallel algorithms to accelerate AI workloads on advanced computer architectures? Is it rewarding to investigate, find, and eliminate system bottlenecks to achieve the best possible performance of computer hardware? Could you be thrilled about an opportunity to partner with the Developer community, working at the forefront of technology breakthroughs that contribute to the succes

Principal Software Engineer - AI Infrastructure, OCI

Oracle Corporation

Santa Clara, California, USA

Full-time

Job Description This position supports the design, deployment, and operations of a large-scale global Oracle Cloud Infrastructure (OCI) with a focus on application performance in relation to network design and configuration. The role combines (i) understanding of networking at the protocol level, (ii) programming skills, and (iii) skill in benchmarking and analyzing the performance of extreme-scale distributed computing systems. Why Join Us? Innovative Projects: Build groundbreaking solutions

Senior Software Engineer, AI Resiliency

NVIDIA Corporation

Santa Clara, California, USA

Full-time

We are now looking for a Senior Software Engineer for AI Resiliency. At NVIDIA, we are pushing the boundaries of what's possible in AI. We are currently seeking a Senior Software Engineer to lead the development of AI software resiliency for the most powerful AI supercomputers in the world. As a member of our AI Software Resiliency team, you will play a pivotal role in defining and implementing critical resiliency features for AI supercomputers at a scale of 100,000+ GPUs. Your expertise will b

Senior HPC Performance Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. We are the GPU Communications Libraries and Networking tea

Backend Software Engineer, Machine Learning Platform, AI Infrastructure

Tesla Motors

Palo Alto, California, USA

Full-time

As a Software Engineer within the Autopilot AI Infrastructure team, you will work on reinforcing, optimizing, and scaling our infrastructure components supporting AI research activities for Autopilot and the Optimus. At the core of our autonomy capabilities are neural networks that the research team is designing to train on very large amounts of data, across large-scale GPU clusters and our supercomputer Dojo. Robustly training these models at scale and in the shortest amount of time is critica

Fullstack Software Engineer, Machine Learning Platform, AI Infrastructure

Tesla Motors

Palo Alto, California, USA

Full-time

As a Software Engineer within the Autopilot AI Infrastructure team, you will work on reinforcing, optimizing, and scaling our infrastructure components supporting AI research activities for Autopilot and the Optimus. At the core of our autonomy capabilities are neural networks that the research team is designing to train on very large amounts of data, across large-scale GPU clusters and our supercomputer Dojo. Robustly training these models at scale and in the shortest amount of time is critical

Sr MLOps Engineer, AI Platform Training

Adobe Systems

San Jose, California, USA

Full-time

Our Company Changing the world through digital experiences is what Adobe's all about. We give everyone, from emerging artists to global brands, everything they need to design and deliver exceptional digital experiences! We're passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen. We're on a mission to hire the very best and are committed to creating exceptional employee experiences wher

AI System Research and Development Engineer - Frameworks

Snowflake Inc.

Menlo Park, California, USA

Full-time

Build the future of the AI Data Cloud. Join the Snowflake team. We are looking for talented System Developers and Researchers to join the Snowflake AI Research team and contribute to LLM inference and training system development, optimizations, and agentic systems. Our mission is to build the most efficient and scalable generative AI systems. Recent releases from our team include SwiftKV, an advanced inference optimization, and Arctic LLM, one of the largest open-source MoE foundation models.

Senior Software Engineer, Kubernetes Networking

Roblox

San Mateo, California, USA

Full-time

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators. At Roblox, we're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We're on a mission to connect a billion people with op

Principal Software Engineer, Kubernetes Networking

Roblox

San Mateo, California, USA

Full-time

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators. At Roblox, we're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We're on a mission to connect a billion people with op