HPC/AI - Kubernetes Engineer Jobs in San Jose, CA

AI System Research and Development Engineer - Optimization

Snowflake Inc.

Menlo Park, California, USA

Full-time

Build the future of the AI Data Cloud. Join the Snowflake team. We are looking for talented System Developers and Researchers to join the Snowflake AI Research team and contribute to LLM inference and training system development, optimizations, and agentic systems. Our mission is to build the most efficient and scalable generative AI systems. Recent releases from our team include SwiftKV, an advanced inference optimization, and Arctic LLM, one of the largest open-source MoE foundation models.

Senior AI Infrastructure Engineer - DGX Cloud

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

DGXC SRE at NVIDIA ensures that our internal and external-facing GPU cloud services run with maximum reliability and uptime as promised to users, while enabling developers to make changes to the existing system through careful preparation and planning, keeping an eye on capacity, latency, and performance. We are looking for systems and software engineers who are interested in building tooling, reporting, automation, and ML to enable operational excellence across a highly dynami

Senior Software Engineer, Kubernetes Fleet

Roblox

San Mateo, California, USA

Full-time

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators. At Roblox, we're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We're on a mission to connect a billion people with op

AI Sr. Performance Engineer

AMD (Advanced Micro Devices)

Santa Clara, California, USA

Full-time

WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellenc

AI and ML Infra Software Engineer, GPU Clusters

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing: an era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.

Lead Machine Learning Engineer, Performance and Scalability, Generative AI

Adobe Systems

San Jose, California, USA

Full-time

Our Company: Changing the world through digital experiences is what Adobe's all about. We give everyone, from emerging artists to global brands, everything they need to design and deliver exceptional digital experiences! We're passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen. We're on a mission to hire the very best and are committed to creating exceptional employee experiences wher

Sr Principal Engineer Software (Cloud NW - AI Security)

Palo Alto Networks

Santa Clara, California, USA

Full-time

Company Description Our Mission At Palo Alto Networks everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we're looking for innovators who are as committed to shaping the future of cybersecurity as we are. Who We Are We take our mission of

Software Engineer, AI Networking, Machine Learning Infrastructure

Tesla Motors

Palo Alto, California, USA

Full-time

As a Software Engineer within the Autopilot AI Infrastructure team, you will work on reinforcing, optimizing, and scaling our infrastructure components supporting AI research activities for Autopilot and Optimus. At the core of our autonomy capabilities are neural networks that the research team is designing to train on very large amounts of data, across large-scale GPU clusters and our supercomputer Dojo. Robustly training these models at scale and in the shortest amount of time is critica

Senior System Software Engineer - AI Performance and Efficiency Tools

NVIDIA Corporation

Santa Clara, California, USA

Full-time

A key part of NVIDIA's strength is our sophisticated analysis and debugging tools that empower NVIDIA engineers to improve the performance and power efficiency of our products and the applications running on them. We are looking for forward-thinking, hard-working, and creative people to join a multifaceted software team with high standards! This software engineering role involves developing tools for AI researchers and SW/HW teams running AI workloads in GPU clusters. As a member of the software development team, we

AI Software Engineer (Backend) - AI Research Team

Salesforce

Palo Alto, California, USA

Full-time

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Job Category: Software Engineering. Job Details: About Salesforce. We're Salesforce, the Customer Company, inspiring the future of business with AI + Data + CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too - driving you

Sr Staff Engineer Software (Agentic AI Security)

Palo Alto Networks

Santa Clara, California, USA

Full-time

Company Description Our Mission At Palo Alto Networks everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we're looking for innovators who are as committed to shaping the future of cybersecurity as we are. Who We Are We take our mission of

Principal Software Development Engineer - AI/ML

Oracle Corporation

Santa Clara, California, USA

Full-time

Job Description: Oracle believes in empowering people to do more, through world-class capabilities in analytics. We are the Data Services team within Oracle Analytics, responsible for innovating, building, and supporting data service management technologies and capabilities that support our diverse portfolio of products. The Data Services organization's mission is to provide an easy way to develop advanced analytical and AI applications combining data from across the entire Oracle ecosystem including

Sr Principal Software Engineer (Agentic AI Security)

Palo Alto Networks

Santa Clara, California, USA

Full-time

Company Description Our Mission At Palo Alto Networks everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we're looking for innovators who are as committed to shaping the future of cybersecurity as we are. Who We Are We take our mission of

Senior HPC Systems Engineer

RedLine Performance Solutions

Remote

Full-time

RedLine Performance Solutions (RedLine) has been in the HPC solutions engineering services business for 25 years and is consistently determined to keep the "bar of excellence" quite high for new hires. This enables RedLine to accomplish what other firms cannot and promotes a high level of staff retention. We offer services ranging from full life cycle HPC systems engineering to remote managed services to HPC program analysis. We are seeking a Senior HPC Systems Engineer to join our NASA NACS Hig

Platform Engineer - AI Software Solutions

AMD (Advanced Micro Devices)

Santa Clara, California, USA

Full-time

WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellenc

Principal Software Engineer - Enterprise AI Platform

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing: an era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.

Member of Technical Staff, Infrastructure Engineer

Microsoft Corporation

Mountain View, California, USA

Full-time

Overview: As Microsoft continues to push the boundaries of AI, we are on the lookout for passionate individuals to work with us on the most interesting and challenging AI questions of our time. Our vision is bold and broad - to build systems that have true artificial intelligence across agents, applications, services, and infrastructure. It's also inclusive: we aim to make AI accessible to all - consumers, businesses, developers - so that everyone can realize its benefits. Our Platform Infrastr

AI Engineer II

Electronic Arts

Redwood City, California, USA

Full-time

Description & Requirements Electronic Arts creates next-level entertainment experiences that inspire players and fans around the world. Here, everyone is part of the story. Part of a community that connects across the globe. A place where creativity thrives, new perspectives are invited, and ideas matter. A team where everyone makes play happen. The EA Digital Platform (EADP) group is the core powering the global EA ecosystem. We provide the foundation for all of EA's incredible games and playe

Observability Software Engineer, AI Infrastructure

Tesla Motors

Fremont, California, USA

Full-time

As a member of Tesla's "Insane Visibility" team, you will design, implement & maintain end-to-end observability across our AI Infrastructure stack and develop the framework to benchmark performance & processing of pipelines. You'll be responsible for building dashboards, alerts & monitoring necessary for Autopilot & AI teams to address observability issues in our FSD, Robotaxi & Optimus applications, ensuring these programs run smoothly throughout the full infrastructure stack. Responsibilities

Software Architect - Client Server

Horizontal Talent

Pleasanton, California, USA

Contract

We are seeking a talented Software Architect with expertise in client-server architecture to join our dynamic team. This role offers the opportunity to work on innovative search solutions that enhance user experiences and meet business needs. Responsibilities Design and implement search solutions, integrating them with Java-based systems. Optimize search infrastructure for performance and scalability. Collaborate with cross-functional teams to deliver effective search solutions. Create and manag