HPC Cloud Performance Engineer Jobs in San Jose, CA

1 - 20 of 36 Jobs

Senior HPC Performance Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. We are the GPU Communications Libraries and Networking tea

Senior AI-HPC Cluster Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is a "learning machine" that constantly evolves by adapting to new opportunities that are hard to solve, that only we can tackle, and that matter to the world. This is our life's work, to amplify human

Senior AI-HPC Storage Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is a "learning machine" that constantly evolves by adapting to new opportunities that are hard to solve, that only we can address, and that matter to the world. This is our life's work, to amplify huma

IT Cloud UNIX Engineer - MP1013025

Juniper Networks

Sunnyvale, California, USA

Full-time

At Juniper, we believe the network is the single greatest vehicle for knowledge, understanding, and human advancement the world has ever known. To achieve real outcomes, we know that experience is the most important requirement for networking teams and the people they serve. Delivering an experience-first, AI-Native Network pivots on the creativity and commitment of our people. It requires a consistent and committed practice, something we call the Juniper Way. IT Cloud UNIX Engineer Location:

Senior HPC Systems Engineer

RedLine Performance Solutions

Remote

Full-time

RedLine Performance Solutions (RedLine) has been in the HPC solutions engineering services business for 25 years and is consistently determined to keep the "bar of excellence" quite high for new hires. This enables RedLine to accomplish what other firms cannot and promotes a high level of staff retention. We offer services ranging from full life cycle HPC systems engineering to remote managed services to HPC program analysis. We are seeking a Senior HPC Systems Engineer to join our NASA NACS Hig

Senior System Software Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

We're looking for a systems-minded technical leader to orchestrate engineering efforts across platform, product, and operations. This role sits at the intersection of software engineering, DevOps, and technical program leadership, with an emphasis on engineering. You'll work on a product integration team that pulls together infrastructure and software from a range of teams, making it function cohesively before handing off "recipes" to product teams to polish and ship. You'll step into ambiguous p

Manufacturing Engineer

Super Micro Computer

San Jose, California, USA

Full-time

Job Req ID: 26142 About Supermicro: Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop/Big Data, Hyperscale, HPC and IoT/Embedded customers worldwide. We are the #5 fastest growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, pass

Solutions Architect - Cloud Providers and Hyperscale

NVIDIA Corporation

Santa Clara, California, USA

Full-time

We are now looking for a Solutions Architect! NVIDIA is searching for a Solutions Architect with expertise in AI, Machine Learning, and HPC, with a focus on Hyperscale and Cloud Providers. Primary responsibilities will be to lead technical engagements with customers as they integrate, optimize, and apply NVIDIA's hardware and software technologies. Would you like to collaborate with some of the biggest companies developing brand new AI solutions by applying both NVIDIA and cloud technologies? Interested

Platform Software Engineer (RDMA/Ethernet Driver - Linux/Windows)

AMD (Advanced Micro Devices)

Santa Clara, California, USA

Full-time

WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences - the building blocks for the data center, artificial intelligence, PCs, gaming and embedded. Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges. We strive for execution excellenc

Principal Software Engineer - AI Infrastructure, OCI

Oracle Corporation

Santa Clara, California, USA

Full-time

Job Description This position supports the design, deployment, and operations of a large-scale global Oracle Cloud Infrastructure (OCI) with a focus on application performance in relation to network design and configuration. The role combines (i) understanding of networking at the protocol level, (ii) programming skills, and (iii) skill in benchmarking and analyzing the performance of extreme-scale distributed computing systems. Why Join Us? Innovative Projects: Build groundbreaking solutions

Senior Software Engineer, AI Resiliency

NVIDIA Corporation

Santa Clara, California, USA

Full-time

We are now looking for a Senior Software Engineer for AI Resiliency. At NVIDIA, we are pushing the boundaries of what's possible in AI. We are currently seeking a Senior Software Engineer to lead the development of AI software resiliency for the most powerful AI supercomputers in the world. As a member of our AI Software Resiliency team, you will play a pivotal role in defining and implementing critical resiliency features for AI supercomputers at a scale of 100,000+ GPUs. Your expertise will b

AI System Research and Development Engineer - Frameworks

Snowflake Inc.

Menlo Park, California, USA

Full-time

Build the future of the AI Data Cloud. Join the Snowflake team. We are looking for talented System Developers and Researchers to join the Snowflake AI Research team and contribute to LLM inference and training system development, optimizations, and agentic systems. Our mission is to build the most efficient and scalable generative AI systems. Recent releases from our team include SwiftKV, an advanced inference optimization, and Arctic LLM, one of the largest open-source MoE foundation models.

Senior Software Engineer - Simulation and Virtualization

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We are hiring a Sr. Software Engineer who will help build simulators for our DGX Server platforms. Simulations play a significant role in building scalable systems at Speed of Light! Y

AI System Research and Development Engineer - Optimization

Snowflake Inc.

Menlo Park, California, USA

Full-time

Build the future of the AI Data Cloud. Join the Snowflake team. We are looking for talented System Developers and Researchers to join the Snowflake AI Research team and contribute to LLM inference and training system development, optimizations, and agentic systems. Our mission is to build the most efficient and scalable generative AI systems. Recent releases from our team include SwiftKV, an advanced inference optimization, and Arctic LLM, one of the largest open-source MoE foundation models.

Senior Software Engineer, Infrastructure Engineering

10x Genomics

Pleasanton, California, USA

Full-time

About the role We are looking for an exceptional Software Engineer to build out our growing cloud infrastructure and HPC platform with a solid understanding of Linux, cloud, and distributed computing to join our team. Our multi-disciplinary team in microfluidics, biochemistry, mechanical engineering, computational biology, and software has a proven track record of delivering successful commercial products built on deep technological innovation. If you are a self-starter who is passionate about

Senior Software Architect - Data Center Systems

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

We are building innovative server systems for GPU-accelerated applications, such as Deep Learning. The Data Center SW team architects and develops the end-to-end software and firmware stack for these systems. We are looking for a Senior Software Architect who has deep expertise in designing server platforms and an understanding of application use cases in Deep Learning workloads. You will work with world-class engineering teams, product management, Operations and Customer support to build sys

AI and ML Infra Software Engineer, GPU Clusters

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.

Deep Learning Engineer - Distributed Task-Based Backends

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

We are looking for Senior to Principal level experienced software professionals to help build the next generation of distributed backends for premier Deep Learning frameworks like PyTorch, JAX and TensorFlow. You will build on top of validated task-based runtime systems like Legate, Legion & Realm to develop a platform that can scale a wide range of model architectures to thousands of GPUs! What You Will Be Doing: Develop extensions to popular Deep Learning frameworks, that enable easy experime

Senior Storage and Data Production Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

Production engineering is a team that involves designing, building, and maintaining large-scale production systems with high efficiency and availability. It encompasses various areas, including software and systems engineering practices, storage, data management, and services. Production Engineers possess expertise in different domains, such as storage architecture, high-performance distributed storage, data management, systems, networking, coding, database management, capacity planning, continu

Distinguished Engineer - Data Center System Software Architect

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We're looking for a strong technical architect to own the end-to-end architecture of these products, at the system software level. Including firmware, kernel drivers, operating syst