HPC Cloud Performance Engineer Jobs

Refine Results
121 - 140 of 141 Jobs

Linux System Administrator (Scientist 2)

Los Alamos National Laboratory

Los Alamos, New Mexico, USA

Full-time

Description Job Title Linux System Administrator (Scientist 2) Location Los Alamos, NM, US Organization Name HPC-SYS/High Performance Computing Systems Minimum Salary 104100 Maximum Salary 172200 What You Will Do The High Performance Computing (HPC) Division at Los Alamos National Laboratory provides scientific computing resources consisting of some of the largest HPC systems in the world as well as numerous large commodity cluster systems. Our Support Services Team within the HPC Systems Grou

Senior Member of Technical Staff - AI/ML Infrastructure Engineer 3

Oracle Corporation

No location provided

Full-time

Job Description As an AI/ML Infrastructure Engineer on the GPU Strategic Customers Engineering team, you will play a critical role in designing, implementing, and maintaining the infrastructure that supports our AI and machine learning initiatives. You will work closely with data scientists, software engineers, and IT professionals to ensure that our AI/ML models are deployed efficiently, securely, and at scale. Your expertise will be crucial in optimizing our infrastructure for performance, re

Solutions Architect, Networking - Cloud Service Providers

NVIDIA Corporation

Redmond, Washington, USA

Full-time

NVIDIA is building the world's leading AI company, and we are looking for an experienced Solutions Architect to focus on networking and help develop accelerated computing networking solutions for AI/ML and HPC with our Hyperscaler customers. As part of the NVIDIA Solutions Architecture team, you will work closely with strategic customers, offering technical expertise and support for both hardware and software solutions aligned with our product strategy. What you'll be doing: Working with Cloud

Manager, Software Technical Program Management - Datacenter Systems

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA data center systems, such as DGX, MGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We're looking for a strong technology leader for running NVIDIA's server TPM team. You will be the cross-section between execution and strategy, leading a team of Senior TPMs dr

HITS-U III Site Lead Army Research Lab (ARL)

General Dynamics Information Technology

Aberdeen, Maryland, USA

Full-time

Type of Requisition: Pipeline Clearance Level Must Currently Possess: Top Secret Clearance Level Must Be Able to Obtain: Top Secret/SCI Public Trust/Other Required: None Job Family: Information Systems Management Job Qualifications: Skills: High-Performance Computing (HPC) Systems, People Management, Team ManagementCertifications: NoneExperience: 8 + years of related experienceship Required: Yes Job Description: Provide the DoD Supercomputing Resource Center (DSRC) operations support, incl

HITS-U III Site Lead NAVY DSRC

General Dynamics Information Technology

John C. Stennis Space Center, Mississippi, USA

Full-time

Type of Requisition: Pipeline Clearance Level Must Currently Possess: Top Secret Clearance Level Must Be Able to Obtain: Top Secret/SCI Public Trust/Other Required: None Job Family: Information Systems Management Job Qualifications: Skills: High-Performance Computing (HPC) Systems, People Management, Team ManagementCertifications: NoneExperience: 8 + years of related experienceship Required: Yes Job Description: Provide the DoD Supercomputing Resource Center (DSRC) operations support, incl

Modeling, Simulation and Analysis Engineer

Peraton

Chantilly, Virginia, USA

Full-time

Responsibilities Are you a curious, driven analyst or software engineer excited by the challenge of harnessing the power of data science, simulation and operations research to optimize the Intelligence Community (IC) to get GEOINT data more efficiently to the users who need it most? As a member of PERATON's dynamic Performance Engineering and Analysis team, you will use your technical and analytic skills to identify trends and patterns using the team's data-rich ecosystem of GEOINT collection &

HITS-U III Site Lead Air Force Research Lab (AFRL)

General Dynamics Information Technology

Dayton, Ohio, USA

Full-time

Type of Requisition: Pipeline Clearance Level Must Currently Possess: Top Secret Clearance Level Must Be Able to Obtain: Top Secret/SCI Public Trust/Other Required: None Job Family: Information Systems Management Job Qualifications: Skills: High-Performance Computing (HPC) Systems, People Management, Team ManagementCertifications: NoneExperience: 8 + years of related experienceship Required: Yes Job Description: Provide the DoD Supercomputing Resource Center (DSRC) operations support, incl

Senior System Software Engineer, NCCL - Partner Enablement

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. We are the GPU Communications Libraries and Networking tea

Senior Software Engineer - Python Numerical Computing Libraries

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

We are looking for an experienced software professional to contribute to design and development of accelerated and distributed implementations of Python APIs for numerical computing. In the last decade, Python has become the de-facto programming language for practitioners in AI, data science and HPC, through popular frameworks such as NumPy, SciPy, TensorFlow and PyTorch. These frameworks provide an efficient high-level programming interface, allowing their users to focus on their application wh

Senior Solutions Architect, Networking - Cloud Service Providers

NVIDIA Corporation

Santa Clara, California, USA

Full-time

Are you seeking an opportunity to contribute to a team that brings innovative Artificial Intelligence (AI) technology to NVIDIA's largest clients? We are hiring a Solutions Architect to focus on networking and help develop accelerated computing networking solutions for AI/ML and HPC on hyperscalers. As part of the NVIDIA Solutions Architecture team, you will work closely with strategic customers, offering technical expertise and support for both hardware and software solutions aligned with our p

Principal Firmware Engineer - Data Center Server Management

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning - the next era of computing - with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as "the AI computing company." We're looking to grow our company and establish teams with the mos

Senior Software QA Engineer

PsiQuantum

Remote or Palo Alto, California, USA

Full-time

Quantum computing holds the promise of humanity's mastery over the natural world, but only if we can build a real quantum computer. PsiQuantum is on a mission to build the first real, useful quantum computers, capable of delivering the world-changing applications that the technology has long promised. We know that means we will need to build a system with roughly 1 million qubits that supports fault tolerant error correction within a scalable architecture, and a data center footprint. By harnes

Sr Principal Thermal Engineer - Data Center

Oracle Corporation

No location provided

Full-time

Job Description As a Principal Thermal Engineer, you will focus on the alignment of OCI thermal hardware design and the data center physical infrastructure. A mix of technical breadth and depth is required to work cross-functionally with different disciplines (Mechanical, Electrical, Thermal, and Software) to develop thermal solution optimized for the entire stack, and effectively manage high performance data center devices. You will work with internal and external partners, focusing on deliver

Signal Processing Engineer

Oak Ridge National Laboratory

Oak Ridge, Tennessee, USA

Full-time

Requisition Id 14873 Overview: As a U.S. Department of Energy (DOE) Office of Science national laboratory, ORNL has an impressive 80-year legacy of addressing the nation's most pressing challenges. Our team is made up of over 7,000 dedicated and innovative individuals! Our goal is to create an environment where a variety of perspectives and backgrounds are valued, ensuring ORNL is known as a top choice for employment. These principles are essential for supporting our broader mission to drive sci

Senior Principal Thermal Engineer - Data Center

Oracle Corporation

Ashburn, Virginia, USA

Full-time

Job Description As a Senior Principal Thermal Engineer, you will focus on the alignment of OCI thermal hardware design and the data center physical infrastructure. A mix of technical breadth and depth is required to work cross-functionally with different disciplines (Mechanical, Electrical, Thermal, and Software) to develop thermal solution optimized for the entire stack, and effectively manage high performance data centers. You will work with internal and external partners, focusing on deliver

Senior Machine Learning Engineer

Randstad - Torc

Remote

Contract

We are looking for a Senior Machine Learning Engineer to join our team and drive the development of advanced AI-driven solutions. You will be responsible for designing, deploying, and optimizing machine learning models that power real-world applications. This role requires deep expertise in model architecture, data engineering, MLOps, and production-scale AI systems. What you'll do:Design, develop, and deploy scalable, production-ready machine learning models for real-world applications.Architec

Sr. Python Architect

Zaspar Technologies

Palo Alto, California, USA

Full-time

Our client 8bit.AI is a dynamic startup in the Bay Area, CA seeking to hire Full-time employees and focused on developing a high-performance, multi-technology, vendor-independent, xPU-based Accelerated Cloud Computing platform. We stack massive clusters purpose-built for high-performance parallel computing and aim to launch a global accelerated cloud solution. Additionally, the firm will focus on broader Artificial General Intelligence (AGI) products, supercomputing services, and end-to-end AI e

Java Full stack Developer

Teknoviq Solutions

Detroit, Michigan, USA

Contract

The Senior Full-Stack Software Developer (Java/Spring Boot) will design, develop, and maintain high-performance, scalable applications. This role involves working across the entire tech stack, focusing on back-end services using Java, Spring Boot and Postgres, while also contributing to front-end development using Angular framework. Responsibilities include collaborating with cross-functional teams to deliver robust solutions, driving the architecture and design of new features, optimizing perfo

Infrastructure Architect

Calsoft Labs

Lansing, Michigan, USA

Contract

Please attach a separate Reference Page to your bid (not within resume) that includes at least 2 professional references! Be sure to include the reference s full name, phone number, email, affiliation to the candidate (Company Name, Title, Relationship, etc). Top Skills & Years of Experience: 10+ years in Linux system administration (Ubuntu, CLI, security, networking) 10+ years in Bash & Python scripting, with pipeline automation experience (e.g., Nextflow) 10+ years in Slurm workload manager i