hpc cloud performance engineer Jobs

Refine Results
61 - 80 of 137 Jobs

Senior Software Engineer, AI Resiliency

NVIDIA Corporation

Santa Clara, California, USA

Full-time

We are now looking for a Senior Software Engineer for AI Resiliency. At NVIDIA, we are pushing the boundaries of what's possible in AI. We are currently seeking a Senior Software Engineer to lead the development of AI software resiliency for the most powerful AI supercomputers in the world. As a member of our AI Software Resiliency team, you will play a pivotal role in defining and implementing critical resiliency features for AI supercomputers at a scale of 100,000+ GPUs. Your expertise will b

Lead Process Engineering, High Performance Computing

Peraton

Fort Meade, Maryland, USA

Full-time

Responsibilities Peraton is hiring a Lead Process Engineer to facilitate the government customer's large High Performance Computing (HPC) related program. This program is on the cutting edge and includes everything from HPC test planning and execution, architecture design and prototyping, and vendor outreach and collaboration support. Program technical areas include commercial cloud technologies, high performance computing, and enterprise architecture. The program is tactically important to the

Senior Advisor Process Engineering, High Performance Computing

Peraton

Fort Meade, Maryland, USA

Full-time

Responsibilities Peraton is hiring a Senior Advisor Process Engineer to facilitate the government customer's large High Performance Computing (HPC) related program. This program is on the cutting edge and includes everything from HPC test planning and execution, architecture design and prototyping, and vendor outreach and collaboration support. Program technical areas include commercial cloud technologies, high performance computing, and enterprise architecture. The program is tactically import

Software Developer - AI Infra Compute

Oracle Corporation

No location provided

Full-time

Job Description OCI (Oracle Cloud Infrastructure) AI Infrastructure is at the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI/ML/HPC workloads. This is your chance to be part of the AI revolution, creating systems that allow customers to scale from tens to thousands of GPUs without compromising performance. Our team is responsible for designing and developing fundamental architectural changes for GPU delivery, health monitoring, triage automatio

AI System Research and Development Engineer - Optimization

Snowflake Inc.

Menlo Park, California, USA

Full-time

Build the future of the AI Data Cloud. Join the Snowflake team. We are looking for talented System Developers and Researchers to join the Snowflake AI Research team and contribute to LLM inference and training system development, optimizations, and agentic systems. Our mission is to build the most efficient and scalable generative AI systems. Recent releases from our team include SwiftKV, an advanced inference optimization, and Arctic LLM, one of the largest open-source MoE foundation models.

HPC Systems Engineering, Journeyman

Peraton

Fort Meade, Maryland, USA

Full-time

Responsibilities Peraton is looking for an Journeyman Systems Engineer to facilitate the government customer's management of a large High Performance Computing (HPC) related program. This program is on the cutting edge and includes everything from HPC test planning and execution, architecture design and prototyping, and vendor outreach and collaboration support. Program technical areas include commercial cloud technologies, high performance computing, and enterprise architecture. The program is

Sr. Staff IT Software Engineer

Synopsys, Inc.

Austin, Texas, USA

Full-time

Descriptions & Requirements Job Description and Requirements OPEN TO HIRING IN AUSTIN, TX or HILLSBORO, OR We Are: At Synopsys, we drive the innovations that shape the way we live and connect. Our technology is central to the Era of Pervasive Intelligence, from self-driving cars to learning machines. We lead in chip design, verification, and IP integration, empowering the creation of high-performance silicon chips and software content. Join us to transform the future through continuous techno

Principal Software Eng - AI Infrastructure, OCI

Oracle Corporation

No location provided

Full-time

Job Description This position supports the design, deployment, and operations of a large-scale global Oracle Cloud Infrastructure (OCI) with a focus on application performance in relationship to network design and configuration. The role combines (i) understanding of networking at the protocol level, (ii) programming skills, and () skill in benchmarking and analyzing the performance of extreme-scale distributed computing systems. Why Join Us? Innovative Projects: Build groundbreaking solutions

AI Advisor - Global Financial

World Wide Technology

No location provided

Full-time

Qualifications & Experience 10+ years of experience in AI, data analytics, machine learning, and HPC environments. Proven ability to advise C-suite and technical leaders on AI strategy, solutions, and implementation. Strong background in AI infrastructure, data pipelines, model lifecycle management, and cloud/hybrid architectures. Expertise in AI frameworks, ML Ops, RAG architectures, LLMs, and enterprise AI deployment models. Hands-on experience with AI solutions from NVIDIA, AWS, Azure, Google

AI Advisor - Healthcare

World Wide Technology

No location provided

Full-time

Qualifications & Experience 10+ years of experience in AI, data analytics, machine learning, and HPC environments. Proven ability to advise C-suite and technical leaders on AI strategy, solutions, and implementation. Strong background in AI infrastructure, data pipelines, model lifecycle management, and cloud/hybrid architectures. Expertise in AI frameworks, ML Ops, RAG architectures, LLMs, and enterprise AI deployment models. Hands-on experience with AI solutions from NVIDIA, AWS, Azure, Google

Principal Lustre Engineer, CSS Global SaaS & Apps Delivery, Federal

Oracle Corporation

No location provided

Full-time

Job Description Principal Lustre Engineer Location: Telecommuting/remote from within the US Note: ship is required due to needed access of government systems. Any level of clearance a plus. Are you interested in building High-Performance Computing (HPC) Systems infrastructure for the cloud? Oracle's Cloud Infrastructure team is building new Lustre Storage Services that operate at high scale in a broadly distributed multi-tenant cloud environment. Our customers run their businesses on our cl

Software Developer - AI Infra Compute

Oracle Corporation

Seattle, Washington, USA

Full-time

Job Description OCI (Oracle Cloud Infrastructure) AI Infrastructure is at the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI/ML/HPC workloads. This is your chance to be part of the AI revolution, creating systems that allow customers to scale from tens to thousands of GPUs without compromising performance. Our team is responsible for designing and developing fundamental architectural changes for GPU delivery, health monitoring, triage automatio

Senior Software Engineer - Simulation and Virtualization

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We are hiring Sr. Software Engineer who will help build simulators for our DGX Server platforms. Simulations play a significant role in building scalable systems at Speed of Light! Y

Linux Systems Engineer (TS Clearance Required)

General Dynamics

Annapolis, Maryland, USA

Full-time

Type of Requisition: Regular Clearance Level Must Currently Possess: Top Secret/SCI Clearance Level Must Be Able to Obtain: Top Secret SCI + Polygraph Public Trust/Other Required: None Job Family: Systems Engineering Job Qualifications: Skills: Complex Systems, High-Performance Computing (HPC) Systems, Linux, Management Tools, Systems Engineering Certifications: None Experience: 10 + years of related experience ship Required: Yes Job Description: Linux Systems Engineer GDIT is seeking a

Principal Software Developer - AI Infra Compute

Oracle Corporation

No location provided

Full-time

Job Description OCI (Oracle Cloud Infrastructure) AI Infrastructure is at the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI/ML/HPC workloads. This is your chance to be part of the AI revolution, creating systems that allow customers to scale from tens to thousands of GPUs without compromising performance. Our team is responsible for designing and developing fundamental architectural changes for GPU delivery, health monitoring, triage automatio

Sr. Engineer, Engineering Tools and Support

Lucid Group, Inc.

Newark, New Jersey, USA

Full-time

Leading the future in luxury electric and mobility At Lucid, we set out to introduce the most captivating, luxury electric vehicles that elevate the human experience and transcend the perceived limitations of space, performance, and intelligence. Vehicles that are intuitive, liberating, and designed for the future of mobility. We plan to lead in this new era of luxury electric by returning to the fundamentals of great design - where every decision we make is in service of the individual and en

Technical Presales Consultant (DC Infra focused)

World Wide Technology

No location provided

Full-time

Required Experience and Skills (Must have)- 10+ years of Enterprise DC Environment and min 5+ years of Pre-Sales (technical sales / Sales engineering), experience (preferably with a System Integrator company). Mandatory exposure to maximum combination of OEMs across DC Infra OEMS including (Rack, Storage and Compute, OS Virtualization, Containers, Storage Networking, Disaster Recovery including backups, HPC Networking, Cloud, AIOPS) Strong business, commercial and technical understanding of DC I

Motorsports Senior Data Processing Engineer

General Motors

Concord, North Carolina, USA

Full-time

Job Description Onsite: This role is categorized as onsite. This means the successful candidate is expected to report to Concord, NC on a full-time basis. The Role We are seeking a highly skilled and motivated Senior Data Processing Engineer to join the Aero Innovation team within the Motorsports Aerodynamic Department . Your primary responsibility will be to develop complex data pipelines and data-processing systems that enhance the productivity and capabilities of our aerodynamics develo

Integration Engineer

Modern Technology Solutions

Alexandria, Virginia, USA

Full-time

Overview Modern Technology Solutions, Inc. (MTSI) is seeking a Integration Engineer to join our team in the Alexandria, VA area. You will provide technical support for programs, processes, and capabilities within the Assistant Secretary of Defense for Mission Capabilities (ASD(MC)) within the Office of the Under Secretary of Defense for Research and Engineering (OUSD(R&E)). Why is MTSI known as a Great Place to Work? Interesting Work: Our co-workers support some of the most important and criti

Enterprise Network Architect / Integrator

Modern Technology Solutions

Alexandria, Virginia, USA

Full-time

Overview Modern Technology Solutions, Inc. (MTSI) is seeking an Enterprise Network Architect / Integrator to join our team in the Alexandria, VA area. You will provide technical support for programs, processes, and capabilities within the Assistant Secretary of Defense for Mission Capabilities (ASD(MC)) within the Office Under the Secretary of Defense for Research and Engineering (OUSD(R&E)). Why is MTSI known as a Great Place to Work? Interesting Work: Our co-workers support some of the most