InfiniBand Jobs

81 - 100 of 114 Jobs

Software Engineer, AI Networking, Machine Learning Infrastructure

Tesla Motors

Palo Alto, California, USA

Full-time

As a Software Engineer within the Autopilot AI Infrastructure team, you will work on reinforcing, optimizing, and scaling our infrastructure components supporting AI research activities for Autopilot and Optimus. At the core of our autonomy capabilities are neural networks that the research team is designing to train on very large amounts of data, across large-scale GPU clusters and our supercomputer Dojo. Robustly training these models at scale and in the shortest amount of time is critica

Senior Network Operations Engineer - GNOC (Remote)

Oracle Corporation

Remote or Abilene, Texas, USA

Full-time

Job Description We are seeking skilled and proactive engineers with 3-5 years of experience to join our Global Network Operations Center (GNOC) team. The GNOC is our front line for addressing physical network issues and operates 24x7x365 to ensure the reliability and efficiency of our physical network infrastructure. The team is responsible for performing data collection, triage, technical analysis, incident mitigation, and redirection as necessary to maintain and optimize operations. Our pri

Senior Classified HPC Engineer

Oak Ridge National Laboratory

Oak Ridge, Tennessee, USA

Full-time

Requisition Id 15020 Overview: We are hiring a Senior Linux HPC Systems Engineer to design, operate and maintain clusters, servers, and workstations supporting services where science happens at ORNL! This position resides in the Emerging Technologies & Computing team in the Research Computing group in the Information Technology Services Directorate at Oak Ridge National Laboratory (ORNL). The Emerging Technology Computational Group facilitates ORNL goals through HPC systems engineering, integra

Senior Software Engineer - AI Infra Compute

Oracle Corporation

Seattle, Washington, USA

Full-time

Job Description OCI (Oracle Cloud Infrastructure) AI Infrastructure is at the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI/ML/HPC workloads. This is your chance to be part of the AI revolution, creating systems that allow customers to scale from tens to thousands of GPUs without compromising performance. Our team is the GPU Availability and Monitoring team in the Compute Org. We are responsible for designing and developing architectural chang

Senior Software Engineer - AI Infra Compute

Oracle Corporation

Nashville, Tennessee, USA

Full-time

Job Description OCI (Oracle Cloud Infrastructure) AI Infrastructure is at the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI/ML/HPC workloads. This is your chance to be part of the AI revolution, creating systems that allow customers to scale from tens to thousands of GPUs without compromising performance. Our team is the GPU Availability and Monitoring team in the Compute Org. We are responsible for designing and developing architectural chang

Principal Security Engineer

Oracle Corporation

Nashville, Tennessee, USA

Full-time

Job Description Do you thrive on the leading edge of cloud technology and security? At Oracle Cloud Infrastructure (OCI), we are building and operating a suite of massive scale, integrated cloud services in a broadly distributed, multi-tenant cloud environment. OCI is providing best-in-cloud products that meet the needs of customers who are tackling some of the world's biggest challenges. The OCI Core Services organization is the engineering and operations backbone of Oracle Cloud Infrastructu

Principal Network Operation Engineer - GNOC (Remote)

Oracle Corporation

Texas, USA

Full-time

Job Description We are seeking skilled and proactive engineers with 6+ years of experience to join our Global Network Operations Center (GNOC) team. The GNOC is our front line for addressing physical network issues and operates 24x7x365 to ensure the reliability and efficiency of our physical network infrastructure. The team is responsible for performing data collection, triage, technical analysis, incident mitigation, and redirection as necessary to maintain and optimize operations. This ons

HPC Engineer

American Systems Corporation

Arlington, Virginia, USA

Full-time

Overview AMERICAN SYSTEMS is an employee-owned federal government contractor supporting national priority programs through our strategic solutions in the areas of Information Technology, Test & Evaluation, Program Mission Support, Engineering & Analysis, and Training. Responsibilities THIS POSITION COMES WITH A 10K SIGNING BONUS! As an HPC Engineer with AMERICAN SYSTEMS you will have an opportunity to do the following: Apply comprehensive knowledge of High Performance Computing (HPC) systems

Principal Software Developer - AI Infra Compute

Oracle Corporation

No location provided

Full-time

Job Description OCI (Oracle Cloud Infrastructure) AI Infrastructure is at the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI/ML/HPC workloads. This is your chance to be part of the AI revolution, creating systems that allow customers to scale from tens to thousands of GPUs without compromising performance. Our team is responsible for designing and developing fundamental architectural changes for GPU delivery, health monitoring, triage automatio

GBM - Investment Banking Engineering, Associate, UX Designer, Dallas

Goldman Sachs & Co.

Dallas, Texas, USA

Full-time

Job Description Investment Banking Engineering, UX Designer, Associate, Dallas Investment Banking The Investment Banking business (IB) works on some of the most complex financial challenges and transactions in the market today. Whether advising on a merger, providing financial solutions for an acquisition, or structuring an initial public offering, we handle projects that help clients at major milestones. We work with corporations, pension funds, financial sponsors, and governments and are te

Site Reliability Engineer - AI Cloud

SUPERMICRO COMPUTER INC

San Jose, California, USA

Full-time

Job Req ID: 26861 About Supermicro: Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop/ Big Data, Hyperscale, HPC and IoT/Embedded customers worldwide. We are the #5 fastest growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, pass

HPC Software Engineer

Cymertek Corporation

Tysons, Virginia, USA

Full-time

HPC Software Engineer LOCATION Tysons, VA 22182 CLEARANCE TS/SCI Full Poly (Please note this position requires full U.S. Citizenship) KEY SUMMARY We are seeking an innovative and driven HPC (High-Performance Computing) Software Engineer to develop and optimize software solutions for cutting-edge computational environments. In this role, you will design, implement, and enhance software that leverages HPC systems to solve complex problems and drive performance improvements. The ideal candidate h

HPC Software Engineer

Cymertek Corporation

Reston, Virginia, USA

Full-time

HPC Software Engineer LOCATION Reston, VA 20190 CLEARANCE TS/SCI Full Poly (Please note this position requires full U.S. Citizenship) KEY SUMMARY We are seeking an innovative and driven HPC (High-Performance Computing) Software Engineer to develop and optimize software solutions for cutting-edge computational environments. In this role, you will design, implement, and enhance software that leverages HPC systems to solve complex problems and drive performance improvements. The ideal candidate h

HPC Software Engineer

Cymertek Corporation

Annapolis, Maryland, USA

Full-time

HPC Software Engineer LOCATION Annapolis Junction, MD 20701 CLEARANCE TS/SCI Full Poly (Please note this position requires full U.S. Citizenship) KEY SUMMARY We are seeking an innovative and driven HPC (High-Performance Computing) Software Engineer to develop and optimize software solutions for cutting-edge computational environments. In this role, you will design, implement, and enhance software that leverages HPC systems to solve complex problems and drive performance improvements. The ideal

HPC Software Engineer

Cymertek Corporation

Chantilly, Virginia, USA

Full-time

HPC Software Engineer LOCATION Chantilly, VA 20151 CLEARANCE TS/SCI Full Poly (Please note this position requires full U.S. Citizenship) KEY SUMMARY We are seeking an innovative and driven HPC (High-Performance Computing) Software Engineer to develop and optimize software solutions for cutting-edge computational environments. In this role, you will design, implement, and enhance software that leverages HPC systems to solve complex problems and drive performance improvements. The ideal candidat

Senior DGX Cloud Software Engineer - Infrastructure Automation and Distributed Systems

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

We are seeking Software Engineers with previous experience building and running private and public clouds at production scale. As part of the DGX Cloud team, you'll have the opportunity to support our customers' journeys in AI training and inference development by building the platforms, tools, and services that defend the operational capacity of our bare-metal, accelerated compute infrastructure and codify reliability best-practices in the broader DGX Cloud platform ecosystem. What you'll be d

Software Developer 4

Oracle Corporation

No location provided

Full-time

Job Description OCI (Oracle Cloud Infrastructure) AI Infrastructure is at the forefront of building a cutting-edge, ultra-high-performance GPU platform designed to support AI/ML/HPC workloads. This is your chance to be part of the AI revolution, creating systems that allow customers to scale from tens to thousands of GPUs without compromising performance. Our team is responsible for designing and developing fundamental architectural changes for GPU delivery, health monitoring, triage automatio

Principal Switch Engineering Architect

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA's networking unit is a world-leading, fast-growing company that supports the most powerful supercomputers in the world. We make outstanding artificial intelligence happen and accelerate OpenAI's ChatGPT, for example. We believe in our people and products and seek excellent people to join us! We're looking for a hardware u/architect for our switch division. In this position, as part of a small (~10 employees) elite team, you will have the chance to define the architecture of NVIDIA's next

Senior AI Infrastructure Engineer

T-Mobile

Washington, USA

Full-time

At T-Mobile, we invest in YOU! Our Total Rewards Package ensures that employees get the same big love we give our customers. All team members receive a competitive base salary and compensation package - this is Total Rewards. Employees enjoy multiple wealth-building opportunities through our annual stock grant, employee stock purchase plan, 401(k), and access to free, year-round money coaches. That's how we're UNSTOPPABLE for our employees! Do you have a desire to help drive the future directio

Senior AI-HPC Cluster Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is a "learning machine" that constantly evolves by adapting to new opportunities that are hard to solve, that only we can tackle, and that matter to the world. This is our life's work, to amplify human