hpc consultant - nvidia h200 gpu Jobs

Refine Results
41 - 60 of 64 Jobs

Senior Principal Software Engineer - Cluster Networks (JoinOCI-SDE)

Oracle Corporation

Seattle, Washington, USA

Full-time

Job Description Cloud Engineering Infrastructure Development Oracle Cloud Infrastructure (OCI) Cluster Networking team is building an ultra-high performance network required to support AI/ML/HPC workloads. This is your opportunity to join the AI revolution and designing systems which allow customers to scale from tens to thousands of GPU without compromising on performance. This team will be responsible for designing, developing and performance tuning the networking stack required to run dist

Senior Principal Software Engineer - Cluster Networks (JoinOCI-SDE)

Oracle Corporation

Seattle, Washington, USA

Full-time

Job Description Cloud Engineering Infrastructure Development Oracle Cloud Infrastructure (OCI) Cluster Networking team is building an ultra-high performance network required to support AI/ML/HPC workloads. This is your opportunity to join the AI revolution and designing systems which allow customers to scale from tens to thousands of GPU without compromising on performance. This team will be responsible for designing, developing and performance tuning the networking stack required to run dist

HPC Engineer, Machine Learning Infrastructure - US Remote

Hugging Face

Remote

Full-time

Description Here at Hugging Face, we're on a journey to advance good Machine Learning and make it more accessible. Along the way, we contribute to the development of technology for the better. We have built the fastest-growing, open-source, library of pre-trained models in the world. With more than 1 Million+ models and 320K+ stars on GitHub, over 15.000 companies are using HF technology in production, including leading AI organizations such as Google, Elastic, Salesforce, Grammarly and NASA.

Linux Systems Administration

Leidos

Dayton, Ohio, USA

Full-time

Description Leidos is seeking a Lead Systems Administrator to support a team of administrators with standard system administration duties at the Department of Defense (DoD) High Performance Computing (HPC) Modernization Program (DoD HPCMP) and the U.S. Air Force Research Laboratory (AFRL) DoD Supercomputing Resource Center (DSRC) located at Wright Patterson AFB, OH. . The selected candidate will perform standard System Administration duties as required to maintain smooth operation of multi-user

Master Principal GPU/HPC Cloud Architect (Remote within US)

Oracle Corporation

Remote

Full-time

Job Description Your mission is to work with Oracle's largest customers/partners on migration/net new strategies to move their intellectual property software and to develop their next-generation offerings on the Oracle Cloud. You will work with internal business development strategists, and several other internal Oracle service delivery teams, including Product Management and Development to craft highly scalable, flexible and resilient cloud architecture blueprints that address customer busines

Master Principal GPU/HPC Cloud Architect (Remote within US)

Oracle Corporation

Remote

Full-time

Job Description Your mission is to work with Oracle's largest customers/partners on migration/net new strategies to move their intellectual property software and to develop their next-generation offerings on the Oracle Cloud. You will work with internal business development strategists, and several other internal Oracle service delivery teams, including Product Management and Development to craft highly scalable, flexible and resilient cloud architecture blueprints that address customer busines

Senior Director of Engineering - AI/ML Network Software (JoinOCI-Leader)

Oracle Corporation

Santa Clara, California, USA

Full-time

Job Description Oracle Cloud Infrastructure (OCI) Cluster Networking team is building an ultra-high performance network required to support AI/ML/HPC workloads. This is your opportunity to join the AI revolution and design a network which can scale from tens to thousands of GPU without compromising on performance. This team will deliver Network-as-a-Service that handles provisioning, life cycle management, performance tuning, monitoring and security of our customers' cluster network infrastruct

Network Manager - Entry to Experienced Level (Maryland)

National Security Agency

Fort Meade, Maryland, USA

Full-time

Position Summary The Network Manager will play a unique role in enabling NSA to effectively execute its mission, supplying customers with advanced computing resources, and global networking. The Network Manager assists in the planning, designing, managing the configuration, identifying network faults, restoring service after faults occur, and the performance and security of operational networks. DCIPS Disclaimer The National Security Agency (NSA) is part of the DoD Intelligence Community Defense

GPU Compiler Performance Engineer

Qualcomm Technologies

San Diego, California, USA

Full-time

Company:Qualcomm Technologies, Inc. Job Area:Engineering Group, Engineering Group > GPU ASICS Engineering General Summary: Qualcomm's Adreno GPU has been the industry leading mobile graphics solution in today's Android smart phone market worldwide. Our power efficient GPU solution is fundamental to enable the new exciting markets like VR/AR, IoT, AI, drone, autonomous driving etc. GPU compiler is a key component of graphics solution. We invite you to join us to create world class GPU compiler

Senior HPC Architect

General Dynamics Information Technology

Rockville, Maryland, USA

Full-time

GDIT is seeking a Senior HPC Architect to join our Scientific Infrastructure Team, providing High Performance Computing (HPC) services for a large biomedical research community with the National Institute of Allergy and Infectious Diseases (NIAID). Our Scientific Infrastructure Team is responsible for enabling and managing HPC and its associated infrastructure and interconnects across multiple locations, 100's of COTS and open-source scientific applications, and ~40PB of data storage to include

Senior Performance Engineer (Hardware/Software)

AG Consulting Partners

Seattle, Washington, USA

Contract

*We're excited to welcome new team members, and we're specifically focusing on candidates located withing driving distance to Seattle, Washington or Sunnydale, California. Being on-site up to 4 times, a week is mandatory for this engagement* As a Senior Performance Engineer Consultant for AG Consulting Partners, a typical day might include the following: Drive the assessment, benchmarking, debugging and integration of new silicon and server systems.Act as a central figure in aligning teams from

Master Principal HPC AI/ML Technologists

Oracle Corporation

US

Full-time

Job Description Your mission is to work with Oracle's largest customers/partners on migration/net new strategies to move their intellectual property software and to develop their next-generation offerings on the Oracle Cloud. You will work with internal business development strategists, and several other internal Oracle service delivery teams, including Product Management and Development to craft highly scalable, flexible and resilient cloud architecture blueprints that address customer busines

Principal Member of Technical Staff - AI/ML Infrastructure Engineer

Oracle Corporation

US

Full-time

Job Description As an AI/ML Infrastructure Engineer on the GPU Strategic Customers Engineering team, you will play a critical role in designing, implementing, and maintaining the infrastructure that supports our AI and machine learning initiatives. You will work closely with data scientists, software engineers, and IT professionals to ensure that our AI/ML models are deployed efficiently, securely, and at scale. Your expertise will be crucial in optimizing our infrastructure for performance, re

Principal Member of Technical Staff - AI/ML Infrastructure Engineer

Oracle Corporation

US

Full-time

Job Description As an AI/ML Infrastructure Engineer on the GPU Strategic Customers Engineering team, you will play a critical role in designing, implementing, and maintaining the infrastructure that supports our AI and machine learning initiatives. You will work closely with data scientists, software engineers, and IT professionals to ensure that our AI/ML models are deployed efficiently, securely, and at scale. Your expertise will be crucial in optimizing our infrastructure for performance, re

Principal Member of Technical Staff - AI/ML Infrastructure Engineer

Oracle Corporation

US

Full-time

Job Description Job Description: As an AI/ML Infrastructure Engineer on the GPU Strategic Customers Engineering team, you will play a critical role in designing, implementing, and maintaining the infrastructure that supports our AI and machine learning initiatives. You will work closely with data scientists, software engineers, and IT professionals to ensure that our AI/ML models are deployed efficiently, securely, and at scale. Your expertise will be crucial in optimizing our infrastructure fo

Senior Member of Technical Staff - AI/ML Infrastructure Engineer

Oracle Corporation

US

Full-time

Job Description Job Description: As an AI/ML Infrastructure Engineer on the GPU Strategic Customers Engineering team, you will play a critical role in designing, implementing, and maintaining the infrastructure that supports our AI and machine learning initiatives. You will work closely with data scientists, software engineers, and IT professionals to ensure that our AI/ML models are deployed efficiently, securely, and at scale. Your expertise will be crucial in optimizing our infrastructure fo

Google Cloud Platform Networking & Infra Consultant-Remote

Cyber Sphere LLC

Remote

Third Party, Contract

Hi Professional, Hope you are doing well. This is Srikar a Senior Technical Recruiter. I have a position for Google Cloud Platform Networking & Infra Consultant, please review the job details below and let me know if you're interested. Title Google Cloud Platform Networking & Infra Consultant Location- Remote Duration Contract JD: Job Title: Oracle Cloud HCM Solutions Architect - Payroll, OTL (Oracle Time and Labor) Must Have Skill Requirements: Expertise in GPU configuration and troubleshoo

TS/SCI HPC Systems Engineer

ClearBridge Technology Group

Arlington, Virginia, USA

Contract

Our client, located in Arlington, VA, is currently in need of a TS/SCI cleared HPC Systems Engineer for a 3 month contract. The consultant will work onsite in support of HPC hardware configuration, management and maintenance. The consultant will primarily be focused on the underlying hardware vs the software it runs. Responsibilities: HPC hardware configuration, management and maintenance SID documentation Troubleshooting issues Addressing hardware failures Addressing tickets Validation of har

HPC Field Delivery Resident- TC IV

Apex Systems

Piscataway, New Jersey, USA

Full-time

Job#: 2012178 Job Description: Description: Role: HPC Field Delivery Resident- TC IV Location: Piscataway, NJ Duration: 12 months ONSITE REQUIREMENT in Piscataway NJ, must be willing to work an on call schedule Job Description: Responsibilities: Responsible for verifying and implementing the detailed technical design solution to the problem as identified by the Project/Technical Manager. Often responsible for providing a detailed technical design for enterprise solutions. Is often the Principal

HPC Systems Admin TS/SCl

Apex Systems

Monterey, California, USA

Full-time

Job#: 2001322 Job Description: Apex Systems is looking for a HPC Systems Administrator to support one of our Government Integration clients in the Monterey, CA area. If you meet the qualifications below and are interested, please send an updated resume to Victoria at . Please keep in mind that a Top Secret clearance is a HARD REQUIREMENT! Title: HPC Systems Administrator Location: Monterey, CA (Onsite Daily M-F) Clearance: TS/SCI Contract Length: 12 months renewed annually for up to 5 years The