GPU Infrastructure Engineer (PowerEdge)

• Posted 1 hour ago • Updated 1 hour ago
Full Time
Fitment

Dice Job Match Score™

📋 Comparing job requirements...

Job Details

Skills

  • Linux administration
  • Infrastructure Automation
  • Distributed Systems
  • Cluster Management
  • Parallel Computing
  • Scripting (Python
  • Performance Tuning & Optimization
  • Bash)
  • GPU Infrastructure Engineer
  • NVIDIA GPUs
  • GPU Cluster Deployment
  • High-Performance Computing
  • AI / GenAI Infrastructure
  • Dell PowerEdge Servers
  • PowerEdge Rack / Tower
  • PowerEdge XE Servers
  • iDRAC Management
  • Server Hardware Configuration
  • NVIDIA Base Command Manager
  • NVIDIA DGX / GPU Systems
  • NVIDIA Certifications (NCA
  • NCE)
  • NVIDIA UFM (Unified Fabric Manager)
  • NVIDIA Spectrum-X
  • NVIDIA BlueField (DPU)
  • InfiniBand
  • RoCE (RDMA over Converged Ethernet)
  • High-speed Networking
  • GPU Cluster Networking
  • Low-latency Networking
  • HPL (High-Performance Linpack)
  • STREAM Benchmark
  • NCCL / RCCL
  • OSU Microbenchmarks
  • Red Hat Enterprise Linux (RHEL)
  • RHCSA / RHCE
  • Redfish API
  • Firmware Upgrades & Lifecycle Management
  • System Monitoring & Management

Summary

Solution IT Inc. is looking for GPU Infrastructure Engineer (PowerEdge) for one of its clients in Childress, TX

Job Title: GPU Infrastructure Engineer (PowerEdge)

Required Skills

  • Must have skills = PowerEdge Rack/Tower Experience, NVIDIA certifications
  • Nice to have skills - PowerEdge XE server experience NVIDIA QR Switches
  • Deep hands-on experience with GPU deployment, configuration, and multi-node testing using NVIDIA Base Command Manager
  • Proficiency with benchmarking tools: HPL, STREAM, NCCL, RCCL, MxP, OSU Microbenchmarks
  • Red Hat certification (RHCSA/RHCE) or 7+ years of relevant RH distros experience
  • Experience with GenAI/HPC networking (InfiniBand and/or RoCE)
  • Experience working in Linux based parallel computing environments at scale
  • Strong customer facing and communication skills

Desirable Requirements

  • Bachelor's degree
  • NVIDIA certifications (NCA, NCE, DGX)
  • Experience with NVIDIA UFM, Infiniband, and SpectrumX fabrics
  • Exposure to hybrid cloud or GPU cloud environments
  • Experience with GPU observability/performance profiling tools
  • Code Upgrade
  • Perform cluster-level code upgrades as per approved versions and compatibility guidelines.
  • iDRAC Management
  • Configuration, access validation, and health checks of iDRAC.
  • Troubleshooting and lifecycle management support.
  • Firmware Updates
  • Update server, BIOS, NIC, storage, and related firmware.
  • Ensure version alignment and post-update validation.
  • Redfish
  • Overview and usage of Redfish APIs.
  • Customization and automation using Redfish for system management and monitoring.
  • BlueField
  • Configuration and management of BlueField DPUs.

Work Site: Childress, TX / Onsite

Duration: 10+ months

Expected Start Date: Immediate

Number of Positions: 1

Please send your responses back to

About Solution IT

Solution IT is a national IT consulting company specializing in: Technology Staffing and Oracle E-Business Solutions based in Boston, Massachusetts.

Thanks
Recruiting Team

SOLUTION IT INC
Work: / Extn 155 / 146

URL:
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10290916
  • Position Id: 2026-12658
  • Posted 1 hour ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Childress, Texas

Today

Easy Apply

Contract, Third Party

$DOE

Bethesda, Maryland

Today

Full-time

USD 151,903.55 - 172,157.35 per year

Childress, Texas

Today

Easy Apply

Full-time

Bethesda, Maryland

Today

Full-time

USD 187,181.44 - 212,138.97 per year

Search all similar jobs