Our client is looking for a Data Center GPU Commissioning Engineer in San Jose, CA. The detailed requirements are below.
Job Title: Data Center GPU Commissioning Engineer
Location: San Jose, CA
Job Description
The Data Center GPU Commissioning Engineer is responsible for commissioning, validating, and stabilizing GPU‑based infrastructure in data center environments. This role ensures GPU servers, interconnects, drivers, firmware, and platform software are correctly installed, configured, tested, and production‑ready to support AI, ML, and HPC workloads.
The engineer works closely with Deployment, Network, Platform, and Operations teams to deliver reliable, high‑performance GPU clusters and ensure smooth handover to run operations.
Required Skills & Experience
Technical Skills
- Bachelor's degree in Computer Science or equivalent, with 12+ years of overall IT experience
- Hands‑on experience with GPU‑based servers in data center environments
- Strong understanding of:
  - Linux system administration
  - GPU drivers, firmware, and system tuning
  - Server BIOS, firmware upgrades, and hardware diagnostics
- Familiarity with data center networking concepts and high‑performance interconnects
- Exposure to AI / ML / HPC environments is strongly preferred
Operational Skills
- Strong troubleshooting and root cause analysis skills
- Experience working in structured deployment and commissioning processes
- Ability to follow and improve runbooks and SOPs
Certifications (Preferred)
- OEM server certifications (HPE / Dell / Lenovo or equivalent)
- Linux administration certifications
- GPU / AI platform certifications (nice to have)