Product Quality Engineer RMA Failure Analysis, Python & Hardware Debugging - 68651
We have an immediate need for a Product Quality Engineer RMA Failure Analysis for an exciting opportunity supporting projects in the Santa Clara, CA USA. This role is focused on advanced server hardware diagnostics, failure analysis, and automation development within high-performance computing and enterprise hardware environments. The selected candidate will work on cutting-edge GPU/CPU server platforms while collaborating with engineering and quality teams to troubleshoot complex hardware and software issues in fast-paced production and lab environments.
The ideal candidate will be responsible for supporting Return Material Authorization (RMA) Failure Analysis activities with a strong blend of Python development and hardware debugging expertise. The role requires approximately 65% Python-based automation and software troubleshooting combined with 35% hardware failure analysis and diagnostics. Candidates will work extensively within Linux environments performing command-line troubleshooting, log analysis, remote debugging, scripting, and automation tasks. Responsibilities include writing Python scripts to automate manual operations, analyze logs, debug software behavior, and support data collection activities across server systems. The selected professional will also perform board-level troubleshooting, probe PCB components, validate electrical and power signals, and diagnose complex hardware failures using oscilloscopes, logic analyzers, and multimeters. Strong understanding of server architectures including GPU/CPU systems, memory subsystems, and liquid cooling systems is essential for success in this role.
Required skill sets include strong expertise in Linux programming, Python script development, server board or motherboard debugging, and hardware failure analysis. Candidates should possess hands-on experience with Linux command-line operations, SSH, IPMI, scripting automation, system troubleshooting, and remote debugging protocols. Knowledge of hardware communication interfaces including I2C, UART, JTAG, SPI, and PCIe is highly preferred. Experience using laboratory diagnostic tools such as oscilloscopes, logic analyzers, and multimeters along with understanding of PCB-level debugging and enterprise server architectures is essential. Bash scripting knowledge will be an added advantage. Candidates who can directly work on our payroll are encouraged to apply.
Interested candidates can connect with our recruiter at
Pankaj
PRIMUS Global Services
Phone: Desk Ext. 214
Email: