HPC & GPFS DevOps Engineer

Overview

On Site
170k - 250k
Full Time

Skills

High Performance Computing
Storage Management
Lifecycle Management
Google Cloud Platform
Google Cloud
Cloud Computing
Management
Mentorship
DevOps
IBM GPFS
Python
Scripting
Backup
Disaster Recovery
Storage
Artificial Intelligence
HPC

Job Details

Job Title: Senior DevOps HPC Engineer

Location: Hybrid -Salt Lake City, UT - 2-3 days a week

Overview: We're seeking a Senior DevOps HPC Engineer to design and manage high-performance computing (HPC) infrastructure, focusing on storage and AI model deployment. This role requires expertise in HPC, GPFS, and scientific workloads, with a strong emphasis on storage management and disaster recovery.

Responsibilities:
  • Design and maintain HPC infrastructure and storage solutions, with expertise in GPFS.
  • Implement backup, disaster recovery, and lifecycle management policies.
  • Work in a Google Cloud Platform environment to build scalable cloud-based HPC solutions.
  • Develop tools for monitoring and managing HPC systems.
  • Provide mentorship and troubleshoot complex infrastructure and storage issues.
  • Apply DevOps practices, focusing on HPC and storage needs.

Requirements:
  • Senior-level experience with HPC and GPFS.
  • Proficiency in Python for automation and scripting.
  • Experience in backup policies and disaster recovery for storage.
  • Familiarity with AI model deployment on HPC systems.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About Motion Recruitment Partners, LLC