Overview
On Site
Full Time
Skills
Technical Support
Computer Hardware
Computer Cluster Management
Scalability
InfiniBand
Computer Networking
NFS
Samba
File Systems
GPU
Stacks Blockchain
Workflow
High Performance Computing
FOCUS
Linux Administration
Ansible
Python
Management
Linux
Network
Documentation
MPI
Scripting
HPC
Storage
IT Service Management
Innovation
Collaboration
Recruiting
Insurance
Finance
Professional Development
Training
Leadership
CompTIA
Customer Service
Career Counseling
Apex
Oracle Application Express
Job Details
Job#: 3016156
Job Description:
HPC Consultant (100% Remote)
Apex Systems is seeking a highly skilled HPC Consultant to support large-scale compute engineering operations within an enterprise high-performance computing (HPC) environment. This role focuses on Linux systems, automation, and multi-node cluster administration.
This position is fully remote and open to candidates in any U.S. time zone.
Email resume to Julissa at to apply!
You will support the integration, configuration, automation, and daily operations of HPC compute clusters. Responsibilities include validating system performance, supporting users, troubleshooting issues, and maintaining stability across multi-node environments.
Key Responsibilities
You will join a collaborative engineering team consisting of HPC-focused and storage-focused technical professionals working across large-scale compute infrastructure.
Apex Systems is a world-class IT services company that serves thousands of clients across the globe. When you join Apex, you become part of a team that values innovation, collaboration, and continuous learning. We offer quality career resources, training, certifications, development opportunities, and a comprehensive benefits package. Our commitment to excellence is reflected in many awards, including ClearlyRated's Best of Staffing in Talent Satisfaction in the United States and Great Place to Work in the United Kingdom and Mexico. Apex uses a virtual recruiter as part of the application process. Click for more details.
Apex Benefits Overview: Apex offers a range of supplemental benefits, including medical, dental, vision, life, disability, and other insurance plans that offer an optional layer of financial protection. We offer an ESPP (employee stock purchase program) and a 401K program which allows you to contribute typically within 30 days of starting, with a company match after 12 months of tenure. Apex also offers a HSA (Health Savings Account on the HDHP plan), a SupportLinc Employee Assistance Program (EAP) with up to 8 free counseling sessions, a corporate discount savings program and other discounts. In terms of professional development, Apex hosts an on-demand training program, provides access to certification prep and a library of technical and leadership courses/books/seminars once you have 6+ months of tenure, and certification discounts and other perks to associations that include CompTIA and IIBA. Apex has a dedicated customer service team for our Consultants that can address questions around benefits and other resources, as well as a certified Career Coach. You can access a full list of our benefits, programs, support teams and resources within our 'Welcome Packet' as well, which an Apex team member can provide.
Job Description:
HPC Consultant (100% Remote)
Apex Systems is seeking a highly skilled HPC Consultant to support large-scale compute engineering operations within an enterprise high-performance computing (HPC) environment. This role focuses on Linux systems, automation, and multi-node cluster administration.
This position is fully remote and open to candidates in any U.S. time zone.
Email resume to Julissa at to apply!
You will support the integration, configuration, automation, and daily operations of HPC compute clusters. Responsibilities include validating system performance, supporting users, troubleshooting issues, and maintaining stability across multi-node environments.
Key Responsibilities
- Integrate and validate hardware and software components across HPC clusters
- Provide technical support to end users and application stakeholders
- Troubleshoot and escalate complex hardware, OS, and network issues
- Maintain documentation, runbooks, and automation playbooks
- Manage configurations for Linux OS, schedulers, remote graphics tools, and cluster management systems
- Perform Linux system administration with strong focus on tuning, scalability, and operational reliability
- Support InfiniBand networking (OFED stacks, Subnet Managers, diagnostics)
- Assist with storage troubleshooting (NFS, Samba; parallel filesystem experience optional)
- Support GPU compute environments, drivers, and workload integration
- Build and maintain compilers, modules, and HPC software stacks
- Develop automation using Python, Ansible, and related tools
- Engineer solutions that optimize compute workflows and resource utilization
- Strong High-Performance Computing (HPC) experience - primary focus of the role. (Candidates with strong fundamentals may be trained.)
- Deep Linux Systems Administration expertise in enterprise Linux environments.
- Automation experience using Ansible and Python in Linux environments.
- Experience managing and maintaining cluster environments, including multi-node compute systems.
- Storage experience not required, though familiarity with storage fundamentals is a plus.
- Support users with HPC compute and application issues
- Tune and maintain Linux systems running at scale
- Diagnose and resolve HPC compute, network, and storage issues
- Maintain cluster automation and operational documentation
- Support compilers, MPI libraries, and development toolchains
- Implement automation and scripts to enhance reliability and efficiency
You will join a collaborative engineering team consisting of HPC-focused and storage-focused technical professionals working across large-scale compute infrastructure.
Apex Systems is a world-class IT services company that serves thousands of clients across the globe. When you join Apex, you become part of a team that values innovation, collaboration, and continuous learning. We offer quality career resources, training, certifications, development opportunities, and a comprehensive benefits package. Our commitment to excellence is reflected in many awards, including ClearlyRated's Best of Staffing in Talent Satisfaction in the United States and Great Place to Work in the United Kingdom and Mexico. Apex uses a virtual recruiter as part of the application process. Click for more details.
Apex Benefits Overview: Apex offers a range of supplemental benefits, including medical, dental, vision, life, disability, and other insurance plans that offer an optional layer of financial protection. We offer an ESPP (employee stock purchase program) and a 401K program which allows you to contribute typically within 30 days of starting, with a company match after 12 months of tenure. Apex also offers a HSA (Health Savings Account on the HDHP plan), a SupportLinc Employee Assistance Program (EAP) with up to 8 free counseling sessions, a corporate discount savings program and other discounts. In terms of professional development, Apex hosts an on-demand training program, provides access to certification prep and a library of technical and leadership courses/books/seminars once you have 6+ months of tenure, and certification discounts and other perks to associations that include CompTIA and IIBA. Apex has a dedicated customer service team for our Consultants that can address questions around benefits and other resources, as well as a certified Career Coach. You can access a full list of our benefits, programs, support teams and resources within our 'Welcome Packet' as well, which an Apex team member can provide.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.