Apply Now

HPC consultant

Fremont, CA, US • Posted 15 hours ago • Updated 15 hours ago

Contract W2

Contract Corp To Corp

6 Months

No Travel Required

On-site

$49 - $54/hr

VDart, Inc.

Fitment

Dice Job Match Score™

📋 Comparing job requirements...

Job Details

Skills

HPC
GPU
Azure

Summary

Role- HPC consultant
Location- Fremont, CA/ Tualatin, OR
Contract

HPC Cluster & Scheduler Management
•   Design, configure, tune, and optimize SLURM partitions, queues, QoS, and scheduling policies to maximize cluster utilization and workload efficiency.
•   Perform in-depth analysis of job scheduling behavior, bottlenecks, and resource contention.
•   Troubleshoot job failures, performance degradation, and scheduler-related issues in production HPC environments.
•   Implement fair-share, backfill, reservations, and policy-driven scheduling as required.
Storage Benchmarking & Procurement Support
•   Lead HPC storage performance benchmarking using industry-standard tools (e.g., IOR, FIO, MDTest, IOzone).
•   Analyze I/O patterns of HPC workloads and map them to appropriate storage architectures (parallel file systems, NVMe, Lustre, Spectrum Scale, etc.).
•   Provide technical input for storage selection and procurement, including performance expectations, sizing, and cost-performance tradeoffs.
•   Collaborate with vendors and internal teams during POCs and performance validation exercises.
HPC Application Build & Optimization
•   Build, install, configure, and maintain HPC applications, compilers, libraries, and scientific software stacks.
•   Optimize application performance using MPI, OpenMP, GPU acceleration (where applicable), and tuned math libraries.
•   Support multiple compiler toolchains (GCC, Intel, LLVM, NVIDIA HPC SDK, etc.).
•   Implement and manage environment modules (Lmod) or similar software management frameworks.
System Performance & Operations
•   Conduct system-level performance tuning across compute, memory, network, and storage layers.
•   Diagnose node-level issues involving CPU, GPU, interconnects (InfiniBand/Ethernet), and OS configurations.
•   Create operational runbooks, performance baselines, and troubleshooting documentation.
•   Support cluster upgrades, expansions, and hardware refresh activities.
Collaboration & Delivery
•   Work closely with application owners, researchers, and infrastructure teams to meet aggressive delivery timelines.
•   Translate workload requirements into practical HPC configurations and optimizations.
•   Provide clear technical guidance and recommendations to leadership and stakeholders.
Required Skills & Experience
Core HPC Skills
•   8–12+ years of hands-on HPC engineering experience in production environments.
•   Strong expertise with SLURM (configuration, tuning, troubleshooting).
•   Solid understanding of Linux systems (RHEL/CentOS/Rocky/Alma preferred).
•   Deep knowledge of HPC storage systems and I/O performance analysis.
•   Proven experience building and optimizing HPC applications and libraries.
Technical Proficiency
•   MPI implementations (Open MPI, MPICH), OpenMP
•   Compilers and toolchains (GCC, Intel, NVIDIA HPC SDK)
•   Performance tools (perf, vtune, nvprof/nsys, IB diagnostics)
•   Environment modules (Lmod), package managers (Spack preferred)
•   Bash/Python scripting for automation and diagnostics

Nice to Have
•   Experience with GPU-based HPC workloads (NVIDIA CUDA, ROCm).
•   Exposure to cloud-based HPC (Azure, AWS, Google Cloud Platform).
•   Familiarity with parallel file systems such as Lustre or IBM Spectrum Scale.
•   Vendor engagement experience for HPC hardware/storage evaluations.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 10330808
Position Id: 97624-5195-
Posted 15 hours ago

Company Info

About VDart, Inc.

VDart, headquartered in Atlanta, GA, is a global leader in digital talent solutions and IT staffing, delivering top technology professionals to businesses worldwide. With a strong presence across North America, Europe and Asia, we specialize in helping organizations navigate complex technology landscapes with the right expertise.

Through a strategic, client-focused approach, we have placed over 20,000 professionals across key industries and advanced technology solutions. Whether placing top talent in cutting-edge roles or providing strategic digital workforce solutions, our network of 4,000 specialists across 13 countries is committed to excellence, agility and impact.

Backed by 18 years of industry experience, we go beyond staffing to build long-term partnerships that accelerate digital transformation and drive sustained growth. Whether you need a technology partner to fuel innovation or specialized workforce solutions to maintain a competitive edge, VDart delivers the right people, skills and mindset to create a lasting impact in a digital-first world.

Go to company profile

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Platform Engineer

Remote

•

9d ago

Platform Engineer Remote Contract The Platform LLM Infrastructure Engineer operates at the intersection of capacity planning GPU capacity optimization quota management model lifecycle management and production reliability This role is critical in ensuring scalable efficient and resilient infrastructure for large language model LLM platforms Key Responsibilities Manage shared GPU resources across regions including handling LLM capacity requests and optimizing utilization Depl

Easy Apply

Contract, Third Party

Depends on Experience

AI Architect

Hybrid in Dallas, Texas

•

Today

Role: AI Architect Location: Charlotte, NC, Dallas, TX, Iselin, NJ (Onsite) Type: ContractWe are seeking a Principal GenAI Architect to serve as a hands-on practitioner and core technical visionary. This is a rare, high-impact role requiring deep expertise in Generative AI, distributed systems, and agentic architectures. You will act as the central design authority for our GenAI capabilities within a matrixed organization, bridging internal platform development, third-party vendor reviews, a

Easy Apply

Contract, Third Party

$86 - $88

Intune Modern Device Management Consultant

Englewood Cliffs, New Jersey

•

Today

Role- Intune Modern Device Management Consultant Location: Englewood cliff, NJ-Onsite JD: Provides technical direction to project teams leads/participate in project planning sessions with clients or IT management. Provides solutions and evaluates the merit of each solution including highly complex issues Demonstrable detailed understanding of architecture principles and methods, technology and standards Strong presentation skills including developing architectural designs, presenta

Easy Apply

Contract, Third Party

$65 - $67

Search all similar jobs