LLM Inference & GPU Systems Consultant

Charlotte, NC, US • Posted 8 hours ago • Updated 8 hours ago
Contract Independent
Contract W2
2 Years
Travel Required
Able to Sponsor
On-site
$55/hr
Company Branding Image
Fitment

Dice Job Match Score™

📊 Calculating match score...

Job Details

Skills

  • RunAI
  • LLM
  • Inference
  • GPU
  • vLLM
  • TensorRT-LLM
  • AI
  • Artificial Intelligence
  • NVIDIA H200
  • NVIDIA H200 cluster
  • KV Cache
  • OpenShift AI
  • orchestration

Summary

Job Title: LLM Inference & GPU Systems Consultant
Location: Charlotte-NC  Local candidates only
Duration: Long Term

Must have :   RunAI /LLM Inference & GPU / vLLM and TensorRT-LLM.

Required Skills & Experience
Required Qualifications
8+ years experience working as an LLM Systems Engineer or AI Infrastructure Runtime Engineer.
8+ years hands-on experience with NVIDIA H200 clusters and runtime optimization techniques (KV Cache, prefill/decode).
Proficiency in OpenShift AI and GPU orchestration tools like RunAI.
Strong experience with modern inference frameworks, specifically vLLM and TensorRT-LLM.
Proven track record managing the Hugging Face deployment lifecycle.
Must be onsite at client in Charlotte, NC at least 3 days/week
Inference Serving: Deploy and manage inference engines including vLLM and TensorRT-LLM.
Hardware Utilization: Optimize GPU throughput tuning, batching strategies, and latency optimization. Manage workload orchestration using RunAI and Kubernetes GPU orchestration.
Model Lifecycle Management: Oversee the complete Hugging Face model lifecycle, including model onboarding, deployment, and retirement.
Platform Operations: Operate and maintain the OpenShift AI ecosystem as the primary container platform for GenAI workloads.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 91170837
  • Position Id: 8981635
  • Posted 8 hours ago

Company Info

About TechVirtue LLC

TechVirtue is involved in developing a wide range of solutions in finding the perfect candidate who has a strong knowledge in his/her work and suits the company's work culture. We even provide one-stop solutions ranging from software development and maintenance to expert support and advisory. Our team consists of experts who have several years of experience in staffing, recruitment, and web development. Our dedicated and motivated team makes sure to fulfill all our customers requirements.

About_Company_OneAbout_Company_Two
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

It looks like there aren't any Similar Jobs for this job yet.

Search all similar jobs