On-prem Platform Engineer

Brevard, NC, US • Posted 3 days ago • Updated 3 days ago
Contract Independent
Contract W2
Contract Corp To Corp
Able to Sponsor
On-site
Depends on Experience
Fitment

Dice Job Match Score™

⭐ Evaluating experience...

Job Details

Skills

  • Artificial Intelligence
  • Grafana
  • Kubernetes
  • Generative Artificial Intelligence (AI)
  • Machine Learning (ML)
  • GPU
  • CUDA

Summary

Role: On-prem Platform Engineer

Location: Brevard, Charlotte, NC (Onsite)

Tech Skills Needed:

vLLM TensorRT LLM Triton Inference Server SGLang Inference Optimization Continuous Batching Speculative Decoding KV Cache / Prefix Caching FP8 / AWQ / GPTQ Tensor Parallelism Kubernetes ML Serving KServe OpenShift AI Helm / Operators GPU Orchestration Run:AI Performance Benchmarking CUDA / NCCL / MIG Prometheus / Grafana ML Observability

GuideLLM, Locust

Responsibilities:

  • Build, configure, and operate on prem Kubernetes/OpenShift AI platforms for deploying and serving GenAI models and LLM inference workloads.
  • Design and optimize high performance inference stacks using vLLM, TensorRT LLM, Triton Inference Server, SGLang, and advanced techniques (continuous batching, speculative decoding, KV caching).
  • Manage GPU orchestration and capacity using Run:AI, MIG, CUDA/NCCL, and tensor parallelism to maximize utilization and throughput.
  • Deploy and operate Kubernetes ML serving frameworks (KServe, Helm, Operators) for scalable, reliable model serving.
  • Drive inference optimization and benchmarking, leveraging FP8, AWQ, GPTQ, and performance tools such as GuideLLM and Locust.
  • Implement observability and ML monitoring using Prometheus, Grafana, Arize AI, ensuring SLA/SLO compliance for GenAI services.
  • Collaborate with ML and research teams to onboard new models, tune inference performance, and productionize GenAI use cases.

Sunil

Lead Technical Recruiter
Phone:

E-mail:

Linkedin:

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 90962964
  • Position Id: 8966039
  • Posted 3 days ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Greenville, South Carolina

Today

Full-time

USD 113,200.00 - 188,800.00 per year

Spartanburg, South Carolina

Today

Easy Apply

Third Party, Contract

$DOE

Remote

Today

Easy Apply

Contract

Depends on Experience

Remote

3d ago

Easy Apply

Contract, Third Party

$80 - $110

Search all similar jobs