LLM Inference / AI Infrastructure Engineer

Charlotte, NC, US • Posted 1 day ago • Updated 1 day ago
Contract W2
12 Months
On-site
Depends on Experience
Fitment

Dice Job Match Score™

🧠 Analyzing your skills...

Job Details

Skills

  • LLM Inference / AI Infrastructure Engineer

Summary

LLM Inference / AI Infrastructure Engineer
Location: Charlotte, NC
Duration: 9-12 Month

JD:
vLLM TensorRTLLM Triton Inference Server SGLang Inference Optimization Continuous Batching Speculative Decoding KV Cache / Prefix Caching FP8 / AWQ / GPTQ Tensor Parallelism Kubernetes ML Serving KServe OpenShift AI Helm / Operators GPU Orchestration Run:AI Performance Benchmarking CUDA / NCCL / MIG Prometheus / Grafana ML Observability

skills sanity check: HAVE YOU WORKED ON Nvidia H200? If yes, chances are you will know all above skills

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10121431
  • Position Id: 8979669
  • Posted 1 day ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Charlotte, North Carolina

Today

Easy Apply

Contract

55

Charlotte, North Carolina

9d ago

Easy Apply

Third Party, Contract

Depends on Experience

Charlotte, North Carolina

Today

Easy Apply

Contract, Third Party

$0,00/-

Charlotte, North Carolina

Today

Easy Apply

Third Party, Contract

$$55/hr - $60/hr

Search all similar jobs