Charlotte, North Carolina
Today
Job Title: LLM Inference & GPU Systems Consultant
Location: Charlotte, NC (local candidates only)
Duration: Long term
Must have: RunAI, LLM inference and GPU systems, vLLM, and TensorRT-LLM.
Required Skills & Experience
Required Qualifications:
8+ years of experience working as an LLM Systems Engineer or AI Infrastructure Runtime Engineer.
8+ years of hands-on experience with NVIDIA H200 clusters and runtime optimization techniques (KV cache, prefill/decode).
Proficiency in OpenShift AI and GPU orchestration tool
Contract
55




