Charlotte, North Carolina
•
Today
Job Title: Cloud Infrastructure Engineer
Location: Charlotte, NC (5 days onsite)
Duration: 12+ months

Primary Skills: vLLM, TensorRT-LLM, Triton Inference Server, SGLang, Kubernetes ML Serving, KServe, OpenShift AI, GPU Orchestration, Google Cloud Platform, Terraform, Nvidia

Key Responsibilities:
Design and manage scalable AI/ML infrastructure for GenAI and LLM workloads.
Deploy and optimize LLM inference pipelines using vLLM, TensorRT-LLM, Triton Inference Server, and SGLang.
Implement inference optimi
Contract, Third Party
$55/hr - $60/hr




