Charlotte, North Carolina
•
Today
Job Title: Cloud Infrastructure Engineer Location: Charlotte, NC (5 Days onsite) Duration: 12+ months Primary Skills vLLM TensorRT-LLM Triton Inference Server SGLang Kubernetes ML Serving KServe OpenShift AI GPU Orchestration Google Cloud Platform Terraform Key Responsibilities Design and manage scalable AI/ML infrastructure for GenAI and LLM workloads. Deploy and optimize LLM inference pipelines using vLLM, TensorRT-LLM, Triton Inference Server, and SGLang. Implement inference optimization
Easy Apply
Third Party, Contract
$$45/hr - $50/hr




