Cloud Infrastructure Engineer

Charlotte, NC, US • Posted 6 hours ago • Updated 1 hour ago
Contract W2
Contract Corp To Corp
On-site
$45/hr - $50/hr
Company Branding Image
Fitment

Dice Job Match Score™

⭐ Evaluating experience...

Job Details

Skills

  • Kubernetes
  • llm
  • TensorRT-LLM
  • MLOps platforms

Summary

Job Title: Cloud Infrastructure Engineer

Location: Charlotte, NC (5 Days onsite)

Duration: 12+ months

Primary Skills

  • vLLM
  • TensorRT-LLM
  • Triton Inference Server
  • SGLang
  • Kubernetes ML Serving
  • KServe
  • OpenShift AI
  • GPU Orchestration
  • Google Cloud Platform
  • Terraform

Key Responsibilities

  • Design and manage scalable AI/ML infrastructure for GenAI and LLM workloads.
  • Deploy and optimize LLM inference pipelines using vLLM, TensorRT-LLM, Triton Inference Server, and SGLang.
  • Implement inference optimization techniques including:
  1. Continuous Batching
  2. Speculative Decoding
  3. KV Cache / Prefix Caching
  4. FP8 / AWQ / GPTQ quantization
  5. Tensor Parallelism
  • Build and maintain Kubernetes-based ML serving platforms using KServe and OpenShift AI.
  • Manage GPU orchestration and scheduling using technologies such as Run:AI, CUDA, NCCL, and MIG.
  • Develop Helm charts, Kubernetes Operators, and platform automation for AI workloads.
  • Conduct performance benchmarking and optimization for GPU-based inference systems.
  • Implement monitoring and observability using Prometheus and Grafana.
  • Collaborate with data science and ML engineering teams to productionize LLM models.
  • Automate infrastructure provisioning and deployment using Terraform.

Required Qualifications

  • 6+ years of experience in cloud engineering or platform engineering.
  • Experience with LLMOps/MLOps platforms.
  • Strong hands-on experience with Kubernetes and containerized AI/ML workloads.
  • Experience with GPU infrastructure and distributed inference optimization.
  • Proficiency in Google Cloud Platform cloud services and cloud-native architecture.
  • Strong scripting/programming skills in Python.
  • Experience with ML observability and production monitoring tools.
  • Familiarity with OpenShift AI and enterprise Kubernetes ecosystems.

Preferred Qualifications

  • Knowledge of GenAI frameworks and RAG architectures.
  • Exposure to enterprise AI governance and security practices.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 91165889
  • Position Id: 2026-508
  • Posted 6 hours ago

Company Info

About Key2Source INC

At Key2Source, we recognize your drive for a competitive edge and are equipped with the expertise and resources to provide the technological advantage you seek. We offer advanced, professional staffing solutions, both permanent and contingent, throughout the United States. Our extensive database of staffing resources is supported by a robust Human Resources management system, ensuring high quality.

To support your success, we continually refine our expertise and invest heavily in the training and development of our team, utilizing the latest technology. Our commitment to excellence is reflected in our near 100% client retention rate across diverse industries such as IT/ITES, retail, telecom, e-commerce, FMCG, logistics, pharmaceuticals, and more. Our dedication to quality and our proven track record establish us as a leader in workforce solutions.

About_Company_OneAbout_Company_Two
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Charlotte, North Carolina

Today

Easy Apply

Contract, Third Party

Charlotte, North Carolina

Today

Easy Apply

Third Party, Contract

$$45 - $50/hr C2C

Search all similar jobs