Apply Now

Cloud Infrastructure Engineer

Charlotte, NC, US • Posted 6 hours ago • Updated 1 hour ago

Contract W2

Contract Corp To Corp

On-site

$45/hr - $50/hr

Key2Source INC

Fitment

Dice Job Match Score™

⭐ Evaluating experience...

Job Details

Skills

Kubernetes
llm
TensorRT-LLM
MLOps platforms

Summary

Job Title: Cloud Infrastructure Engineer

Location: Charlotte, NC (5 Days onsite)

Duration: 12+ months

Primary Skills

vLLM
TensorRT-LLM
Triton Inference Server
SGLang
Kubernetes ML Serving
KServe
OpenShift AI
GPU Orchestration
Google Cloud Platform
Terraform

Key Responsibilities

Design and manage scalable AI/ML infrastructure for GenAI and LLM workloads.
Deploy and optimize LLM inference pipelines using vLLM, TensorRT-LLM, Triton Inference Server, and SGLang.
Implement inference optimization techniques including:

Continuous Batching
Speculative Decoding
KV Cache / Prefix Caching
FP8 / AWQ / GPTQ quantization
Tensor Parallelism

Build and maintain Kubernetes-based ML serving platforms using KServe and OpenShift AI.
Manage GPU orchestration and scheduling using technologies such as Run:AI, CUDA, NCCL, and MIG.
Develop Helm charts, Kubernetes Operators, and platform automation for AI workloads.
Conduct performance benchmarking and optimization for GPU-based inference systems.
Implement monitoring and observability using Prometheus and Grafana.
Collaborate with data science and ML engineering teams to productionize LLM models.
Automate infrastructure provisioning and deployment using Terraform.

Required Qualifications

6+ years of experience in cloud engineering or platform engineering.
Experience with LLMOps/MLOps platforms.
Strong hands-on experience with Kubernetes and containerized AI/ML workloads.
Experience with GPU infrastructure and distributed inference optimization.
Proficiency in Google Cloud Platform cloud services and cloud-native architecture.
Strong scripting/programming skills in Python.
Experience with ML observability and production monitoring tools.
Familiarity with OpenShift AI and enterprise Kubernetes ecosystems.

Preferred Qualifications

Knowledge of GenAI frameworks and RAG architectures.
Exposure to enterprise AI governance and security practices.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 91165889
Position Id: 2026-508
Posted 6 hours ago

Company Info

About Key2Source INC

At Key2Source, we recognize your drive for a competitive edge and are equipped with the expertise and resources to provide the technological advantage you seek. We offer advanced, professional staffing solutions, both permanent and contingent, throughout the United States. Our extensive database of staffing resources is supported by a robust Human Resources management system, ensuring high quality.

To support your success, we continually refine our expertise and invest heavily in the training and development of our team, utilizing the latest technology. Our commitment to excellence is reflected in our near 100% client retention rate across diverse industries such as IT/ITES, retail, telecom, e-commerce, FMCG, logistics, pharmaceuticals, and more. Our dedication to quality and our proven track record establish us as a leader in workforce solutions.

Go to company profile

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Charlotte, North Carolina

•

Today

Job Title: Senior Cloud Platform Engineer (GenAI Platforms) Location: Charlotte, NC (5 Days onsite) Duration: 12+ months Primary Skills Google Cloud Platform Azure Terraform Kubernetes OpenShift (OCP) Platform Engineering Observability SRE / SLOs Python GenAI Platforms Arize AI Claude Cowork HashiCorp Vault Internal Developer Portals LLMs RAG MLOps / LLMOps Key Responsibilities Architect and implement enterprise-scale cloud platforms across Google Cloud Platform and Azure. Build secure landing

Easy Apply

Contract, Third Party

AI Performance & Benchmarking Engineer

Charlotte, North Carolina

•

Today

Job Title: AI Performance & Benchmarking Engineer Location: Charlotte, NC (Onsite) Duration: 12+ Months Job Summary We are seeking an experienced AI Performance & Benchmarking Engineer with strong expertise in LLM performance testing, benchmarking, and infrastructure optimization. The ideal candidate should have hands-on experience with GuideLLM, NVIDIA H200 GPUs, Locust, Kubernetes/OpenShift, and observability tools to evaluate and optimize AI workloads at scale. Required Skills GuideLLM NV

Easy Apply

Third Party, Contract

$$45 - $50/hr C2C

Search all similar jobs