| On-prem Platform Engineer | | Brevard, Charlotte | Arize AI, Claude Cowork, Google Cloud Platform, Terraform | vLLM, TensorRT-LLM, Triton Inference Server, SGLang, Inference Optimization, Continuous Batching, Speculative Decoding, KV Cache / Prefix Caching, FP8 / AWQ / GPTQ, Tensor Parallelism, Kubernetes ML Serving, KServe, OpenShift AI, Helm / Operators, GPU Orchestration, Run:AI, Performance Benchmarking, CUDA / NCCL / MIG, Prometheus / Grafana, ML Observability, GuideLLM, Locust | Build, configure, and operate on-prem Kubernetes/OpenShift AI platforms for deploying and serving GenAI models and LLM inference workloads. Design and optimize high-performance inference stacks using vLLM, TensorRT-LLM, Triton Inference Server, SGLang, and advanced techniques (continuous batching, speculative decoding, KV caching). Manage GPU orchestration and capacity using Run:AI, MIG, CUDA/NCCL, and tensor parallelism to maximize utilization and throughput. Deploy and operate Kubernetes ML serving frameworks (KServe, Helm, Operators) for scalable, reliable model serving. Drive inference optimization and benchmarking, leveraging FP8, AWQ, and GPTQ quantization along with performance tools such as GuideLLM and Locust. Implement observability and ML monitoring using Prometheus, Grafana, and Arize AI, ensuring SLA/SLO compliance for GenAI services. Collaborate with ML and research teams to onboard new models, tune inference performance, and productionize GenAI use cases. |