gpu jobs in charlotte, nc

NVIDIA H200 -- LLM Inference & GPU Systems Consultant

Hybrid in Charlotte, North Carolina

•

4d ago

Role Overview: We are seeking an AI Infrastructure Runtime Engineer to build and maintain large-scale on-prem LLM infrastructure. This is an enterprise private GenAI environment running on NVIDIA H200 GPU clusters and an OpenShift AI deployment ecosystem. You will manage production inference internally, including self-hosting open-source LLMs like Llama. We are focused exclusively on inferencing; this role involves no model training infrastructure or fine-tuning pipelines. Key Responsibilities N

Easy Apply

Contract

70 - 80

Hybrid || LLM Inference & GPU Systems Consultant || Charlotte, NC

Charlotte, North Carolina

•

Today

TECHNOGEN, Inc. is a Proven Leader in providing full IT Services, Software Development and Solutions for 15 years. TECHNOGEN is a Small & Woman Owned Minority Business with GSA Advantage Certification. We have offices in VA; MD & Offshore development centers in India. We have successfully executed 100+ projects for clients ranging from small business and non-profits to Fortune 50 companies and federal, state and local agencies. Description: Local candidates preferred. Role Overview: We are se

Easy Apply

Contract, Third Party

$0,00/-

LLM Inference & GPU Systems Consultant

Charlotte, North Carolina

•

5d ago

Role: LLM Inference & GPU Systems Consultant Location: Charlotte, NC (Locals only) We are seeking an AI Infrastructure Runtime Engineer to build and maintain large-scale on-prem LLM infrastructure. This is an enterprise private GenAI environment running on NVIDIA H200 GPU clusters and an OpenShift AI deployment ecosystem. You will manage production inference internally, including self-hosting open-source LLMs like Llama. We are focused exclusively on inferencing; this role involves no model

Easy Apply

Contract, Third Party

Depends on Experience

Sr. GPU/Algorithm Engineer (CUDA)

Remote

•

23d ago

Title: Sr. GPU / Algorithm Engineer Location: Remote Client based in Guelph, ON area, may need to visit once a month for a couple days (expenses covered) Start date: 2/2 Duration: 6-12 months (more likely towards 12 months) Overtime expectation: As needed (project-based) Drug Screen required: Yes Background Check required: Yes Citizenship Requirement: Open but must have ability to travel to Canada and get across the border Product/Project: Optical Imaging System Summary: Our client is se

Easy Apply

Contract

$70 - $90

Senior Staff GPU Physical Design Engineer

Remote

•

Today

Job Description Title: Senior Staff GPU Physical Design Engineer Location: Austin, TX or San Jose, CA. onsite strongly preferred Need: GPU or high-performance SoC experience. Advanced node (≤5nm). Full block ownership Position Summary We are seeking a Senior Staff GPU Physical Design Engineer for our client to work on the physical implementation of high-performance GPU and system-level IP, driving execution across synthesis, place-and-route, timing closure, and signoff for their advanced tech

Contract

Senior AI Performance Engineer (CUDA / GPU / NVIDIA Stack)

Remote

•

2d ago

We are hiring a Senior AI Performance Engineer to work on large-scale GPU-accelerated AI systems powering real-time Vision AI platforms. This role is focused on hands-on performance optimization across distributed multi-GPU environments, improving latency, throughput, and GPU utilization for production AI workloads. This is NOT a generic ML / DevOps role. We are looking for candidates with deep GPU + NVIDIA ecosystem experience. Key Responsibilities Analyze and optimize AI/ML workloads across mul

Easy Apply

Contract

95 - 100

100% Remote - Network Deployment Architect - Cloud GPU & CPU

Remote

•

16d ago

Role: Network Deployment Architect Role Summary The Hybrid Networking Support Lead is responsible for operational ownership, support readiness, and day-to-day stability of hybrid network connectivity spanning on-premises infrastructure and cloud environments (Cloud GPU, Cloud CPU, and On-Prem). This role ensures secure connectivity, effective segmentation, reliable data transfer performance, and rapid incident resolution across hybrid environments. Key Responsibilities Hybrid Connectivity

Easy Apply

Full-time, Third Party

Depends on Experience

Project Manager 5 - Contingent

Charlotte, North Carolina

•

Today

Description In this contingent resource assignment you may: Consult as an expert to develop or influence initiatives and resources for highly complex business and technical needs across Project Management. Consult on the strategy and resolution of highly complex and unique challenges requiring in-depth evaluation across multiple areas delivering solutions that are long-term large-scale and require vision creativity innovation and advanced analytical and inductive thinking. Provide expertise to c

Easy Apply

Full-time

USD 70.00 - 75.00 per hour

Principal AI Cloud Infrastructure Engineer

Charlotte, North Carolina

•

Today

The position is described below. If you want to apply, click the Apply Now button at the top or bottom of this page. After you click Apply Now and complete your application, you'll be invited to create a profile, which will let you see your application status and any communications. If you already have a profile with us, you can log in to check status. Need Help? If you have a disability and need assistance with the application, you can request a reasonable accommodation. Send an email to Acce

Full-time

Software Engineer L5, Python Platform

Remote

•

Today

At Netflix, our mission is to entertain the world. Together, we are writing the next episode - pushing the boundaries of storytelling, global fandom and making the unimaginable a reality. We are a dream team obsessed with the uncomfortable excitement of discovering what happens when you merge creativity, intuition and cutting-edge technology. Come be a part of what's next. Our application development platform teams enable the underlying technology and best practices for engineering at Netflix.

Full-time

USD 388,000.00 - 619,000.00 per year

Data Center Infrastructure Architect

Remote or West Palm Beach, Florida

•

Today

RESPONSIBILITIES: Kforce has a client in West Palm Beach, FL that is seeking a Data Center Infrastructure Architect. Responsibilities: Architect and Design Network, Server, and GPU Infrastructure: * Architect and design the physical infrastructure to support data center scale deployments of networks, servers, and GPU including rack layouts, components, cable management, power, and cooling requirements * Work with internal teams and vendors to define and develop the hardware deployment strategi

Full-time

$125,000 - $140,000 annually

Staff AI/ML Infrastructure Engineer

Remote or West Palm Beach, Florida

•

Today

RESPONSIBILITIES: Kforce has a client in West Palm Beach, FL that is seeking a Staff AI/ML Infrastructure Engineer. Key Responsibilities: * Design and maintain GPU and bare metal infrastructure in containerized and physical environments * Build scalable GPU clusters in partnership with networking and provisioning teams * Ensure reliable, high-performance provisioning of GPU infrastructure * Develop automated testing systems for GPU-based platforms * Implement infrastructure solutions for divers

Full-time

$145,000 - $160,000 annually

Senior Core Platform Engineer

Remote or Austin, Texas

•

Today

Senior Core Platform Engineer Austin, TX About the Team Our team is at the core of Avride's self-driving stack. We build the base infrastructure layer that powers all autopilot code. It includes C++ framework for implementing autonomy components, execution graph building and optimization systems, as well as runtimes that execute those graphs, both onboard and in simulation. For the onboard runtime, we aim to achieve the best possible performance while operating under strict latency guarantees

Full-time

Site Reliability Engineer II

Remote or Cambridge, England

•

Yesterday

Are you passionate about cutting-edge AI infrastructure? Do you want to build your SRE career on one of the most exciting platforms in cloud computing? Join the Akamai Inference Cloud Team The Akamai Inference Cloud team is part of Akamai's Cloud Technology Group. We design, implement, deploy and operate AI platforms that enable customers to run inference models and developers to create AI applications. Partner with the best In this role, responsibilities will include automation, monitoring

Full-time

USD 95,000.00 - 171,000.00 per year

Senior Site Reliability Engineer

Remote or Cambridge, England

•

Yesterday

Do you enjoy solving complex reliability challenges for cutting-edge technology? Do you have a passion for automation and building systems that scale? Join the Akamai Inference Cloud Team! The Akamai Inference Cloud team is part of Akamai's Cloud Technology Group. We design, implement, deploy and operate AI platforms that enable customers to run inference models and developers to create AI applications with unmatched performance, compliance, and economics. Partner with the best In this role

Full-time

USD 121,400.00 - 218,600.00 per year

ML Platform Engineer

Remote or Austin, Texas

•

Today

About the team The ML Platform team at Avride builds the infrastructure that powers large-scale ML training and data processing for autonomous driving. We sit between Cloud Platform and ML engineers, turning low-level compute, storage, and networking primitives into an ML platform that teams actually use - scalable orchestration, distributed compute, and production-grade tooling for the full model lifecycle. About the role As an ML Platform Engineer at Avride, you'll own critical pieces of th

Full-time

Senior Engineering Manager, Machine Learning Platform

Remote

•

Today

Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest. Machine Learning is central to how Affirm delivers on its mission - powering decisions across underwriting, fraud, servicing, and personalization. We are seeking a Senior Manager to lead our ML Platform engineering organization, the team that builds and operates the critical infrastructure enabling every ML capability at Aff

Full-time

USD 260,000.00 - 310,000.00 per year

Principal Performance Engineer Lead

Remote or Cambridge, England

•

Yesterday

Do you want to push the boundaries of AI inference speed and accuracy at global scale? Are you passionate about optimizing how models perform in production serving environments? Join the Akamai Inference Cloud Team! The Akamai Inference Cloud team is part of Akamai's Cloud Technology Group. We design and operate AI platforms that enable customers to run models with unmatched performance, compliance, and economics. The Model Intelligence & Lifecycle team owns the end-to-end model lifecyc

Full-time

USD 169,300.00 - 304,700.00 per year

C++ Software Engineer, Motion Planning

Remote or Austin, Texas

•

Today

About the Team Our team develops the core software and data processing systems that power motion planning and decision-making in autonomous vehicles. We work at the intersection of machine learning, large-scale data infrastructure, and real-time vehicle control, collaborating across engineering, analytics, and product teams to deliver safe and intelligent driving capabilities. About the Role We are seeking a highly skilled C++ Software Engineer to join our core Motion Planning team. You will b

Full-time

Senior II Software Engineer Lead - Akamai Inference Cloud (Remote)

Remote or Cambridge, England

•

Yesterday

Do you thrive on technical leadership and building cutting-edge AI systems? Are you ready to drive innovation at the intersection of AI and edge computing? Join the Akamai Inference Cloud Team! The Akamai Inference Cloud team is part of Akamai's Cloud Technology Group. We design, implement, deploy and operate AI platforms that enable customers to run inference models and developers to create AI applications with unmatched performance, compliance, and economics. Partner with the best As a Se

Full-time

USD 126,100.00 - 261,900.00 per year
