Apply Now

AI Infrastructure Engineer

San Jose, CA, US • Posted 13 hours ago • Updated 13 hours ago

Contract W2

Contract Independent

Contract Corp To Corp

12 Months

No Travel Required

On-site

Depends on Experience

Fitment

Dice Job Match Score™

🔢 Crunching numbers...

Job Details

Skills

(Prometheus OR Grafana OR ELK OR Stack OpenTelemetry)
(Terraform OR Nutanix Calm)
GPU Infrastructure
Kubernetes
Nutanix

Summary

Job Title: AI Infrastructure Engineer (Nutanix AI Platform)

Location: San Jose, CA (Hybrid/Onsite Preferred)

Duration: 12 months+ Contract

Experience: 10+ Years
Rate: Negotiable

Position Overview

We are seeking a highly experienced AI Infrastructure Engineer to architect, deploy and optimize enterprise-scale AI infrastructure solutions leveraging the Nutanix ecosystem. The ideal candidate will have deep expertise in Nutanix Cloud Infrastructure (NCI), AOS/AHV, Nutanix Kubernetes Platform (NKP), GPU-accelerated computing and hybrid cloud environments.

This role focuses on building scalable, high-performance infrastructure that supports Large Language Models (LLMs), Generative AI workloads, AI training and AI inference platforms across on-premises and cloud environments.

The selected candidate will serve as a Subject Matter Expert (SME) for Nutanix AI infrastructure and will work closely with architecture, cloud, platform, networking and security teams.

Required Skills (Must Have)

Nutanix Platform Expertise

Strong hands-on experience with:
- Nutanix Cloud Infrastructure (NCI)
- Nutanix AOS (Acropolis Operating System)
- Nutanix AHV (Acropolis Hypervisor)
- Nutanix Cloud Manager (NCM)
- Nutanix Flow
- Nutanix Objects and Files

Kubernetes & Container Platforms

Extensive experience with:
- Nutanix Kubernetes Platform (NKP)
- Kubernetes cluster deployment and administration
- Container orchestration and workload management
- AI/ML workload deployment in Kubernetes environments

GPU & AI Infrastructure

Experience designing and managing GPU-enabled environments
Hands-on experience with:
- NVIDIA GPU ecosystem (A100, H100, CUDA, GPU Passthrough, vGPU)
- AMD GPU ecosystem
Experience supporting AI model training and inference workloads

Infrastructure Automation

Terraform
Infrastructure as Code (IaC)
Nutanix Calm
Automated provisioning and lifecycle management

Monitoring & Observability

Prometheus
Grafana
ELK Stack
OpenTelemetry
Monitoring, logging, alerting and performance tuning

Key Responsibilities

AI Infrastructure Architecture

Design and implement scalable AI infrastructure platforms using Nutanix technologies.
Build optimized environments supporting Generative AI, LLM training and inference workloads.
Design high-performance compute, storage and networking architectures for AI applications.

Hybrid Cloud & Multicloud Solutions

Architect hybrid cloud solutions leveraging Nutanix Cloud Clusters (NC2).
Enable seamless workload portability between on-premises environments and public cloud platforms.
Support cloud bursting and dynamic workload scaling.

Kubernetes Platform Engineering

Deploy, manage and optimize AI workloads on Nutanix Kubernetes Platform (NKP).
Design highly available and resilient containerized environments.
Implement workload automation and orchestration best practices.

Storage & Data Services

Design high-performance storage solutions using Nutanix Objects and Nutanix Files.
Optimize storage architectures for AI/ML datasets and model repositories.
Ensure data availability, scalability and performance.

Security & Networking

Implement Zero-Trust security principles.
Utilize Nutanix Flow for micro-segmentation and workload security.
Collaborate with security and networking teams to protect sensitive AI data.

Performance Optimization

Optimize GPU utilization and AI infrastructure performance.
Configure GPU Passthrough and vGPU environments.
Improve resource efficiency, scalability and operational costs.

Observability & Reliability

Establish enterprise monitoring, logging and alerting frameworks.
Ensure high availability, disaster recovery and fault tolerance.
Perform root cause analysis and capacity planning.

Preferred Qualifications

Experience supporting AI/ML platforms, LLMs and Generative AI initiatives.
Experience with AI model serving frameworks and inference platforms.
Knowledge of MLOps, AI platform engineering and data pipelines.
Experience working in enterprise-scale hybrid cloud environments.
Nutanix certifications highly preferred.

Required Experience

10+ years of Infrastructure, Cloud, Platform Engineering, or Architecture experience.
Strong enterprise-level Nutanix experience.
Experience supporting Kubernetes-based production environments.
Experience with GPU-enabled infrastructure and AI workloads.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 90773860
Position Id: 3129-4981-
Posted 13 hours ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Nutanix AI Infrastructure Engineer

San Jose, California

•

2d ago

Onsite Position Role Description: Architect and build custom Artificial Intelligence (AI) infrastructure solutions leveraging the Nutanix Kubernetes Platform and Nutanix AI. You will be responsible for designing high-performance computational stacks that integrate Nutanix AI, high-speed software-defined storage, and GPU-accelerated nodes. Your mission is to make AI infrastructure "invisible" by optimizing for performance, power consumption, and seamless hybrid-multicloud scalability across on-pr

Easy Apply

Contract, Third Party

$55 - $60

Infrastructure Architect

San Jose, California

•

27d ago

Role Overview We are looking for a Principal Infrastructure Architect to join our IT PMO organization to take responsibility and lead the design, orchestration, and lifecycle management of our next-generation GPU Farm and AI Factory environments. This role is unique in its breadth, requiring a deep understanding of high-performance AI compute stacks alongside the disciplined management of physical data center assets and their long-term operational health. You will bridge the gap between R&D engi

Easy Apply

Contract

$70 - $77

AI Engineer

Hybrid in Santa Clara, California

•

Yesterday

Role: AI Engineer Duration: 6+ months Location: Bay Area, CA (hybrid) Technology: Exp in Google Cloud Platform (Gemini, Vertex) Responsibilities: Design and implement end-to-end ML pipelines on Google Cloud Platform (Google Cloud Platform)Build, fine-tune, and optimize AI/ML models for production deployment using Vertex AI and Gemini modelsDevelop Generative AI solutions leveraging Gemini APIs, prompt engineering, Retrieval-Augmented Generation (RAG), and multimodal AI capabilitiesDevelop and

Easy Apply

Contract

Depends on Experience

Cloud Engineer - Senior GenAI / Agentic Lead

Santa Clara, California

•

Today

Hiring: Cloud Engineer - Senior GenAI / Agentic Lead Preferred: Local Candidates Location: Santa Clara, CA Work Mode: Onsite from Day 1 Experience: 10+ years We are looking for a Senior Cloud Engineer with strong expertise in Generative AI, Agentic Ecosystems, Copilot Studio, and multi-cloud platforms including Azure, AWS, and Google Cloud Platform. Key Skills: Generative AI / LLM Applications Agentic Ecosystem & Multi-Agent Workflows Azure AI Foundry & Azure OpenAI Microsoft Copilot Studio

Easy Apply

Third Party, Contract

Search all similar jobs

AI Infrastructure Engineer

Dice Job Match Score™

Job Details

Skills

Summary

Location: San Jose, CA (Hybrid/Onsite Preferred)

Duration: 12 months+ Contract

Experience: 10+ Years Rate: Negotiable

Position Overview

Required Skills (Must Have)

Nutanix Platform Expertise

Kubernetes & Container Platforms

GPU & AI Infrastructure

Infrastructure Automation

Monitoring & Observability

Key Responsibilities

AI Infrastructure Architecture

Hybrid Cloud & Multicloud Solutions

Kubernetes Platform Engineering

Storage & Data Services

Security & Networking

Performance Optimization

Observability & Reliability

Preferred Qualifications

Required Experience

Similar Jobs

Experience: 10+ Years
Rate: Negotiable