Senior MLOps / LLMOps Engineer

Hybrid in Jersey City, NJ, US • Posted 1 day ago • Updated 1 day ago
Full Time
Hybrid
$70 - $80/hr

Job Details

Skills

  • OpenShift
  • TensorRT
  • DevOps
  • MLOps
  • LLMOps

Summary

Job Title: Senior MLOps / LLMOps Engineer (Kubernetes & AI Inference Platforms)

Duration: 2 Months

Location: New Jersey

Job Summary

We are seeking a highly skilled Senior MLOps / LLMOps Engineer to design, deploy, and support enterprise-scale AI/LLM platforms in production environments. The ideal candidate will have strong experience with Kubernetes/OpenShift, NVIDIA TensorRT-LLM, Triton Inference Server, and scalable AI infrastructure. This role focuses on building reliable, secure, and high-performance inference platforms for mission-critical AI applications.

Key Responsibilities

  • Deploy, manage, and troubleshoot containerized AI/LLM applications on Kubernetes/OpenShift platforms.
  • Configure, optimize, and support LLM inference workloads using NVIDIA TensorRT-LLM and Triton Inference Server.
  • Design and maintain scalable MLOps/LLMOps and container deployment pipelines.
  • Build CI/CD workflows for AI models, containers, and infrastructure deployments.
  • Package and deploy AI models across UAT, test, and production environments.
  • Monitor platform performance, GPU utilization, availability, and operational health.
  • Implement logging, alerting, monitoring, and automated operational support processes.
  • Troubleshoot model deployment, scaling, networking, and load balancing issues.
  • Support model optimization techniques including quantization, pruning, and performance tuning.
  • Create operational runbooks, deployment procedures, health checks, and support documentation.
  • Support backup, restore, disaster recovery, failover, and business continuity planning.
  • Ensure platform security, RBAC, compliance, and governance standards are maintained.
  • Collaborate with AI, infrastructure, DevOps, and operations teams to deliver scalable AI solutions.
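As a minimal sketch of the monitoring responsibility above (not part of the posting itself): Triton Inference Server exposes Prometheus-format metrics, including GPU utilization gauges, on an HTTP `/metrics` endpoint (port 8002 by default). The sample text and parser below are illustrative assumptions, not output captured from a real server.

```python
# Hypothetical sketch: parse Prometheus-format metrics text such as Triton
# Inference Server exposes on its /metrics endpoint. The sample below is
# illustrative; in production you would scrape the endpoint with Prometheus.

def parse_metrics(text: str) -> dict:
    """Return {metric_name: value} for simple gauge lines, ignoring labels.

    Note: with multiple GPUs, later samples of the same metric name
    overwrite earlier ones -- fine for a sketch, not for real monitoring.
    """
    metrics = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comment lines
        name_part, _, value = line.rpartition(" ")
        name = name_part.split("{", 1)[0]  # drop any {label="..."} block
        try:
            metrics[name] = float(value)
        except ValueError:
            continue  # skip lines that are not "name value" samples
    return metrics

# Illustrative metrics text (made up for this example):
sample = """\
# HELP nv_gpu_utilization GPU utilization rate
# TYPE nv_gpu_utilization gauge
nv_gpu_utilization{gpu_uuid="GPU-0"} 0.62
nv_gpu_memory_used_bytes{gpu_uuid="GPU-0"} 8589934592
"""

metrics = parse_metrics(sample)
print(metrics["nv_gpu_utilization"])  # prints 0.62
```

In practice the scrape-and-alert loop belongs in Prometheus/Grafana rather than hand-rolled code; the sketch only shows what the exported data looks like.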

Required Qualifications

  • 5+ years of experience in Kubernetes/OpenShift administration and containerized environments.
  • Strong hands-on experience with NVIDIA TensorRT-LLM and Triton Inference Server.
  • Experience deploying and supporting LLM/AI inference services in production.
  • Strong knowledge of Docker, microservices, and API-based architectures.
  • Experience building and supporting MLOps/LLMOps pipelines and CI/CD workflows.
  • Expertise in monitoring, logging, and troubleshooting distributed systems.
  • Experience with NVIDIA GPU infrastructure and AI workload optimization.
  • Understanding of incident management, change management, and operational best practices.
  • Strong problem-solving, communication, and collaboration skills.

Preferred Qualifications

  • Experience with OpenShift AI and enterprise AI platforms.
  • Knowledge of model optimization and inference acceleration techniques.
  • Experience with cloud platforms such as AWS, Azure, or Google Cloud Platform.
  • Familiarity with Infrastructure as Code (Terraform, Ansible, Helm, etc.).
  • Kubernetes/OpenShift or cloud certifications are a plus.
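To make the "model optimization and inference acceleration" qualification concrete: the core idea behind int8 quantization can be sketched in a few lines of plain Python. This is a toy per-tensor symmetric scheme for illustration only; real LLM deployments use TensorRT-LLM or framework tooling, not hand-written code like this.

```python
# Toy illustration of symmetric int8 post-training quantization -- the idea
# behind the quantization work mentioned in this posting. Per-tensor scale,
# no calibration; real pipelines use TensorRT-LLM or similar tooling.

def quantize_int8(values):
    """Map floats to int8 range [-127, 127] with a single per-tensor scale."""
    scale = max(abs(v) for v in values) / 127.0
    q = [max(-127, min(127, round(v / scale))) for v in values]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float values from quantized integers."""
    return [x * scale for x in q]

weights = [0.5, -1.27, 0.03, 1.0]   # made-up example weights
q, scale = quantize_int8(weights)
approx = dequantize(q, scale)

# Round-trip error is bounded by half the quantization step per element.
assert all(abs(a - w) <= scale / 2 + 1e-9 for a, w in zip(approx, weights))
```

The trade-off this sketch hides is accuracy: the 8-bit grid costs up to half a step of error per weight, which is why production quantization pairs the arithmetic with calibration data and per-channel scales.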
  • Dice Id: 10336460
  • Position Id: 8967577