Apply Now

Senior MLOps Engineer

Cupertino, CA, US • Posted 30+ days ago • Updated 1 hour ago

Contract Corp To Corp

Contract W2

On-site

Fitment

Dice Job Match Score™

👾 Reticulating splines...

Job Details

Skills

AI
LLM
MLOps

Summary

Hello,

My name is Sreeja and I represent TestingXperts Inc. TestingXperts is a Specialist QA & Software Testing Company, and an Independent Software Testing division of Damco Group, which is a leading IT Solutions and Services company working with Fortune Enterprises globally. Inheriting the virtues of job quality and optimal user satisfaction from Damco Group, TestingXperts aims at promoting the ethics of connected innovation, thereby seeding the integral values in our employees and achieving unmatched contentment in our clients. To know more about Testingxperts Inc., please visit our website .

If you are interested in the opportunity listed below, please forward your updated resume along with current contact information, or perhaps you can recommend someone who would be interested in this position

Role : Senior MLOps Engineer
Location : Cupertino, CA/ Austin, TX Onsite Mandatory

Visa Type : / GC

Objective:
Build intelligent, data-driven platform. The focus is to support the development of next-generation test analytics and test agents that enable faster insights, improved diagnostics, and scalable infrastructure for Generative AI systems connecting test stations, line level data and pipelines . You will build automated evaluation tools, and conduct rigorous statistical analyses to ensure the reliability of both human and AI-based assessment systems.

Benchmark, adapt, and integrate AI/ML models into existing software systems. Independently run and analyze ML experiments for real improvements.
Must-Have Requirements
Requirement Details
Backend/Systems Experience 3+ years building production backend or distributed systems (pre-AI experience required)
Production AI Systems Has shipped AI/LLM features serving real users at scale - not just prototypes or demos
Agentic Systems Has built AI agents, skills, tools, or MCP (Model Context Protocol) integrations
Python Proficient for backend development
Secondary Language Working knowledge of Go, TypeScript, or Rust
Cloud Infrastructure Deep experience with AWS/Google Cloud Platform/Azure - cost optimization, compute decisions, not just deployment
Container & Orchestration Hands-on with Docker and Kubernetes - can build, deploy, debug, and scale services themselves
LLM Integration Understands token economics, context limits, rate limiting, structured outputs, API failure modes
LLM Evaluation Understands how to evaluate LLM outputs and the inherent challenges (non-determinism, quality measurement, regression detection)
Hands-On Engineer Not just an architect - writes code, debugs production issues, deploys their own work
________________________________________
Preferred / Differentiators
Built multi-step agentic workflows with tool use and function calling
Experience with agent orchestration frameworks (LangGraph, CrewAI, or custom)
Built guardrails, fallbacks, or graceful degradation for AI systems
Streaming inference and async agent orchestration
Cost/latency optimization: caching, batching, prompt compression
ML observability tools: Langfuse, Arize, Braintrust, W&B
Retrieval systems (vector search, hybrid search) - as a tool, not the focus
________________________________________
Screening Questions for Candidates
1. "Describe a production AI agent or skill system you built. What broke and how did you fix it?"
2. "Have you built MCP servers/integrations or custom tool-use systems for LLMs?"
3. "How do you evaluate whether an LLM-based feature is working well? What makes this hard?"
4. "Walk me through how you'd deploy and scale an AI service on Kubernetes."
________________________________________
Not a Fit If
Primarily a model trainer/fine-tuner (we're not training models)
AI experience is mainly academic, research, or tutorial-based
No production systems experience (only notebooks/demos)
Looking for entry-level role with heavy mentorship
Background is primarily data science/analytics rather than engineering
"Architects" who don't write or deploy code themselves

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 10383634
Position Id: 2026-34978
Posted 30+ days ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Applied AI developer

Hybrid in Palo Alto, California

•

4d ago

Applied AI Developer Engineering (Enterprise AI Team) Location: Hybrid (Palo Alto, CA) 3 days onsite, 2 days remote. Role Type: Contract (W2) About Our Client Our client is a fast-growing, publicly traded technology company operating in the cybersecurity and data management space. They provide a modern platform focused on protecting, managing, and securing data across cloud, hybrid, and on-prem environments. Their solutions help enterprise organizations defend against cyber threats, ensure comp

Easy Apply

Contract

Depends on Experience

Senior Software Engineer (Java, Python, LLMs and AI)

Hybrid in San Jose, California

•

8d ago

About the role: We're building out a high-caliber engineering team to support a new platform. We're looking for a Senior Software Engineer who is equally strong in Java and Python, has built and operated microservices on AWS with mature CI/CD pipelines, and has hands-on experience integrating AI/ML capabilities into production systems. This role starts as a contract with a path to full-time conversion based on performance and fit. What you'll do Design, build, and own backend microservices in Ja

Easy Apply

Contract, Third Party

Depends on Experience

AI Developer

Palo Alto, California

•

Yesterday

About the Team The Enterprise AI team is Rubrik''s internal AI enablement engine. We evaluate where AI can make a real difference, build the platforms and patterns that make adoption easy, and help engineering teams across the organization work smarter and faster. We operate at the intersection of applied AI, distributed systems, and enterprise operations our job is to make Rubrik more efficient, one AI-powered workflow at a time. This is a high-impact, high-autonomy team where you''ll work clos

Easy Apply

Contract

Depends on Experience

GenAI Technical Solutions Architect

Cupertino, California

•

2d ago

Job Title: GenAI Technical Solutions Architect Location: Cupertino/Sunnyvale, CA (Onsite) Duration: 6-12+ Months Contract ** W2-Visa-independent candidates required ** Job Summary: What You''ll Do Build & Prototype - Assemble AI-powered workflows using existing enterprise tools, MCPs, and APIs - Write scripts, automations, and glue code to connect systems and fill gaps - Prototype rapidly get a working version in front of users fast, then iterate Evaluate & Advise - Assess technical feasibil

Easy Apply

Contract

Depends on Experience

Search all similar jobs