ML & GenAI Platform Engineer

San Jose, CA, US • Posted 1 day ago • Updated 1 day ago

Full Time

On-site

$100,000 - $160,000/yr

Job Details

Skills

AI/ML
LLM
RAG
Azure
Python

Summary

Role Overview

We are seeking an experienced ML & Generative AI Platform Engineer with 8-10 years of industry experience, specializing in deploying, scaling, and operating AI/ML and GenAI systems in cloud environments.
This role is focused on production-grade implementation of ML and LLM-powered applications, Retrieval-Augmented Generation (RAG) pipelines, and agentic AI workflows.
The ideal candidate has strong Python engineering skills, deep understanding of AI infrastructure, and hands-on experience delivering enterprise AI systems end-to-end.

Key Responsibilities

Deploy, scale, and operate ML and Generative AI systems in cloud-based production environments (Azure preferred).
Build and manage enterprise-grade RAG applications using embeddings, vector search, and retrieval pipelines.
Implement and operationalize agentic AI workflows with tool use, leveraging frameworks such as LangChain and LangGraph.
Develop reusable infrastructure and orchestration for GenAI systems using Model Context Protocol (MCP) and Agent Development Kit (ADK).
Design and implement model and agent serving architectures, including APIs, batch inference, and real-time workflows.
Establish best practices for observability, monitoring, evaluation, and governance of GenAI pipelines in production.
Integrate AI solutions into business workflows in collaboration with data engineering, application teams, and stakeholders.
Drive adoption of MLOps / LLMOps practices, including CI/CD automation, versioning, testing, and lifecycle management.
Ensure security, compliance, reliability, and cost optimization of AI services deployed at scale.

Required Qualifications

8-10 years of experience in ML Engineering, AI Platform Engineering, or Cloud AI Deployment roles.
Strong proficiency in Python, with experience building production-ready AI/ML services and workflows.
Proven experience deploying and supporting GenAI applications in real-world enterprise environments.
Experience with orchestration frameworks including but not limited to LangChain, LangGraph, and LangSmith.
Strong knowledge of model serving, inference pipelines, monitoring, and observability for AI systems.
Experience working with cloud AI ecosystems (Azure AI, Azure ML, Databricks preferred).
Familiarity with containerization and deployment tools (Docker, Kubernetes, REST APIs).

Preferred Qualifications

Experience with Azure Databricks, Azure ML, Data Lake, Synapse, or related Azure services.
Exposure to vector databases such as Pinecone, Weaviate, FAISS, or Azure Cognitive Search.
Experience deploying agentic AI systems with tool integrations in production.
Familiarity with enterprise governance frameworks for Responsible AI.
Strong understanding of CI/CD pipelines and DevOps practices for AI platforms.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: RTX153458
Position Id: NA

Contact the job poster

Gopala Mallipudi

IT Recruiter @ Nisum
