ML & GenAI Platform Engineer

San Jose, CA, US • Posted 1 day ago • Updated 1 day ago
Full Time
On-site
$100,000 - $160,000/yr
Fitment

Dice Job Match Score™

🛠️ Calibrating flux capacitors...

Job Details

Skills

  • AI/ML
  • LLM
  • RAG
  • Azure
  • Pyhton

Summary

Role Overview

  • We are seeking an experienced ML & Generative AI Platform Engineer with 8 10 years of industry experience, specializing in deploying, scaling, and operating AI/ML and GenAI systems in cloud environments.
  • This role is focused on production-grade implementation of ML and LLM-powered applications, Retrieval-Augmented Generation (RAG) pipelines, and agentic AI workflows.
  • The ideal candidate has strong Python engineering skills, deep understanding of AI infrastructure, and hands-on experience delivering enterprise AI systems end-to-end.

Key Responsibilities

  • Deploy, scale, and operate ML and Generative AI systems in cloud-based production environments (Azure preferred).
  • Build and manage enterprise-grade RAG applications using embeddings, vector search, and retrieval pipelines.
  • Implement and operationalize agentic AI workflows with tool use, leveraging frameworks such as LangChain and LangGraph.
  • Develop reusable infrastructure and orchestration for GenAI systems using Model Context Protocol (MCP) and AI Development Kit (ADK).
  • Design and implement model and agent serving architectures, including APIs, batch inference, and real-time workflows.
  • Establish best practices for observability, monitoring, evaluation, and governance of GenAI pipelines in production.
  • Integrate AI solutions into business workflows in collaboration with data engineering, application teams, and stakeholders.
  • Drive adoption of MLOps / LLMOps practices, including CI/CD automation, versioning, testing, and lifecycle management.
  • Ensure security, compliance, reliability, and cost optimization of AI services deployed at scale.

Required Qualifications

  • 8 10 years of experience in ML Engineering, AI Platform Engineering, or Cloud AI Deployment roles.
  • Strong proficiency in Python, with experience building production-ready AI/ML services and workflows.
  • Proven experience deploying and supporting GenAI applications in real-world enterprise environments.
  • Experience with orchestration frameworks including but not limited to LangChain, LangGraph, and LangSmith.
  • Strong knowledge of model serving, inference pipelines, monitoring, and observability for AI systems.
  • Experience working with cloud AI ecosystems (Azure AI, Azure ML, Databricks preferred).
  • Familiarity with containerization and deployment tools (Docker, Kubernetes, REST APIs).

Preferred Qualifications

  • Experience with Azure Databricks, Azure ML, Data Lake, Synapse, or related Azure services.
  • Exposure to vector databases such as Pinecone, Weaviate, FAISS, or Azure Cognitive Search.
  • Experience deploying agentic AI systems with tool integrations in production.
  • Familiarity with enterprise governance frameworks for Responsible AI.

Strong understanding of CI/CD pipelines and DevOps practices for AI platforms

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: RTX153458
  • Position Id: NA
  • Posted 1 day ago
Contact the job poster
Gopala Mallipudi

Gopala Mallipudi

IT Recruiter @ Nisum
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Santa Clara, California

2d ago

Full-time

USD 160,000.00 - 190,000.00 per year

Santa Clara, California

Today

Easy Apply

Third Party, Contract

San Jose, California

3d ago

Easy Apply

Contract, Third Party

Depends on Experience

Palo Alto, California

28d ago

Full-time

Search all similar jobs