MLOps Engineer

Newark, CA, US • Posted 11 hours ago • Updated 26 minutes ago
Full Time
On-site
Compensation information provided in the description
Fitment

Dice Job Match Score™

👤 Reviewing your profile...

Job Details

Skills

  • Innovation
  • CaliberRM
  • Progress Chef
  • Application Development
  • Management
  • Servers
  • LangSmith
  • Microservices
  • Reasoning
  • Research
  • Algorithms
  • Reporting
  • Software Development Methodology
  • Data Science
  • Testing
  • High Availability
  • Large Language Models (LLMs)
  • Virtual Machines
  • GPU
  • Privacy
  • Regulatory Compliance
  • Artificial Intelligence
  • Workflow
  • Mentorship
  • Computer Science
  • Machine Learning Operations (ML Ops)
  • Cloud Computing
  • Amazon Web Services
  • Microsoft Azure
  • Google Cloud
  • Google Cloud Platform
  • Terraform
  • Docker
  • Orchestration
  • Kubernetes
  • GitLab
  • Continuous Integration
  • Continuous Delivery
  • GitHub
  • Python
  • Rust
  • Scripting
  • Machine Learning (ML)
  • PyTorch
  • TensorFlow
  • scikit-learn
  • DevOps
  • Agile
  • Software Development
  • Problem Solving
  • Conflict Resolution
  • Communication
  • FOCUS
  • Recovery
  • Collaboration
  • Health Care
  • SAP BASIS
  • Training

Summary

If you're ready to be part of our legacy of hope and innovation, we encourage you to take the first step and explore our current job openings. Your best is waiting to be discovered.

Day - 08 Hour (United States of America)

We are seeking a high-caliber Senior AI Platform & ML Ops Engineer to architect the "layered" infrastructure required for autonomous, agentic systems within Stanford Healthcare. In this role, you will be the "Master Chef" of our AI ecosystem, seamlessly folding Expert-Level DevOps (Kubernetes, Terraform, DevOps orchestration) with Agentic Application Development (LangGraph, CrewAI, Tool-calling logic). You won't just manage servers; you will build the robust, full-stack "factory" where multi-agent frameworks interact with healthcare APIs, ensuring every autonomous action is governed by strict ML Ops observability (LangSmith, Arize) and safety guardrails. If you have the "crispy" coding skills to build RAG pipelines in Python and the "rich" architectural depth to deploy scalable microservices, extensive full stack software development expertise, we want you to lead the integration of reasoning-based AI into the future of clinical and business workflow automations.

This is a Stanford Health Care job.

A Brief Overview
The MLOPs Engineer will play an integral role incorporating Artificial Intelligence (AI) within Stanford Health Care. The solutions will impact patient care, medical research, and operational services. This group is tasked to innovate, build, deploy and monitor production grade AI, machine learning (ML) and predictive algorithms into healthcare. The role will partner closely with lead researchers within the AI field and leaders across various clinical specialties and operations.

This role will report to the Infrastructure group and have a dotted line relationship to the Data Science team. The role will be responsible for maintaining cloud-based infrastructure as code repositories, maintaining infrastructure, deployment pipelines and designing the security landscape for the team and objects. The role will set the standards for the full SDLC of projects for the Data Science team.

Locations
Stanford Health Care

What you will do
  • Design, build and maintain scalable and robust infrastructure for AI/ML systems, including cloud-based environments, containerization and orchestration platforms.
  • Develop and implement CI/CD pipelines to automate the deployment, testing and monitoring of AI/ML models and applications.
  • Collaborate with data scientists, data engineers and software engineers to optimize model training, deployment and inference pipelines.
  • Monitor and troubleshoot AI/ML systems to ensure high availability, performance and reliability.
  • Maintain and monitor model training and inference pipelines across multi-cloud tenants especially around Large Language Models (LLMs).
  • Maintain Kubernetes pods, container registry and virtual machine image library and model registry
  • Monitor infrastructure utilization and costs pertaining to model training, inference and GPU utilization
  • Implement best practices for security, data privacy and compliance in AI/ML workflows and infrastructure.
  • Evaluate and integrate new tools, technologies and frameworks to improve the efficiency and effectiveness of our MLOps processes.
  • Mentor and provide technical guidance to junior members of the organization.
  • Stay up-to-date with the latest advancements and trends in MLOps, DevOps and cloud technologies and share them with the team.

Education Qualifications
  • Bachelor's or higher degree in Computer Science, Engineering or a related field

Experience Qualifications
  • Three (3) or more years of directly related experience

Required Knowledge, Skills and Abilities
  • Proven experience as an MLOps Engineer.
  • Strong knowledge of cloud platforms such as AWS, Azure or Google Cloud and experience with infrastructure-as-code tools like Terraform or CloudFormation.
  • Proficiency in containerization technologies such as Docker and container orchestration platforms like Kubernetes.
  • Experience with CI/CD tools such as GitLab CI/CD, Github Actions or CiricleCI.
  • Solid programming skills in languages such as Python, Rust or Go and experience in scripting and automation.
  • Familiarity with machine learning frameworks and libraries such as PyTorch, Tensorflow and scikit-learn.
  • Deep understanding of DevOps principles, agile methodologies and software development lifecycle.
  • Strong problem-solving and trouble shooting skills, with the ability to analyze and resolve complex technical issues.
  • Excellent communication and collaboration skills with the ability to work effectively in cross-functional teams.

Physical Demands and Work Conditions
Blood Borne Pathogens
  • Category III - Tasks that involve NO exposure to blood, body fluids or tissues, and Category I tasks that are not a condition of employment

These principles apply to ALL employees:

SHC Commitment to Providing an Exceptional Patient & Family Experience

Stanford Health Care sets a high standard for delivering value and an exceptional experience for our patients and families. Candidates for employment and existing employees must adopt and execute C-I-CARE standards for all of patients, families and towards each other. C-I-CARE is the foundation of Stanford's patient-experience and represents a framework for patient-centered interactions. Simply put, we do what it takes to enable and empower patients and families to focus on health, healing and recovery.

You will do this by executing against our three experience pillars, from the patient and family's perspective:

  • Know Me: Anticipate my needs and status to deliver effective care
  • Show Me the Way: Guide and prompt my actions to arrive at better outcomes and better health
  • Coordinate for Me: Own the complexity of my care through coordination

Equal Opportunity Employer Stanford Health Care (SHC) strongly values diversity and is committed to equal opportunity and non-discrimination in all of its policies and practices, including the area of employment. Accordingly, SHC does not discriminate against any person on the basis of race, color, sex, sexual orientation or gender identity and/or expression, religion, age, national or ethnic origin, political beliefs, marital status, medical condition, genetic information, veteran status, or disability, or the perception of any of the above. People of all genders, members of all racial and ethnic groups, people with disabilities, and veterans are encouraged to apply. Qualified applicants with criminal convictions will be considered after an individualized assessment of the conviction and the job requirements.

Base Pay Scale: Generally starting at $79.21 - $104.97 per hour

The salary of the finalist selected for this role will be set based on a variety of factors, including but not limited to, internal equity, experience, education, specialty and training. This pay scale is not a promise of a particular wage.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: stanford
  • Position Id: 1064f82713d42fa422c1e0f730a8ec8
  • Posted 11 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Cupertino, California

Today

Full-time

San Jose, California

Today

Full-time

USD 149,100.00 - 215,925.00 per year

Milpitas, California

5d ago

Easy Apply

Contract

Depends on Experience

Mountain View, California

Today

Full-time

USD 180,000.00 - 240,000.00 per year

Search all similar jobs