Job Role: AI Engineer
Location: On-site (Bay Area)
Full-time opportunity
Role Overview
We're seeking a hands-on AI Engineer who builds production-ready AI systems, not research prototypes. You'll optimize our AI ingestion pipeline for more accurate, responsive agentic behavior, deploy high-performance models on GPU infrastructure using our Trident architecture, and maintain robust MLOps workflows from training through production deployment. This is for engineers who ship code, not just notebooks.
Key Responsibilities
Enhance AI Pipeline Accuracy: Improve our data ingestion and processing pipeline to deliver more accurate responses and more sophisticated agentic behaviors in production applications.
GPU-Optimized Model Deployment: Deploy and optimize AI models on high-performance GPU infrastructure using our Trident architecture, ensuring efficient training, inference, and scaling.
Production MLOps: Build and maintain end-to-end MLOps pipelines including RAG systems, model distillation, fine-tuning workflows, training orchestration, and production inference deployment.
Data Model Engineering: Design and implement robust data models and processing workflows that power our AI persona capabilities.
Infrastructure & DevOps: Create production-grade CI/CD pipelines, containerization (Docker), comprehensive logging systems, and monitoring for AI model performance.
Real Production Deployment: Take AI systems from development through production deployment, focusing on reliability, performance, and operational excellence.
Required Technical Skills
Core Programming (Non-negotiable):
Python (primary language for AI/ML work)
Strong proficiency in C++, Java, or C# for performance-critical components
Data modeling and processing at production scale
AI/ML Production Stack:
RAG Pipeline development and optimization
MLOps workflows: training, inference, model lifecycle management
Model distillation and fine-tuning techniques for production deployment
Experience deploying models to GPU infrastructure (Trident or similar architectures)
Production Engineering:
CI/CD pipeline creation and management
Docker containerization and microservices architecture
Production logging, monitoring, and observability
Experience scaling AI systems in real production environments
What We're Looking For
3-5 years of production AI/ML engineering experience
Engineers from mid-sized companies who have successfully deployed AI systems at scale
Proven track record of building, deploying, and maintaining ML systems in production
Experience optimizing AI systems for performance, cost, and reliability
Strong system design and architecture skills for scalable AI applications
Sample Projects You'll Own
Optimize our RAG pipeline for improved accuracy and response quality
Deploy and scale transformer models on our Trident GPU architecture
Build MLOps workflows for continuous model training and deployment
Design data processing systems for multi-modal AI persona training
Create monitoring and alerting systems for production AI model performance