Apply Now

Evaluation Reliability SRE

Cupertino, CA, US • Posted 30+ days ago • Updated 5 hours ago

Full Time

On-site

Fitment

Dice Job Match Score™

⏳ Almost there, hang tight...

Job Details

Skills

Privacy
IOS Development
OS X
Artificial Intelligence
Shipping
Backbone.js
System Integration Testing
Operational Excellence
Solaris
Fluency
Log Analysis
Orchestration
Kubernetes
Resource Management
Scheduling
Virtual Machines
Provisioning
Virtualization
Reliability Engineering
Management
Evaluation
Machine Learning (ML)

Summary

Join the team redefining what a deeply personal and integrated assistant can be.

As part of the Siri organization, you will help shape one of the world's most widely used AI assistants, powered by our next-generation of Apple Intelligence, with capabilities like personal context understanding and on-screen awareness, built with privacy from the ground up. Your work will have direct, meaningful impact for users across iOS, iPadOS, macOS, watchOS, and visionOS.

This is a rare opportunity to build at the intersection of cutting-edge AI and human-centered design, shipping technology that is centered around users and their needs.

Description

Siri's quality signal drives every model and product decision before a release ships. But a signal is only as trustworthy as the infrastructure behind it.

The Evaluation Reliability Engineering (ERE) team exists to make that infrastructure bulletproof. Within ERE, Core SRE owns the production backbone: resource management, session orchestration, on-call response, and the observability systems that surface failures before they corrupt evaluation signal. We sit at the intersection of distributed systems, ML evaluation infrastructure, and operational excellence.

This is a senior hands-on role. You share primary on-call as part of a global follow-the-sun rotation, lead incident investigations end-to-end, and set the operational bar the rest of the team works against. You are fluent with agentic coding tools like Claude Code, Cursor, or Copilot, and use them as a force multiplier across runbook authoring, automation, and log analysis.

Minimum Qualifications

5+ years of site reliability, infrastructure, or platform engineering experience with direct on-call ownership in production systems

Hands-on orchestration experience (Kubernetes or equivalent): cluster health, resource management, scheduling, and failure diagnosis at scale

Preferred Qualifications

Experience owning or closely operating a device or VM provisioning pipeline; familiarity with virtualization-layer failure modes is a strong plus

Track record of improving system reliability against measurable outcomes - uptime, MTTR, incident frequency - not just responding to incidents but eliminating their causes

Incident command discipline: able to lead a multi-team incident from declaration to close-out

Depth in at least one of: distributed systems reliability, device management infrastructure, evaluation or ML platform operations

Demonstrated cross-team technical influence; prior experience shaping reliability practices beyond the immediate team

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 90733111
Position Id: ae9de3d2e56ff5b930f737e7139efdbe
Posted 30+ days ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Site Reliability Engineer(SRE)

Sunnyvale, California

•

Today

Job Title: SRE EngineerLocation: We have two locations: Sunnyvale, CA AND San Jose CA(Onsite)You will engage in incident response drills, post-mortems, and root cause analysis sessions to learn from past issues and prevent future ones.Each morning starts with a structured review of overnight alerts and system performance metrics - identifying any anomalies, triaging what needs attention.You will collaborate with your team in a morning stand-up meeting to discuss ongoing projects, recent incident

Easy Apply

Contract

Depends on Experience

Senior Lead Site Reliability Engineer

Palo Alto, California

•

Today

Job Description Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability. As a Senior Lead Site Reliability Engineer at JPMorgan Chase within the Infrastructure Platforms and Foundational Services (IPFS) team, you work with your fellow stakeholders to define non-functional requirements (NFRs) and availability targets for the services in your application and product lines. You

Full-time

USD 171,000.00 - 260,000.00 per year

Job Posting Title AI/ DevOps Engineer

San Jose, California

•

Today

Senior SRE - RTCDP Datastores & AI/ML Ops Adobe's Real-Time Customer Data Platform (RTCDP) powers personalized experiences for some of the world's largest brands. As a Senior SRE on this team, you'll be central to keeping RTCDP reliable, scalable, and operationally excellent at global scale. This is a hands-on, high-ownership role at the intersection of production operations (Day 2 ownership) and core datastore engineering, with a growing surface area in operationalizing AI/ML services and workf

Full-time

USD 173,500.00 per year

Sr Staff Site Reliability Engineer - Veza

Santa Clara, California

•

Today

Company Description Veza is the pioneer in identity security, purpose-built to answer the fundamental question enterprises face: who can and should take what action on what data. Veza's Access Graph platform maps an organization's entire identity ecosystem across users, groups, roles, policies, permissions, and resources providing deep visibility and control over human, non-human, and agentic identities across SaaS, cloud, on-prem, and custom applications. With over 30 billion access permission

Full-time

USD 165,500.00 - 289,600.00 per year

Search all similar jobs

More jobs at Apple, Inc. in Cupertino, CA