Senior Engineering Program Manager, Enterprise Technology Services

Overview

On Site

Full Time

Skills

EPM

Reliability Engineering

Strategic Thinking

Analytical Skill

Creative Problem Solving

Process Improvement

Cross-functional Team

Global Operations

Roadmaps

Scalability

KPI

Investments

Capacity Management

Reporting

Apache Velocity

Operational Excellence

Budget

Regulatory Compliance

Computer Hardware

Recovery

Clarity

Accountability

Collaboration

Project Management

Network

Software Engineering

Customer Relationship Management (CRM)

Tableau

Dashboard

Project Scoping

Facilitation

IT Program Management

Service Delivery

Kubernetes

Cloud Computing

Microsoft Azure

Google Cloud Platform

Google Cloud

DevOps

Continuous Delivery

Splunk

Amazon Web Services

Amazon S3

Hosting

Servers

Storage

Database

Backup

DMZ

WAF

Computer Networking

Citrix

VMware

Linux

Incident Management

Root Cause Analysis

Data Centers

Manufacturing

Stakeholder Management

Management

Communication

Negotiations

Presentations

Leadership

Operational Efficiency

Job Details

We are seeking a Engineering Program Manager (EPM) to lead large-scale Site Reliability Engineering (SRE) initiatives that underpin the resilience, scalability, and performance of our cloud-native services. This senior role requires strategic thinking, program leadership, and deep collaboration across engineering, operations, and product to drive reliability outcomes at scale. You will be a key partner to senior engineering leaders, ensuring alignment of priorities, disciplined execution, and operational excellence across the SRE portfolio.

Description Our organization works with many cross functional teams across the company. We're looking for an intellectually curious and creative individual who is comfortable operating in ambiguity, a strategic and operational thinker with strong analytical and creative problem-solving skills. They have a passion for process improvement, operational efficiency, and contributing to delivering on some of Apple's most important product goals through operational execution. You will work directly with our cross-functional team across Global Operations to execute global projects from inception to launch.

Responsibilities

Program Leadership & Strategy
Define and drive multi-year SRE program roadmaps that enhance service availability, performance, and scalability.
Translate strategic reliability objectives into actionable execution plans with clear milestones, KPIs, and accountability.
Partner with engineering, product, and operations leaders to prioritize investments balancing short-term delivery with long-term platform evolution.
Execution & Delivery
Lead cross-functional engineering programs spanning capacity planning, incident reduction, observability, automation, and infrastructure modernization.
Establish governance models, reporting cadences, and decision frameworks to improve delivery velocity and predictability.
Manage complex dependencies across SRE, platform engineering, security, and product teams.
Operational Excellence
Drive adoption of reliability best practices (SLAs, budgets, incident retrospectives) across services.
Ensure consistent application of security and compliance standards across heterogeneous hardware and software environments.
Champion automation and tooling that reduces toil and accelerates recovery.
Stakeholder Management & Communication
Communicate program status, risks, and impact to executives, engineering leaders, and partner orgs with clarity and transparency.
Build trust and alignment across globally distributed teams, fostering a culture of accountability and collaboration.
Serve as a bridge between business needs and technical execution, ensuring customer impact remains central.

Minimum Qualifications

5 + years of technical program or project management for large-scale Infrastructure projects.
Proven track record and technical knowledge in infrastructure delivery (data stores, storage compute, network), DevOps, SRE, and/or software engineering.
Build strong customer relationships with operations, data centers, suppliers, vendors, manufacturing teams. Identify opportunities that benefit the customer and deliver solutions that meet customer expectations.
Experience with Tableau or similar dashboard applications.
Expert in project scoping, identifying risks, developing mitigation strategies, stakeholder management, data-driven analysis for decisions and facilitating resolutions along with application readiness and change adoption.
Willing and able to travel to international manufacturing partners (can be up to 2 weeks at a time)

Preferred Qualifications

Experience in technical program management, service delivery, or engineering leadership, driving reliability, infrastructure, or platform engineering programs.
Proven track record of leading large, multi-team programs in highly available, large-scale distributed systems or cloud environments.
Strong understanding of SRE practices, DevOps principles, and modern infrastructure (Kubernetes, containers, cloud platforms like AWS/Azure/Google Cloud Platform).
Readily learns and adopts new technologies. Knowledge of DevOps, continuous delivery, Splunk, AWS services like S3, hosting components such as Netscalers, OS, servers, storage, databases, backup, load balancers, DMZ, WAF, networking, Citrix, VMWare, Linux etc,. Also Deep understanding of incident management processes and best practices. Ability to drive the root cause analysis, identify the corrective actions, and followup to closure.
Deeply understands architecture and integration points of application sets to support process, products, configuration, policies that takes into consideration the needs of supply chains, data centers and global contract manufacturing sites.
Demonstrated success in executive stakeholder management and influencing without direct authority.
Excellent communication, negotiation and presentation skills to globally dispersed project teams and leadership
Clear, measurable improvements in service availability, latency, and operational efficiency across key platforms.
Cross-org alignment and execution confidence in reliability programs.
A repeatable framework for SRE program delivery that scales across services and geographies.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant .

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Job Details

Share