Imagine a world where every video stream starts instantly, every live event plays flawlessly, and the perfect movie finds you before you even search. As a Senior Video SRE at Apple, you'll turn this vision into reality. You'll tackle unprecedented scale challenges, engineer systems that self-heal, and ensure our video applications and infrastructure remain rock-solid when millions tune in simultaneously. If you thrive on solving complex distributed systems problems and want your work to directly impact how the world experiences entertainment, this is your opportunity.
As a Senior Video Site Reliability Engineer at Apple, you will be responsible for the reliability, scalability, and performance of our distributed applications that serve millions of users globally. You will build strong partnerships with application development teams, sister SRE teams, platform teams, product teams, as well as video encoding specialists to drive shared ownership of service reliability and maintain exceptional quality of experience for our customers.\n\nYour day-to-day work will include embedding reliability practices into the development lifecycle, building sophisticated monitoring and observability solutions, and developing automation to reduce operational toil.\n\nYou will own critical infrastructure components that video services depend on, and use data-driven approaches to identify and eliminate single points of failure. You will lead reliability design reviews and define SLO frameworks that set the standard for service health. As part of the role, you will participate in on-call rotations, lead incident response efforts, and drive post-incident reviews that result in meaningful reliability improvements.\n\nThis role offers the opportunity to work with complex JVM-based microservices and distributed systems technologies and influence architectural decisions that shape how Apple delivers video streaming content worldwide.
Bachelor's degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience.\n5+ years of experience in Site Reliability Engineering, DevOps, or Systems Engineering with demonstrated senior-level impact.\nProduction ownership at scale including on-call/incident response, post incident reviews and driving operational improvements.\nStrong understanding of Linux fundamentals and networking principles, with experience operating and debugging production systems.\nProficiency in at least one programming language (Shell, Python, Go, or similar) to reduce toil, build SRE tooling, and improve operability.\nHands-on experience with cloud infrastructure and container orchestration.\nExcellent troubleshooting and root-cause analysis skills across the full technology stack.\nEffective communicator who can collaborate with cross-functional partners to drive reliability outcomes.
Thorough understanding of distributed systems fundamentals, failure modes, and resilience patterns that prevent cascading outages.\nTrack record of building and continuously improving observability (metrics/logs/traces), alert quality, and incident response processes for complex, high-traffic environments.\nHands-on performance optimization, capacity planning, and reliability engineering (load testing, bottleneck analysis, degradation strategies).\nProven ability to build and operate Infrastructure as Code and CI/CD pipelines, including safe deployment practices and change risk controls.\nExperience debugging and operating JVM-based applications in production (e.g., understanding of thread analysis, heap profiling).\nWorking knowledge of database systems, key-value stores, caching layers, message queues, and storage infrastructure at scale.\nFamiliarity with video streaming technologies, codecs, protocols, and media delivery infrastructure.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
- Dice Id: 90733111
- Position Id: 45a9874e2be388a8c4cfa137e5bff24
- Posted 1 day ago