Compute SRE

Cupertino, CA, US • Posted 5 hours ago • Updated 5 hours ago
Full Time
On-site

Job Details

Skills

  • Scalability
  • Operational Excellence
  • Continuous Improvement
  • Accountability
  • Cloud Computing
  • Collaboration
  • Software Release Life Cycle
  • Virtual Machines
  • Continuous Integration
  • Continuous Delivery
  • Capacity Management
  • Testing
  • Disaster Recovery
  • Computer Science
  • Reliability Engineering
  • Python
  • Java
  • Puppet
  • Progress Chef
  • Ansible
  • Terraform
  • IaaS
  • Incident Management
  • Communication

Summary

As a Site Reliability Engineer at Apple, you will be responsible for driving the reliability, scalability, and observability of our cloud platform. Your work will ensure the uptime and performance of mission-critical systems that serve millions of users every day. We're looking for a self-motivated engineer who is committed to operational excellence and continuous improvement. You'll work closely with developers and architects within the team to build and extend our platform, and you'll be part of a rich fabric of people from many different disciplines, all invested in building the best cloud platform to run world-class services at scale. We're building a team of high trust and accountability, and are searching for a like-minded individual who is excited to build foundational capabilities into Apple's Cloud Platform!

AS AN SRE AT APPLE YOU WILL:

  • Build, operate, and scale Apple's Cloud Platform that powers mission-critical services across the globe.
  • Accelerate delivery of core services with automation and visibility into release cadences.
  • Collaborate with developers to build and release reliable software that manages the lifecycle of customer VMs.
  • Drive reliability and excellence of service through CI/CD, production readiness reviews, and incident response.
  • Instrument, analyze, and iterate on performance bottlenecks across distributed systems.
  • Actively participate in on-call rotations, capacity planning, scale testing, and disaster recovery exercises.
  • Ensure uptime SLOs with well-architected systems and rigorous observability.

  • Bachelor's Degree in Computer Science, an engineering-related field, or equivalent related experience.
  • 1+ years in a Site Reliability Engineering Infrastructure focused role.
  • Proficiency in Go, Python, or Java.
  • Proficiency with Infrastructure as Code (IaC) tools like Puppet, Chef, Ansible, or Terraform.
  • Experience with cloud infrastructure and experience running businesses.
  • Experience in architecting, building, and running large-scale distributed systems.
  • Experience providing 24/7 on-call support and incident management for critical production infrastructure.
  • Ability to troubleshoot issues across the entire infrastructure stack (Profiling, Tracing, etc).

  • Interpersonal and written communication skills, targeted to both technical and non-technical audiences.
  • Experience operating large-scale multi-tenant Infrastructure as a Managed service.
  • Dice Id: 90733111
  • Position Id: 494f48eaf6fe52146b8439259fcf2d62