Staff Software Engineer, ASE Storage Infrastructure

Washington, WA, US • Posted 4 days ago • Updated 1 day ago
Full Time
On-site
Fitment

Dice Job Match Score™

⭐ Evaluating experience...

Job Details

Skills

  • IaaS
  • IO
  • High Availability
  • Technical Direction
  • Meta-data Management
  • Backup
  • Network
  • Stacks Blockchain
  • Artificial Intelligence
  • Testing
  • Design Review
  • Technical Writing
  • Collaboration
  • Software Development
  • Rust
  • C++
  • C
  • Concurrent Computing
  • Attention To Detail
  • Incident Management
  • Computer Science
  • Distributed File System
  • Database
  • Storage
  • EBS
  • Ceph
  • HDFS
  • Replication
  • Repair

Summary

Apple Services Engineering (ASE) designs, builds, and operates the cloud infrastructure, server systems, and platform technologies that power many of Apple's most beloved experiences. \\n\\nWithin ASE, the Storage Platforms organization develops the systems that store, protect, and serve Apple's data at massive scale, with a mission to deliver storage that is durable, secure, highly available, and operated with excellence. Engineers on this team will have the rare opportunity to work on storage device-optimized low-level storage, large-scale distributed systems, and high-performance IO stacks operating at mission-critical levels of availability and durability.\\n\\nElastic Disk provides foundational block storage services to Apple's compute platforms. We are building a next-generation block storage system designed from the ground up for low latency, high availability, and strong durability - serving as a critical building block for compute workloads across Apple. \\n\\nWe are looking for a passionate lead engineer/architect with experience in developing large-scale services and infrastructure to take a central role in designing and building our next version. Join our friendly, collaborative, highly-motivated team!

You will shape Elastic Disk's features and architecture as it scales orders of magnitude, setting the multi-year technical direction for replication, durability, metadata, and the data path. In this role, you will:\n\nOwn the technical vision for the Replication & Metadata Systems powering Elastic Disk; \nEvolve our replication protocol to strengthen availability, correctness, durability to improve the overall reliability of the service as we scale.\nAdvance end-to-end durability: next-generation repair, replica reconstruction, continuous scrubbing, replication-metadata checksums, and point-in-time backup & snapshot.\nDrive performance of the replication and network data path - profiling bottlenecks and evaluating next-gen network stacks to keep replication off the critical latency path.\nRaise the engineering bar through best practices of AI toolings and insist on high standards for reviews and testing.\nRepresent the team in cross-org efforts to shape a unified replication foundation for Apple's storage backends.\nMentor and grow engineers; lead through design reviews, technical writing, and hands-on collaboration with storage engine, control plane, compute platform, and storage reliability partners.

10+ years of professional software development experience building infrastructure, systems, or distributed services.\n5+ years designing, building, and operating large-scale distributed storage or database systems in production.\nDeep expertise in at least one systems-level language; strong Rust, C++, or C proficiency preferred.\nStrong fundamentals in distributed systems - proven ability to reason about correctness under concurrency and failure; meticulous attention to detail.\nExperience owning services through the full lifecycle: design, implementation, rollout, on-call, and incident response, with measurable availability and durability SLOs.\nBachelor's degree in Computer Science, a related engineering field, or equivalent experience.

MS or PhD in Computer Science or a related field.\nStrong storage and database background - having led or been a core contributor to a distributed storage system, database, or storage engine at scale.\nHands-on experience implementing or extending consensus protocols (Raft, Paxos, Multi-Paxos, ZAB, Viewstamped Replication) in production.\nExperience building block storage, distributed file systems, distributed databases, or replicated storage engines at scale (e.g., EBS-class systems, Ceph, HDFS, Pangu, GFS-family, Spanner, CockroachDB, FoundationDB).\nDeep knowledge of data durability techniques: replication, erasure coding, scrubbers, silent-corruption detection, and automated repair/reconstruction.\nTrack record of driving cross-org technical initiatives or shared platform abstractions across multiple teams.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 90733111
  • Position Id: 60943123f679a5beb204787f3a6d256c
  • Posted 4 days ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Washington

Yesterday

Full-time

Washington

Yesterday

Full-time

Washington

Yesterday

Full-time

Washington

Yesterday

Full-time

Search all similar jobs