Site Reliability Engineer - Human Engineering

Cupertino, CA, US • Posted 2 days ago • Updated 2 hours ago
Full Time
On-site
Fitment

Dice Job Match Score™

🎯 Assessing qualifications...

Job Details

Skills

  • Cloud Computing
  • Research
  • Management
  • Data Collection
  • Privacy
  • Analytics
  • Amazon Web Services
  • Redis
  • Elasticsearch
  • Kubernetes
  • Storage
  • Amazon EC2
  • Remote Desktop Services
  • Amazon RDS
  • Amazon S3
  • Virtual Private Cloud
  • Terraform
  • Scripting
  • Continuous Integration
  • Continuous Delivery
  • Computer Networking
  • Dragon NaturallySpeaking
  • DNS
  • Load Balancing
  • TLS
  • Firewall
  • Communication
  • Slack
  • Command-line Interface
  • Workflow
  • Orchestration
  • Computer Science
  • Apache Kafka
  • RabbitMQ
  • Messaging
  • API
  • Django
  • Python
  • Web Applications
  • Celery
  • PostgreSQL
  • Regulatory Compliance
  • Incident Management
  • Open Source

Summary

At Apple, new ideas have a way of becoming phenomenal products, services, and customer experiences very quickly. Imagine what you could do here. Bring passion and dedication to your job and there's no telling what you could accomplish.\\n\\nWe are a team of software engineers developing web-based tools and native applications for Apple teams. Our work empowers Apple engineers and researchers to build the products that inspire and delight millions every day.\\n\\nWe're looking for a Site Reliability Engineer who thinks like a systems engineer first and an operator second. You won't just keep things running - you'll shape how our platform evolves. Our team operates 50+ services across Kubernetes and AWS, handles sensitive health and research data, and is ramping up many architectural shifts: new service-to-service auth patterns, event-driven pipelines, and a move from on-prem to cloud-native infrastructure. We need someone who gets excited about that kind of work, can reason about distributed systems at the design level, and is a strong enough communicator to bring the rest of the team along.\\n

The Human Engineering Software team builds tools used across Apple for user studies, research participant management, health data collection, and privacy-preserving analytics. Our infrastructure spans Django backends, Kubernetes clusters (self-hosted and AWS), PostgreSQL, Redis, Kafka, Elasticsearch and a growing set of internal service integrations.\n\nThis role is engineering-forward SRE. You'll spend as much time designing systems as operating them. You'll work closely with our full-stack engineers to improve how services communicate, how we observe production behavior, and how we ship changes safely. You'll have a seat at the architecture table - we want you proposing solutions, not just implementing them.\n

BS in Computer Science, Engineering, or equivalent practical experience, with 3+ years of experience in distributed systems\nDeep experience with Kubernetes in production - cluster operations, networking, storage, troubleshooting\nStrong proficiency designing and operating services in AWS (EC2, EKS, RDS, S3, IAM, VPC)\nHands-on infrastructure-as-code experience (Terraform, Helm, or equivalent)\nProficiency in at least one backend language (Python, Go, or similar) - you can write production services, not just scripts\nExperience with CI/CD pipeline design and GitOps workflows\nStrong understanding of networking fundamentals: DNS, load balancing, TLS, firewall rules, service discovery\nExcellent communication skills. You can explain a complex system to a room of engineers who didn't build it\nExperience building internal automation or self-service tooling (Slack bots, CLI tools, workflow orchestration) that reduced manual operational work

BS in Computer Science, Engineering, or equivalent practical experience, with 5+ years of experience in distributed systems\nExperience with event-driven architectures (Kafka, RabbitMQ, or similar messaging systems)\nExperience with service mesh or API gateway patterns (Istio, Envoy, Kong, or similar)\nFamiliarity with Django/Python web applications and their operational characteristics (Celery, Gunicorn, PostgreSQL)\nExperience with observability tooling beyond basic monitoring: distributed tracing, SLO frameworks, structured logging\nBackground working with sensitive data (health data, PII) and associated compliance requirements\nExperience leading incident response and building on-call culture\nContributions to internal or open-source infrastructure tooling
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 90733111
  • Position Id: 14d42194988f95cca8a2b06d5c334156
  • Posted 2 days ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Palo Alto, California

Today

Full-time

USD 49.00 - 85.00 per hour

Cupertino, California

Today

Full-time

San Jose, California

Today

Full-time

USD 159,200.00 per year

Sunnyvale, California

Today

Full-time

USD 165,000.00 - 242,000.00 per year

Search all similar jobs