Apply Now

Site Reliability Engineer - Human Engineering

Austin, TX, US • Posted 30+ days ago • Updated 7 hours ago

Full Time

On-site

Fitment

Dice Job Match Score™

⭐ Evaluating experience...

Job Details

Skills

Cloud Computing
Research
Management
Data Collection
Privacy
Analytics
Amazon Web Services
Redis
Elasticsearch
Kubernetes
Storage
Amazon EC2
Remote Desktop Services
Amazon RDS
Amazon S3
Virtual Private Cloud
Terraform
Scripting
Continuous Integration
Continuous Delivery
Computer Networking
DNS
Dragon NaturallySpeaking
Load Balancing
TLS
Firewall
Communication
Slack
Command-line Interface
Workflow
Orchestration
Computer Science
Apache Kafka
RabbitMQ
Messaging
API
Django
Python
Web Applications
Celery
PostgreSQL
Regulatory Compliance
Incident Management
Open Source

Summary

At Apple, new ideas have a way of becoming phenomenal products, services, and customer experiences very quickly. Imagine what you could do here. Bring passion and dedication to your job and there's no telling what you could accomplish.

We are a team of software engineers developing web-based tools and native applications for Apple teams. Our work empowers Apple engineers and researchers to build the products that inspire and delight millions every day.

We're looking for a Site Reliability Engineer who thinks like a systems engineer first and an operator second. You won't just keep things running - you'll shape how our platform evolves. Our team operates 50+ services across Kubernetes and AWS, handles sensitive health and research data, and is ramping up many architectural shifts: new service-to-service auth patterns, event-driven pipelines, and a move from on-prem to cloud-native infrastructure. We need someone who gets excited about that kind of work, can reason about distributed systems at the design level, and is a strong enough communicator to bring the rest of the team along.

Description

The Human Engineering Software team builds tools used across Apple for user studies, research participant management, health data collection, and privacy-preserving analytics. Our infrastructure spans Django backends, Kubernetes clusters (self-hosted and AWS), PostgreSQL, Redis, Kafka, Elasticsearch and a growing set of internal service integrations.

This role is engineering-forward SRE. You'll spend as much time designing systems as operating them. You'll work closely with our full-stack engineers to improve how services communicate, how we observe production behavior, and how we ship changes safely. You'll have a seat at the architecture table - we want you proposing solutions, not just implementing them.

Minimum Qualifications

BS in Computer Science, Engineering, or equivalent practical experience, with 3+ years of experience in distributed systems

Deep experience with Kubernetes in production - cluster operations, networking, storage, troubleshooting

Strong proficiency designing and operating services in AWS (EC2, EKS, RDS, S3, IAM, VPC)

Hands-on infrastructure-as-code experience (Terraform, Helm, or equivalent)

Proficiency in at least one backend language (Python, Go, or similar) - you can write production services, not just scripts

Experience with CI/CD pipeline design and GitOps workflows

Strong understanding of networking fundamentals: DNS, load balancing, TLS, firewall rules, service discovery

Excellent communication skills. You can explain a complex system to a room of engineers who didn't build it

Experience building internal automation or self-service tooling (Slack bots, CLI tools, workflow orchestration) that reduced manual operational work

Preferred Qualifications

BS in Computer Science, Engineering, or equivalent practical experience, with 5+ years of experience in distributed systems

Experience with event-driven architectures (Kafka, RabbitMQ, or similar messaging systems)

Experience with service mesh or API gateway patterns (Istio, Envoy, Kong, or similar)

Familiarity with Django/Python web applications and their operational characteristics (Celery, Gunicorn, PostgreSQL)

Experience with observability tooling beyond basic monitoring: distributed tracing, SLO frameworks, structured logging

Background working with sensitive data (health data, PII) and associated compliance requirements

Experience leading incident response and building on-call culture

Contributions to internal or open-source infrastructure tooling

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 90733111
Position Id: 14d42194988f95cca8a2b06d5c334156
Posted 30+ days ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Austin, Texas

•

Today

Are you passionate about building systems that are resilient, scalable, and thoughtfully designed? Do you light up in technical discussions and bring fresh ideas to the table? As part of our CAD Infrastructure Development group, you'll help build and evolve the distributed systems that power our products at scale. You'll ensure our services can seamlessly and efficiently handle large-scale demands. Joining this group means you'll be responsible for contributing to the platform infrastructure tha

Full-time

DevOps SRE

Austin, Texas

•

23d ago

Job Title: DevOps SRE Location: Austin, TX or Sunnyvale, CA (5x/ week onsite) Duration: 6-12+ months Share responses to - Description Hands-on experience with provisioning, maintaining, deploying Kubernetes clusters in production environments, preferably AWS EKS. Hands on experieince with setting up and managing Istio in production grade Kubernetes cluster. Must have deep understanding of Kubernetes and Docker architecture and associated tools. Experience with deploying and upgrading missi

Easy Apply

Contract, Third Party

Depends on Experience

Senior Site Reliability Engineer (Deployment)

Hybrid in Austin, Texas

•

18d ago

Senior Site Reliability Engineer (Deployment) Location: Austin, TX (Onsite 3 days/week) Employment Type: W2 Only Openings: 3 We are hiring Senior Site Reliability Engineers to support enterprise platform deployments for a fast-growing AI-focused technology company. This is a hands-on role working with customer infrastructure teams to deploy, secure, automate, and optimize Kubernetes-based platforms. Key Responsibilities: Deploy and manage applications on Kubernetes environments (AWS, Azure, Goog

Easy Apply

Contract

Depends on Experience

Software Architect - Distributed Systems & Platform Engineering

Austin, Texas

•

Today

Do you thrive at the intersection of big-picture thinking and hands-on technical execution? Are you energized by designing systems that are resilient, scalable, and elegant? As part of our CAD Infrastructure Development group, you'll help architect and build the distributed systems that power our products at scale. You'll ensure our services can seamlessly and efficiently handle large-scale demands. Joining this group means you'll be responsible for shaping the technical direction of our platfor

Full-time

Search all similar jobs

More jobs at Apple, Inc. in Austin, TX

Site Reliability Engineer - Human Engineering

Dice Job Match Score™

Job Details

Skills

Summary

Similar Jobs