Site Reliability Engineer, Enterprise Technology Services

  • Austin, TX
  • Posted 1 day ago | Updated 1 hour ago

Overview

On Site
Full Time

Skills

Leadership
Incident Management
Infrastructure Architecture
Computer Hardware
Management
Service Level
IT Management
Testing
Failover
CHAOS
Regulatory Compliance
Auditing
DevOps
IaaS
Amazon Web Services
Google Cloud Platform
Google Cloud
Orchestration
Kubernetes
Terraform
Computer Networking
Systems Design
Scripting Language
Python
Bash
Stacks Blockchain
Grafana
Continuous Integration
Continuous Delivery
Cloud Computing
Cryptography

Job Details

We are looking for a Senior Site Reliability Engineer (SRE) with strong architectural experience to join JMET SRE Team. This individual will play a key role in designing and scaling reliable, secure, and high-performance infrastructure across our cloud and hybrid environments. You will be responsible for establishing reliability patterns, driving large-scale systems design, and building automation frameworks to support production systems at scale.

Description This is a hands-on leadership role with architectural ownership, strategic influence, and deep technical impact across multiple domains, including application and infrastructure security, incident response engineering, and resilience automation.

Responsibilities
  • Architect Scalable Infrastructure: Design, evolve, and review highly reliable, performant, and cost-efficient cloud-native and hybrid infrastructure using IaC, containers, and micro services principles.
  • Support Cryptographic Systems at ScaleDesign and operationalize scalable, secure integrations with Hardware Security Modules (HSMs) for sensitive workloads, key management, and cryptographic operations.
  • Drive SRE Best Practices: Define and implement service-level indicators (SLIs), objectives (SLOs), and agreements (SLAs) to guide engineering teams towards reliability and observability goals.
  • Incident Architecture & Prevention: Serve as a technical lead during major incidents. Partner with security and platform teams to conduct deep post-incident reviews, drive systemic improvements, and establish preventive architectural controls.
  • Sytem Design & Tooling: Build and maintain reusable tooling, automation frameworks, and reliability platforms (observability, alerting, chaos testing, auto-scaling, failover).
  • Reliability as Code: Champion resilience engineering via automation pipelines, CI/CD integrations, canary releases, and chaos engineering principles.
  • Multi-Cloud and Hybrid Systems: Design, assess, and guide architecture decisions across AWS, Google Cloud Platform, AliCloud, and on-premises infrastructure. Ensure consistency, interoperability, and regulatory compliance.
  • Security & Compliance: Ensure architectural patterns are aligned with security standards, compliance requirements, and audit readiness.

Minimum Qualifications
  • 7+ years of experience in SRE, DevOps, or Infrastructure Engineering roles, with 2+ years in an architectural or principal engineering capacity.
  • Deep expertise in cloud infrastructure (AWS, Google Cloud Platform, or AliCloud) and container orchestration (Kubernetes, EKS).
  • Proven experience with Infrastructure as Code (Terraform, Pulumi, CloudFormation).
  • Strong understanding of distributed systems, networking, and systems design at scale.
  • Proficiency in at least one programming or scripting language (Python, Go, Bash, or similar).

Preferred Qualifications
  • Experience designing observability stacks (Prometheus, Grafana, Datadog, OpenTelemetry, ELK, etc.).
  • Solid background in CI/CD tools and modern deployment strategies (ArgoCD, Spinnaker, GitOps).
  • Familiarity with security best practices in cloud and containerized environments.
  • Familiarity with HSMs and crypto operations at scale will be a plus.

This posting is not for a specific job opening and by submitting your resume you are expressing interest in being contacted about this type of role at Apple in the future.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant .
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.