SRE Operations Director

  • Dallas, TX
  • Posted 2 days ago | Updated 2 days ago

Overview

On Site
$120,000 - $140,000
Full Time
Accepts corp to corp applications

Skills

Continuous Delivery
Continuous Integration
DevOps
Docker
GitHub

Job Details

Title: SRE Operations Director

Location: Dallas, TX (Onsite)

Visa: Any

Client: Tech Mahindra

Roles Descriptions:

SRE - Director to lead our global SRE team in building scalable, resilient, and highly available systems. This role combines deep technical expertise with strong leadership, strategic thinking, and a passion for delivering exceptional customer experiences through operational excellence.

As SRE Director, this individual will champion automation initiatives for SRE operations, aiming to enhance the performance and reliability of infrastructure and critical services. The role involves close collaboration with engineering, product, security, and operations teams to define and implement the best reliability practices organization wide.

Key Responsibilities:

Leadership & Strategy:

  • Build and lead a high-performing team of SREs Tools across various Business segment.
  • Define and execute the SRE strategy aligned with business and engineering goals.
  • Foster a culture of reliability, observability, and performance.

Reliability Engineering:

  • Own SLAs/SLOs/SLIs for key services and ensure they are met consistently.
  • Drive incident management practices, root cause analysis (RCA), and continuous improvement.
  • Oversee reliability tooling, runbooks, and automation frameworks.

Platform & Infrastructure:

  • Partner with Infrastructure, DevOps, and Cloud teams to ensure scalable platform architecture.
  • Guide the adoption of Infrastructure-as-Code (IaC), CI/CD pipelines, and modern observability tools.
  • Drive cost optimization and efficient resource utilization in cloud environments.

Collaboration & Communication:

  • Act as a reliability evangelist across engineering teams, enabling them to own and improve their services.
  • Report reliability and performance metrics to leadership and stakeholders.
  • Collaborate closely with security, compliance, and governance teams to meet regulatory requirements.

Qualifications:

  • Bachelor s or master s degree in computer science, Engineering, or related field.
  • 15+ years of experience in software engineering or infrastructure roles, with at least 5+ years in SRE or DevOps leadership.
  • Proven success managing high-availability, large-scale distributed systems (e.g., microservices, cloud-native apps).
  • Deep understanding of cloud platforms (AWS Google Cloud Platform), containers (Docker, Kubernetes), monitoring (Prometheus, Grafana, Datadog, new relic), and automation tools (Terraform, Ansible, etc.).
  • Experience with modern CI/CD tools (e.g., Jenkins, ArgoCD, GitHub Actions).
  • Strong leadership, communication, and team development skills.

Preferred Qualifications:

  • Experience in regulated industries (e.g., Telecom, communications) and Global telco leaders.
  • Certifications in cloud platforms (AWS Certified DevOps Engineer, Google SRE Certificate, etc.).
  • Experience managing hybrid or multi-cloud environments.
  • Worked as senior role in Top 5 Consultancy companies.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.