Lead Site Reliability Engineer (Hybrid Arlington, TX)

Overview

Hybrid
Depends on Experience
Contract - W2

Skills

.NET
Automated Testing
C#
Cloud Computing
Collaboration
Continuous Delivery
Continuous Integration
Cosmos-Db
Database Performance Tuning
DevOps
Finance
Microservices
IT Management
Incident Management
Java
Jenkins
Kubernetes
Mentorship
Microsoft Azure
Microsoft SQL Server
Operational Excellence
Oracle
People Management
Recruiting
Reliability Engineering
Root Cause Analysis
Scripting
Software Architecture
Terraform
Workflow

Job Details

Overview

we are hiring a Lead Site Reliability Engineer (SRE) to help drive this transformation.

This is a technical leadership role (no people management) where you will guide engineering teams in reliability, automation, observability, and cloud-native architecture using Azure.

What You ll Do

  • Lead reliability and operational excellence across distributed cloud systems.
  • Define and drive adoption of SLOs/SLIs, monitoring, and incident response practices.
  • Architect and optimize CI/CD pipelines, automated testing, and deployment workflows.
  • Champion Azure cloud, AKS/Kubernetes, and containerization best practices.
  • Collaborate closely with Software Architecture, Engineering, and Production Ops teams.
  • Conduct root cause analysis and implement long-term reliability improvements.
  • Mentor engineers and influence decisions across multiple teams and platforms.

What You Bring

  • Strong coding background in C#, .NET, Java, or Go (not just scripting).
  • Hands-on deep experience with Azure, especially AKS / Kubernetes.
  • Proven ability to design and optimize CI/CD pipelines (Azure DevOps, Terraform, Jenkins).
  • Solid understanding of distributed systems, scaling patterns, and microservices.
  • Experience defining and measuring SLOs/SLIs and implementing observability tooling.
  • Database performance tuning (SQL Server / Oracle / CosmosDB) is a major plus.

Why This Role Matters

This is a key position in our Production Site Reliability Engineering (PSRE) organization, supporting enterprise cloud modernization across North America. You'll have both influence and autonomy, helping shape how reliability is engineered across GM Financial.

Work Model

Hybrid: 2 days onsite in Arlington, TX | 3 days remote

Contract-to-Hire: Full-time conversion expected within 6 12 months

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.