Overview
Hybrid
Depends on Experience
Contract - W2
Skills
.NET
Automated Testing
C#
Cloud Computing
Collaboration
Continuous Delivery
Continuous Integration
Cosmos-Db
Database Performance Tuning
DevOps
Finance
Microservices
IT Management
Incident Management
Java
Jenkins
Kubernetes
Mentorship
Microsoft Azure
Microsoft SQL Server
Operational Excellence
Oracle
People Management
Recruiting
Reliability Engineering
Root Cause Analysis
Scripting
Software Architecture
Terraform
Workflow
Job Details
Overview
we are hiring a Lead Site Reliability Engineer (SRE) to help drive this transformation.
This is a technical leadership role (no people management) where you will guide engineering teams in reliability, automation, observability, and cloud-native architecture using Azure.
What You ll Do
- Lead reliability and operational excellence across distributed cloud systems.
- Define and drive adoption of SLOs/SLIs, monitoring, and incident response practices.
- Architect and optimize CI/CD pipelines, automated testing, and deployment workflows.
- Champion Azure cloud, AKS/Kubernetes, and containerization best practices.
- Collaborate closely with Software Architecture, Engineering, and Production Ops teams.
- Conduct root cause analysis and implement long-term reliability improvements.
- Mentor engineers and influence decisions across multiple teams and platforms.
What You Bring
- Strong coding background in C#, .NET, Java, or Go (not just scripting).
- Hands-on deep experience with Azure, especially AKS / Kubernetes.
- Proven ability to design and optimize CI/CD pipelines (Azure DevOps, Terraform, Jenkins).
- Solid understanding of distributed systems, scaling patterns, and microservices.
- Experience defining and measuring SLOs/SLIs and implementing observability tooling.
- Database performance tuning (SQL Server / Oracle / CosmosDB) is a major plus.
Why This Role Matters
This is a key position in our Production Site Reliability Engineering (PSRE) organization, supporting enterprise cloud modernization across North America. You'll have both influence and autonomy, helping shape how reliability is engineered across GM Financial.
Work Model
Hybrid: 2 days onsite in Arlington, TX | 3 days remote
Contract-to-Hire: Full-time conversion expected within 6 12 months
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.