Site Reliability Engineering (SRE) Lead

Hybrid in Fort Mill, SC, US • Posted 10 days ago • Updated 10 days ago

Contract W2

Contract Independent

Hybrid

Depends on Experience

MASH Pro Tech

Fitment

Dice Job Match Score™

🤯 Applying directly to the forehead...

Job Details

Skills

Aws
docker
kubernetes
Terraform
CloudFormation
Ansible
Python
Bash
Prometheus
Grafana
ELK/ELK Stack
Splunk
Datadog

Summary

Position: Site Reliability Engineering (SRE) Lead
Location: Fort Mill, SC (2-3 days onsite in a week, hybrid)
Only seniors with Lead exp

Summary
A senior technical leader responsible for owning reliability strategy, leading an SRE team, and ensuring the operational health, scalability, and availability of services. Combines hands on engineering, automation, and people leadership to drive reliability across the organization.
Core responsibilities
Strategy & process
Define SRE strategy, process frameworks, standards, and best practices.
Establish SLIs, SLOs, and error budget policies; embed reliability into the SDLC.
Promote a culture of service ownership and maintain strong cross team feedback loops.
Reliability & capacity
Oversee monitoring and maintenance to meet SLAs and uptime targets.
Drive capacity planning and forecasting to ensure performance at scale.
Use data and metrics to prioritize reliability investments and tradeoffs.
Automation & tooling
Lead automation efforts to eliminate operational toil and streamline runbooks.
Oversee Infrastructure as Code practices (for example Terraform, CloudFormation) and configuration management.
Improve CI/CD pipelines to enable safer, faster releases.
Incident & change management
Lead incident response and communications during outages.
Conduct blameless postmortems and ensure corrective actions are executed.
Govern change control to ensure safe, tested production deployments.
Collaboration & communication
Partner with engineering, architecture, and product teams to bake reliability into designs and roadmaps.
Translate technical issues and tradeoffs for technical and nontechnical stakeholders.
Team leadership
Hire, mentor, and develop SRE engineers; set team goals and a roadmap.
Lead calmly and effectively under pressure during critical incidents and drive customer focused decisions.
Qualifications & skills
Technical
Proven SRE/DevOps/infrastructure experience (typically 6+ years) with leadership experience (about 2 3 years).
Strong cloud experience (AWS preferred), containerization (Docker), and orchestration (Kubernetes).
Expertise with IaC and automation tools (Terraform, CloudFormation, Ansible, or similar).
Proficient in scripting and programming for automation (Python, Bash, or similar).
Deep experience with monitoring and observability tooling (Prometheus, Grafana, ELK/ELK Stack, Splunk, Datadog, etc.).
Leadership & soft skills
Strong people leadership and coaching skills with proven stakeholder communication.
Excellent problem solving, analytical thinking, and adaptability.
Strategic mindset balancing engineering excellence with business priorities.
Deliverables
A measurable reliability roadmap aligned to business goals.
Reduced operational toil through automation and improved runbooks.
Clear SLIs, SLOs and established error budget governance.
A high performing SRE team with documented processes for incident and change management.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 91166674
Position Id: 8869271
Posted 10 days ago

Company Info

About MASH Pro Tech

Welcome to MASH Pro Tech, where innovation meets excellence. We are a leading technology solutions provider dedicated to transforming businesses through cutting-edge technology and unparalleled expertise. Our mission is to empower organizations with the tools and knowledge they need to thrive in the digital age.

Choosing MASH Pro Tech means partnering with a team that is passionate about technology and dedicated to your success. We take the time to understand your business and develop solutions that drive real results. With our extensive experience and commitment to excellence, you can trust us to deliver the best possible outcomes for your organization.

Go to company profile

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.