Lead SRE Engineer - Stamford, CT (Need locals only)

Stamford, CT, US • Posted 1 day ago • Updated 1 day ago
Contract Corp To Corp
Contract W2
Contract Independent
12 Months
No Travel Required
On-site
Depends on Experience

Job Details

Skills

  • SRE
  • DevOps
  • Platform Engineering
  • Datadog
  • Prometheus/Grafana
  • ELK/OpenSearch
  • Nagios
  • Nimsoft
  • AWS
  • Terraform
  • CI/CD

Summary

Title: Lead SRE Engineer

Location: Stamford, CT (local candidates only)

Experience: 12+ years

Responsibilities: Reliability Engineering & Operations

  • Own and improve service reliability through SLO/SLI definition, error budgets, and operational best practices.
  • Design, implement, and maintain observability (monitoring, logging, tracing, alerting) to reduce MTTR and improve proactive detection.
  • Lead incident response practices including on-call improvements, runbooks, post-incident reviews (RCA), and preventative actions.
  • Partner with application teams to improve performance, capacity planning, and resiliency under failure scenarios.

Qualifications:

  • 7+ years of experience in SRE, DevOps, Platform Engineering, or Systems Engineering roles supporting production environments.
  • Strong proficiency with observability platforms (e.g., Datadog, Prometheus/Grafana, ELK/OpenSearch, Nagios, Nimsoft).
  • Strong hands-on AWS experience building and operating production systems.
  • Proven expertise with Infrastructure as Code (Terraform and/or CloudFormation/CDK).
  • Strong CI/CD and automation background (pipeline design, deployment strategies, testing automation).
  • Experience defining and validating RTO/RPO, and implementing BCP/DR plans with structured testing.
  • Experience with Kubernetes and auto-scaling container platforms (EKS, ECS, or Kubernetes on-prem).
  • Strong Linux fundamentals, networking concepts (DNS, TCP/IP, load balancing), and troubleshooting skills.
  • Proficiency in at least one scripting/programming language (Python, Go, Bash, or similar).
  • Ability to write clear operational documentation, runbooks, and post-incident reports.
  • Ability to work effectively in a fast-paced, dynamic, high-intensity environment (including an open floor plan, if applicable to the position), with timely responsiveness and the ability to work beyond normal business hours when required.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10120177
  • Position Id: 8975312