Principle SRE Engineer (15+ years exp mandatory)--100% REMOTE

Woonsocket, RI, US • Posted 11 hours ago • Updated 11 hours ago
Contract Corp To Corp
Contract W2
Contract Independent
On-site
Depends on Experience
Fitment

Dice Job Match Score™

🤯 Applying directly to the forehead...

Job Details

Skills

Summary

Job Title: Principal SRE Engineer
Location: Remote
Duration: Long-Term
 
Role Summary
  • We are seeking a Principal SRE to lead reliability engineering strategy and establish enterprise-wide observability, incident management, and reliability governance.
  • This role will design and implement SLIs/SLOs, drive automation, and build centralized reliability visibility using Grafana and ServiceNow Performance Analytics.
  • The Principal SRE will work closely with engineering teams to embed reliability practices across the application lifecycle and create a Single Pane of Glass for operational and executive insight across 40+ applications.
Key Responsibilities
  • Reliability Strategy & Standards
  • Define and implement SLIs, SLOs, and reliability targets aligned with organizational Golden Pathways.
  • Build and operationalize observability standards across metrics, logs, and traces.
  • Establish SRE telemetry ingestion pipelines and reliability engineering workflows.
Monitoring & Observability
  • Design and implement telemetry for:
  • Application Performance Monitoring (APM) – service response times and bottleneck detection.
  • Logging & Tracing – correlated logs and distributed tracing.
  • Event & Alerting – meaningful alerts tied to severity and actionability.
  • Build service health dashboards and monitoring pipelines using Grafana.
Incident Management & Reliability Engineering
  • Evolve incident management practices and RCA frameworks.
  • Develop automation workflows to improve detection, response, and recovery.
  • Implement RCA tagging, compliance monitoring, and lifecycle tracking.
Centralized SRE Visibility
  • Design and build a Central SRE Operating View and Golden Dashboard (Single Pane of Glass).
  • Aggregate telemetry including reliability metrics, incident trends, MTTR, RCA themes, alert noise, and resilience indicators across 40+ applications.
  • Provide executive dashboards for CIO/VP visibility and monthly reliability reviews.
SRE Governance & Reporting
  • Develop executive scorecards including:
  • Per-application reliability score
  • SRE maturity score
  • MTTD / MTTR / MTTRestore metrics
  • Escalation patterns and failure trends
  • Deliver runbooks, telemetry integration guides, and RCA enforcement playbooks.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10317972
  • Position Id: 26-02058
  • Posted 11 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Remote

4d ago

Easy Apply

Contract, Third Party

80 - 85

Remote or Woonsocket, Rhode Island

Today

Full-time

USD 118,450.00 - 260,590.00 per year

Remote

Today

Contract

75-95/hr

Arkansas

Today

Easy Apply

Third Party, Contract

Search all similar jobs