AWS SRE Engineer - Observability

Jersey City, NJ, US • Posted 1 day ago • Updated 1 day ago
Contract Independent
Contract W2
Contract Corp To Corp
12 Months
No Travel Required
On-site
$50 - $55/hr
Fitment

Dice Job Match Score™

📋 Comparing job requirements...

Job Details

Skills

  • aws
  • sre
  • Observability

Summary

 

Role:   AWS SRE Engineer - Observability  

Location:  Jersey City, NJ Onsite  (Must be local only )

Duration: 12+ Months

10+ Years exp 

Visa - Any visa except GC 

 

Job Description:

 

We are seeking an experienced AWS SRE Engineer with strong expertise in Observability, particularly in developing and maintaining Grafana dashboards for monitoring, alerting, and operational insights. The ideal candidate will have hands-on experience in AWS environments and a solid understanding of core SRE principles such as SLA, SLO, SLI, error budgets, and golden signals. Exposure to Client or Databricks is preferred.

Key Responsibilities:

  • Design, develop, and maintain Grafana dashboards to provide actionable insights into platform health, performance, and reliability.
  • Build and enhance observability solutions for AWS-hosted applications and infrastructure.
  • Define and track SLIs, SLOs, and SLAs to measure service reliability and performance.
  • Monitor system health using golden signals and implement effective alerting strategies.
  • Support incident response, root cause analysis, and continuous improvement initiatives.
  • Collaborate with engineering, platform, and operations teams to improve system resilience and operational efficiency.
  • Manage error budgets and contribute to reliability-focused engineering decisions.
  • Integrate observability practices into CI/CD and cloud operations where applicable.

Required Skills and Qualifications:

  • Strong experience as an SRE / Cloud Reliability Engineer in AWS environments.
  • Proven expertise in Observability and Grafana dashboard development.
  • Good understanding of monitoring, alerting, logging, and metrics visualization.
  • Strong knowledge of SRE concepts: SLA, SLO, SLI, error budgets, and golden signals.
  • Experience in troubleshooting production systems and improving platform reliability.
  • Strong collaboration and communication skills.

Preferred Qualifications:

  • Knowledge of Client or Databricks.
  • Experience with modern cloud monitoring and observability ecosystems.
  • Familiarity with automation, infrastructure as code, and cloud-native operational practices.

 

 

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 91163612
  • Position Id: 8968370
  • Posted 1 day ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

New York, New York

Yesterday

Easy Apply

Contract

Depends on Experience

New York, New York

19d ago

Easy Apply

Contract

Depends on Experience

Hybrid in New York, New York

5d ago

Easy Apply

Contract

Depends on Experience

Jersey City, New Jersey

Today

Contract

USD86 - USD87

Search all similar jobs