Sr. Staff Site Reliability (SRE) / DevOps Engineer

Remote • Posted 1 hour ago • Updated 1 hour ago
Contract W2
Contract Independent
Remote
Depends on Experience
Fitment

Dice Job Match Score™

🎯 Assessing qualifications...

Job Details

Skills

  • Datadog APM instrumentation
  • monitors
  • dashboards

Summary

We are seeking a Staff Site Reliability (SRE)/DevOps Engineer to improve the reliability, observability, and operational health of our production platform. This role requires someone who can go beyond basic monitoring the ideal candidate must understand application architecture and service dependencies in order to design meaningful alerts and actionable observability, not just monitoring noise.

This position combines SRE, DevOps, and observability engineering, with a strong focus on improving alert quality, reducing operational fatigue, and strengthening platform reliability.

 

Key Responsibilities

  • Optimize and clean up Datadog APM instrumentation, monitors, and dashboards to improve signal quality and reduce telemetry costs
  • Design intelligent alerting strategies to reduce PagerDuty alert fatigue
  • Develop monitoring that reflects real user impact and system health, not infrastructure noise
  • Gain deep understanding of application architecture and service dependencies to diagnose failures and cascading impacts
  • Support DevOps and platform engineering efforts, including automation and CI/CD improvements
  • Participate in on-call support during business hours (Mon Fri) and lead incident response improvements

 

Required Qualifications

  • 8+ years of experience combines SRE reliability practices with strong DevOps engineering skills
  • Strong hands-on experience with Datadog (APM, monitoring, dashboards, alerting)
  • Experience designing actionable monitoring and intelligent alerting
  • Strong understanding of distributed systems and application architecture
  • Experience supporting production systems and incident response
  • Solid DevOps automation and infrastructure skills
  • Understands applications deeply enough to create meaningful alerts
  • Can reduce monitoring noise and operational fatigue

 

 

If you are interested in getting more information about this opportunity, please contact Irina Rozenberg at your earliest convenience.

 

At Ariel Partners, we solve the most difficult problems that inhibit technology from enabling our customers to achieve their goals. Our vision is to be recognized by our stakeholders as an elite provider of IT solutions, so when they have their biggest challenges, we are on their short list. We are looking for team members who share our values of: Integrity to do the right thing even when it hurts; Commitment to the long-term success and happiness of our customers, our people, and our partners; Courage to take on difficult challenges, accept new ideas, and accept incremental failure; and the constant pursuit of Excellence. Ariel Partners is an Equal Opportunity Employer in accordance with federal, state, and local laws.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10212364
  • Position Id: 8907708
  • Posted 1 hour ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Remote

18d ago

Easy Apply

Contract

70 - 85

Remote

Today

Easy Apply

Contract

Depends on Experience

Remote

Today

Full-time

USD 89,850.00 - 141,550.00 per year

Remote

Today

Easy Apply

Contract

Depends on Experience

Search all similar jobs