Job Details
Observability Lead Engineers with experience in PromQL and Datadog
Location: Austin, TX or Remote
Duration: 12+ months
Candidates should also be comfortable working PST hours
Skills: Detailed JD
We are seeking observability engineers to assist with operational support and observability maintenance tasks. This role is ideal for someone who is detail-oriented, eager to learn, and comfortable following established runbooks and processes. The candidate will help validate dashboards and alerts, run and monitor migration scripts, and support the observability team's day-to-day operations.
- Execute validation steps for dashboards and alerts based on provided runbooks and checklists
- Assist in running and verifying migration scripts for dashboards, alerts or telemetry pipelines
- Write scripts using Python and Terraform
- Follow standard procedures for validating PromQL queries, including metrics, labels, and expressions
- Report issues, anomalies, or missing configurations to the observability team
- Help end users troubleshoot metric queries, dashboard widgets, and alerts
Mandatory Requirements
- Expertise in monitoring tools such as Grafana, Prometheus, and OpenTelemetry
- Basic understanding of observability concepts: dashboards, alerts, metrics, and logs
- Some experience running or modifying Python scripts (including editing JSON/YAML and making API calls)
- Attention to detail and strong organizational habits are a must
Desired Skills
- Experience with Datadog and OpenTelemetry is a plus