Overview
Skills
Job Details
Job Description:
We are seeking an Observability Engineer to assist in operational support and observability maintenance tasks. This role is ideal for someone who is detail-oriented, eager to learn, and comfortable following established runbooks and processes. You will help validate dashboards and alerts, run and monitor migration scripts, and support the observability teams day-to-day operations. Key Responsibilities:- Execute validation steps for dashboards and alerts based on provided runbooks and checklists.- Assist in running and verifying migration scripts for dashboards, alerts, or telemetry pipelines.- Follow standard procedures for validating PromQL queries including metrics/labels/expressions.- Report issues, anomalies, or missing configurations to Observability team.- Help end users in troubleshooting metric queries/dashboard widgets/alerts. Preferred Skills & Experience:- Familiarity with monitoring tools like Grafana, Prometheus and Open Telemetry.- Basic understanding of observability concepts: dashboards, alerts, metrics, and logs.- Some experience running or modifying Python Scripts (with modifying JSON/YAML and API calls). - Attention to detail and strong organizational habits. Must Have:- Experience with Datadog and Open Telemetry.
Keywords: Observability, Grafana, Prometheus and Open Telemetry, Datadog, PromQL, Python Scripts