Role Overview:
We are seeking an experienced Observability Architect with deep expertise in Dynatrace to lead the design, implementation, and scaling of end-to-end observability solutions across enterprise applications, infrastructure, and cloud environments. This role is pivotal in driving proactive monitoring strategies, enabling AIOps/self-healing capabilities, and delivering actionable insights to enhance system availability, performance, and reliability.
Key Responsibilities:
Define and own the observability strategy using Dynatrace as the core platform, integrating with existing IT monitoring ecosystems.
Architect and implement full-stack monitoring across applications, microservices, APIs, databases, infrastructure, and hybrid multi-cloud workloads.
Design service flow mapping, distributed tracing, real-user monitoring (RUM), and synthetic monitoring frameworks.
Enable AI-driven root cause analysis (RCA), anomaly detection, and self-healing automation.
Collaborate with enterprise architects, DevOps, SREs, and application teams to embed observability into CI/CD pipelines.
Drive instrumentation best practices using OpenTelemetry (OTel) and other open standards.
Establish dashboards, service-level objectives (SLOs), and KPIs aligned with business outcomes.
Provide governance, roadmap planning, and technical leadership for observability adoption across the enterprise.
Required Skills & Qualifications:
Proven experience in observability architecture and monitoring strategy design using Dynatrace as the core platform.
Strong understanding of application performance monitoring (APM), infrastructure monitoring, and log analytics.
Experience implementing observability across hybrid multi-cloud environments (AWS, Azure, Google Cloud Platform, on-prem).
Proficiency in OpenTelemetry (OTel) and integration with CI/CD pipelines.
Knowledge of AIOps, automated RCA, and self-healing mechanisms.
Experience designing SLOs, dashboards, and metrics frameworks for enterprise-scale systems.
Excellent collaboration skills with cross-functional teams (DevOps, SRE, Architecture).
Strong analytical mindset, problem-solving skills, and ability to translate technical insights into business outcomes.