Dynatrace Observability Engineer

Overview

On Site
Depends on Experience
Full Time

Skills

Dynatrace
SaaS
Managed
AWS
Azure
GCP
metrics
logs
traces
OneAgent
ActiveGate
RUM
Python
Shell

Job Details

Role :: Dynatrace Observability Engineer

Location :: McLean, VA (Onsite)

Type :: Full-time

Job Description

Must Have Technical/Functional Skills

  • 3-5+ years of hands-on experience with Dynatrace (SaaS or Managed) as a primary monitoring platform.
  • Deep understanding of observability pillars: metrics, logs, traces.
  • Strong hands-on experience with cloud platforms (AWS, Azure, or Google Cloud Platform) and hybrid environments.
  • Expertise in deploying and managing Dynatrace OneAgent, ActiveGate, RUM (Real User Monitoring), and Synthetic Monitoring.
  • Familiarity with OpenTelemetry concepts and other observability standards.
  • Strong troubleshooting skills in distributed systems, microservices architectures, and containerized workloads (Kubernetes).
  • Proficiency with infrastructure-as-code (Terraform) and automation scripting (Python, Shell).
  • Good knowledge of integrating ITSM/incident management tools.

Roles & Responsibilities

  • Design, deploy, configure, and manage the Dynatrace platform to monitor applications, services, servers, networks, and cloud resources.
  • Define monitoring strategies, custom metrics, SLOs/SLIs, synthetic tests, and distributed tracing within Dynatrace.
  • Develop custom dashboards, anomaly detection, problem detection rules, and service flow visualizations.
  • Integrate Dynatrace with ITSM tools (ServiceNow, Jira) and CI/CD pipelines for proactive monitoring.
  • Analyze telemetry data to identify performance bottlenecks, availability risks, and system anomalies.
  • Lead observability reviews and recommend enhancements to improve system monitoring, alerting, and self-healing capabilities.
  • Advocate for Dynatrace best practices across development and operations teams (OneAgent deployment, tagging, Smartscape, etc.).
  • Enable automatic and dynamic baseline creation for anomaly detection.
  • Work with cloud (AWS, Azure, Google Cloud Platform) and containerized environments (Kubernetes, OpenShift) to implement cloud-native monitoring.
  • Produce regular reports on system health, performance KPIs, and service level adherence.
  • Participate in incident response activities and postmortem analyses using Dynatrace-provided insights.


About Stanley David and Associates