Observability Architect

Atlanta, GA, US • Posted 1 day ago • Updated 1 day ago

Contract Independent

Contract W2

Contract Corp To Corp

On-site

Depends on Experience

Fitment

Dice Job Match Score™

🎯 Assessing qualifications...

Job Details

Skills

Observability Architect
AppDynamics
Dynatrace
APM
AWS
Azure
Splunk Architect

Summary

Role:- Observability Architect

Location: Atlanta, GA- Onsite 5 days

Key Responsibilities Observability Architecture & Strategy

Develop and maintain the enterprise observability reference architecture, covering logs, metrics, traces, events, dashboards, and alerts.
Lead the design and implementation of observability solutions that support hybrid multi-cloud and on-premise environments.
Establish standards, governance, and reusable frameworks for telemetry generation, ingestion, correlation, storage, and visualization.
Drive continuous improvement of monitoring maturity, integrating data-driven insights and AI-based analytics where applicable. Log Aggregation & Monitoring Solutions
Architect and administer large-scale log aggregation platforms such as Splunk, supporting both on-prem and cloud deployments.
Define and automate ingestion pipelines, parsing logic, index strategies, role-based access, and performance tuning.
Implement configuration management and infrastructure-as-code (IaC) practices for repeatable deployment and scaling of observability tools. Application & Network Performance Monitoring
Deploy, configure, and optimize APM solutions such as AppDynamics, Dynatrace, or equivalent platforms.
Integrate application tracing, synthetic monitoring, real-user monitoring (RUM), and business transaction analytics.
Support and enhance Network Performance Monitoring (NPM) capabilities to ensure end-to-end visibility across distributed systems. Cloud-Native & Modern Monitoring
Leverage cloud-native monitoring tools across AWS, Azure, or Google Cloud Platform (e.g., CloudWatch, Azure Monitor, Google Cloud Platform Operations Suite).
Guide teams in instrumenting microservices, serverless functions, containers, and Kubernetes clusters using OpenTelemetry and modern telemetry standards.
Partner with infrastructure, application, and SRE teams to ensure high availability, resilience, and performance. Automation & AI-Driven Engineering
Build automated workflows for alert tuning, anomaly detection, dashboards, and telemetry enrichment.
Explore and integrate AI/ML-based observability features such as predictive analytics, signal correlation, and automated root-cause analysis.
Advocate for automation-first practices and reduction of operational toil.

Required Qualifications

5+ years of hands-on experience with enterprise-scale log aggregation platforms, including architecture, deployment, and administration of tools like Splunk across on-prem and cloud environments.

5+ years of experience using automated configuration management and IaC tools (e.g., Ansible, Terraform, GitOps frameworks).

2+ years of experience with APM tools such as AppDynamics or Dynatrace, including end-to-end application visibility and performance diagnostics.

Experience with Network Performance Monitoring tools and methodologies.

Strong understanding of cloud infrastructure and cloud-native monitoring technologies (AWS, Azure, Google Cloud Platform).

Familiarity with OpenTelemetry, distributed tracing, and service mesh observability.

Expertise in designing dashboards, KPIs, and alerting strategies that align to business SLIs/SLOs.

Experience collaborating with DevOps, SRE, cloud engineering, and application teams in large enterprises. Preferred Qualifications

Experience implementing AI/ML-driven observability capabilities (e.g., anomaly detection, auto-baselining, correlation engines).

Knowledge of container ecosystems and orchestration platforms (Kubernetes, AKS/EKS/GKE).

Experience working with event-driven architectures and microservices environments.

Strong scripting or programming skills (Python, PowerShell, Bash, etc.).

Relevant certifications (e.g., Splunk Architect, Dynatrace Professional, Cloud certifications).

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 10217521
Position Id: 8883596
Posted 1 day ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Observability Architect

Dice Job Match Score™

Job Details

Skills

Summary

Similar Jobs