Overview
On Site
Depends on Experience
Accepts corp to corp applications
Contract - Independent
Contract - W2
Skills
Splunk
Cloudwatch
Kibana
Job Details
Job Title: Observability Architect
Location Dallas, Tx US (ONSITE)
Contract
Roles Descriptions:
Observability architect who has hands on experience on New Relic | Splunk | CloudWatch | Kibana | APM | Monitoring Solutions.
As this individual will champion automation monitoring solution, which include triaging, incident management, self-healing solution etc.
Key Responsibilities:
- Design and implement end-to-end observability strategies covering metrics, logs, traces, and user experience monitoring
- Architect custom monitoring frameworks tailored to specific business applications and infrastructure landscapes
- Implement and manage observability platforms including New Relic, Splunk, AWS CloudWatch, and Kibana
- Develop and maintain APM scripts, synthetic monitors, custom dashboards, and alerting mechanisms
- Integrate observability tools with CI/CD pipelines for proactive issue detection and faster MTTR
- Collaborate with application, infrastructure, DevOps, and security teams to ensure observability coverage across systems
- Conduct root cause analysis using correlation across metrics, logs, and traces
- Provide technical leadership in observability best practices, architecture reviews, and roadmap planning
- Define and enforce standards for SLAs, SLOs, and SLIs across environments
- Mentor and guide engineering teams in the effective use of observability tools
Key Skills and Technologies
Monitoring & APM Tools:
- Deep experience with New Relic (including APM, infrastructure, synthetics, custom instrumentation)
- Strong proficiency in Splunk (querying, dashboards, alerts, ingestion pipeline design)
- Hands-on with AWS CloudWatch (metrics, logs, alarms, insights)
- Working knowledge of Kibana and Elastic Stack (ELK)
Scripting & Customization:
- Experience in APM scripting, custom instrumentation (using Java, Python, or Node.js agents)
- Ability to create synthetic monitors, custom event generators, and automated dashboards
- Familiarity with Terraform, CloudFormation, or scripting languages (Shell, Python) for observability automation
Architecture & Integration:
- Expertise in designing observability frameworks for cloud-native (AWS/Google Cloud Platform/Azure) and hybrid environments
- Understanding of distributed systems, microservices, and event-driven architectures
- Ability to integrate observability platforms with DevOps pipelines, incident response, and ITSM tools
Qualifications:
- Bachelor s or master s degree in computer science, Engineering, or related field.
- 15+ years of experience in software engineering or infrastructure roles, with at least 5+ years in Operations.
- Proven success managing high-availability, large-scale distributed systems (e.g., microservices, cloud-native apps).
- Deep understanding of cloud platforms (AWS Google Cloud Platform), containers (Docker, Kubernetes), monitoring (Prometheus, Grafana, Datadog, new relic), and automation tools (Terraform, Ansible, etc.).
- Experience with modern CI/CD tools (e.g., Jenkins, ArgoCD, GitHub Actions).
- Strong leadership, communication, and team development skills.
Preferred Qualifications:
- Experience in regulated industries (e.g., Telecom, communications) and Global telco leaders.
- Certifications in cloud platforms (AWS Certified DevOps Engineer, Google SRE Certificate, etc.).
- Experience managing hybrid or multi-cloud environments
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.