Overview
On Site
Depends on Experience
Accepts corp to corp applications
Contract - W2
Contract - Independent
Contract - 12 Month(s)
Skills
AIOps
Observability
Job Details
Hi,
The following requirement is open with our client.
Title : AIOps Architect/ Observability Architect/ Lead Engineer
Location : Atlanta, GA
Duration : 12+ Months
Relevant Experience (in Yrs.): 8+
Detailed Job Description:
- Observability, Automation, and AIOps Architect/Lead Engineer As Observability Architect/Lead Engineer responsible for designing, implementing, and maintaining observability solutions to ensure the health, performance, and reliability of systems and applications. They work with development, operations, and security teams to integrate observability into the software development lifecycle and define standards and best practices. This role requires strong technical skills in areas like logging, monitoring, tracing, and alerting, as well as the ability to influence and lead teams What You?ll Do? Architect and implement enterprise-grade observability and automation solutions across distributed systems and cloud-native environments.? Lead the strategy and execution of AIOps initiatives to proactively detect, diagnose, and resolve incidents using machine learning and predictive analytics.? Collaborate with cross-functional teams including SREs, DevOps, software engineers, and business stakeholders to align observability and automation goals with business outcomes.? Design and maintain scalable telemetry pipelines for metrics, logs, traces, and events using modern observability stacks (e.g., OpenTelemetry, Prometheus, Grafana, ELK).? Drive automation of operational tasks and incident response using AI/ML models and rule-based systems.? Develop and maintain CI/CD pipelines integrated with observability and AIOps tools to ensure continuous feedback and improvement.? Provide technical leadership in selecting and integrating tools for monitoring, alerting, and automated remediation.? Promote best practices in observability, automation, and AIOps through documentation, training, and knowledge sharing.? Communicate architecture strategies and technical roadmaps to leadership and stakeholders.? Operate within Agile squads and contribute to sprint planning, reviews, and retrospectives.? Own and support the solutions you build, ensuring reliability, scalability, and performance.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.