Apply Now

Senior Observability Engineer

Hybrid in Irvine, CA, US • Posted 60+ days ago • Updated 2 days ago

Contract W2

12 Months

No Travel Required

Hybrid

Depends on Experience

Fitment

Dice Job Match Score™

👤 Reviewing your profile...

Job Details

Skills

Dynatrace
APM
Observability

Summary

As a Senior Observability Engineer, you will be responsible for designing, implementing, migrating, and optimizing endtoend monitoring and observability solutions to ensure the reliability, performance, and resilience of distributed systems and application services for our clients. You will play a critical role in advancing observability maturity across complex legacy, distributed and hybrid environments by enabling proactive detection, rapid diagnosis, efficient resolution of incidents and using AI capabilities.

This role requires deep expertise in realtime monitoring, intelligent alerting, anomaly detection, and automated remediation, with a strong focus on reducing operational risk, minimizing alert fatigue, and improving overall service reliability.

What You Will Do:

Assess the current state of monitoring and observability across applications and systems, including identifying alert fatigue, monitoring gaps, and coverage deficiencies.
Define and execute strategies to incrementally improve the monitoring and observability maturity of platforms, applications, and infrastructure.
Design and implement endtoend observability solutions that provide comprehensive visibility into business transactions, service dependencies, and underlying technical components.
Establish and promote monitoring best practices focused on noise reduction, controlled metric cardinality, and the prevention of duplicate or redundant telemetry.
Define and implement automated alerting strategies aligned with Service Level Objectives (SLOs) and Service Level Agreements (SLAs) to ensure actionable and meaningful alerts.
Develop and enforce monitoring audit standards to support governance, compliance, and regulatory requirements.
Act as an escalation point for complex or critical monitoringrelated incidents and provide strategic guidance and recommendations to engineering and operations teams.
Automate monitoring configurations, policy management, and telemetry collection using CI/CD pipelines and Infrastructure as Code (IaC) practices with tools such as Helm, Ansible, and Terraform.
Build reusable automation frameworks and standardized reporting solutions to support consistent monitoring rollouts, configuration management, and operational insights.
Leverage AI and machine learning techniques to enhance observability outcomes, including intelligent anomaly detection, alert noise reduction, predictive incident identification, automated rootcause analysis, and datadriven insights to improve service reliability and operational efficiency.

Required Qualifications: (Must have)

Overall 10+ years of experience out of which, 7+ years of solid experience with APM, monitoring, observability and event management tools including Dynatrace/AppDynamics, Splunk, Cortex, Prometheus, Grafana, and Netcool.
Experience with ITSM, ticketing tools and their integration with monitoring tools.
Proficiency in Application Workloads (Binary, Java, Python, .NET, Batch Jobs).
Experience in Python, Bash, PowerShell or JavaScript for automation of tasks.
Exposure to CI/CD pipelines and IaC (Infrastructure as Code).
Strong in analytical and problem-solving skills for diagnosing complex issues
Effective in communication, individual leadership, and cross-functional team collaboration.
Ability to think outside the box, sensitivity towards business impacts, and self-awareness to refine processes.
Bachelor’s degree in computer science or engineering field.

Preferred Qualifications: (Good to have)

Proficiency in broader aspects of monitoring and observability (APM, System Monitoring, Logs, Tracing, Visualization, Reporting and Integration)
Experience in automation/programming/coding to an extent that can instrument monitoring solutions for a given platform/tooling/practice.
Certified professional in Dynatrace/AppDynamics, Splunk, ITIL or AI.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: itassoc
Position Id: 8973910
Posted 30+ days ago

Contact the job poster

Meena Advani

Recruiting Director @ IT Associates, Inc.

View Profile

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Irvine, California

•

Yesterday

Job SummaryWe are seeking an experienced Application Management Services (AMS) Lead to oversee production support, application monitoring, incident management, and operational excellence for mission-critical enterprise applications. The ideal candidate will have strong experience leading incident response, implementing observability solutions, and driving continuous service improvements in high-availability environments. Experience in the Connected Car, Telematics, or Automotive domain is highly

Easy Apply

Contract, Third Party

50 - 55

Observability Engineer

Arizona

•

2d ago

Hi, I hope you are doing well! We have an excellent opportunity with one of our clients and would like to share the details with you. Please find the job description below and let me know if this role interests you. If you would like to proceed, kindly share a copy of your updated resume along with your contact details and a convenient time for us to connect. Title: Observability Engineer (Dynatrace, Splunk & OpenSearch) Location: Phoenix, AZ (Onsite) Long Term Contract! Experience: 7+ Years

Full-time

Senior Observability Engineer

New Jersey

•

Today

THE POSITIONOur roster has an opening with your name on it FanDuel is looking for a Senior Observability Engineer to design, build, and mature the observability ecosystem that underpins our platform and services. You will deliver deep visibility into system behavior by combining system telemetry with user signals to provide a holistic view of performance, reliability, and user experience. You'll also explore how AI and machine learning can enhance observability, from intelligent alerting and an

Full-time

USD 149,000.00 - 186,000.00 per year

Senior Observability Platform Engineer - 3631049

No location provided

•

Today

Lighthouse Technology Services is partnering with our client to fill their Senior Observability Platform Engineer position! This is a 10+ months contract with potential to hire. Preference is for someone to work hybrid in Buffalo, NY for conversion purposes, but is also open to contract remote in United States. This role will be a W2 employee of Lighthouse Technology Services. No C2C or subcontracting arrangements will be considered. What You'll Be Doing: Lead hands-on technical leadership for

Full-time

USD 80.00 - 85.00 per hour

Search all similar jobs

More jobs at IT Associates, Inc. in Irvine, CA

Senior Observability Engineer

Dice Job Match Score™

Job Details

Skills

Summary

Meena Advani

Similar Jobs