Senior Site Reliability Engineer - Observability & Monitoring

Plano, TX, US • Posted 2 days ago • Updated 9 hours ago
Full Time
On-site
Company Branding Image
Fitment

Dice Job Match Score™

👤 Reviewing your profile...

Job Details

Skills

  • FOCUS
  • Instrumentation
  • Event Management
  • Onboarding
  • Dynatrace
  • Splunk
  • SPL
  • Dashboard
  • Extraction
  • Kubernetes
  • Linux
  • File Systems
  • Microsoft SSIS
  • JSON
  • Scripting
  • Python
  • Shell Scripting
  • Windows PowerShell
  • Documentation
  • IBM SmartCloud
  • IBM
  • Netcool
  • Grafana
  • Apache Kafka
  • ServiceNow
  • Workflow
  • .NET
  • Git
  • Continuous Integration
  • Continuous Delivery
  • Change Management
  • IT Service Management
  • Innovation
  • Collaboration
  • Recruiting
  • Artificial Intelligence
  • Privacy
  • Insurance
  • Finance
  • Professional Development
  • Training
  • Leadership
  • CompTIA
  • Customer Service
  • Career Counseling
  • SAP BASIS
  • Law
  • ADA
  • Apex
  • Oracle Application Express

Summary

Job#: 3036349

Job Description:
Senior Site Reliability Engineer - Observability & Monitoring

Location: Plano, Texas (Onsite)

Employment Type: 12 Months Contract

Role Overview

We are seeking an experienced Observability and Monitoring Site Reliability Engineer to help design, implement, and operationalize monitoring for an enterprise Event Management platform. This role will focus on defining observability coverage, implementing monitoring instrumentation, building operational dashboards, and improving visibility across platform components, integrations, and services. The primary tools for this role are Dynatrace and Splunk.

Key Responsibilities
  • Define and implement monitoring and observability coverage for the Event Management platform.
  • Establish standards for metrics, logs, traces, events, synthetic checks, and platform telemetry.
  • Build monitoring for IBM Cloud Pak for Watson AIOps, Netcool OMNIbus, Netcool Impact, OpenShift, Linux, Kafka-based services, and ServiceNow integration points.
  • Design and maintain Dynatrace monitoring for applications, infrastructure, synthetic checks, and platform dependencies.
  • Design and maintain Splunk searches, dashboards, alerts, log onboarding patterns, and operational views.
  • Create OpenShift and Kubernetes monitoring using available platform metrics, Prometheus, and Grafana.
  • Monitor Linux-based platform components, including processes, services, file systems, and resource utilization.
  • Monitor Kafka-based integrations, including topic health, consumer lag, and message throughput.
  • Provide end-to-end visibility for event flow from platform ingestion through downstream integration.
  • Develop runbooks, troubleshooting guides, validation procedures, and operational documentation.
Required Qualifications

Technical Skills:
  • Hands-on experience with Dynatrace for infrastructure, application, synthetic, service, and dependency monitoring.
  • Hands-on experience with Splunk, including Search Processing Language (SPL), dashboards, alerts, and field extraction.
  • Understanding of OpenShift or Kubernetes monitoring concepts.
  • Experience monitoring Linux-based services, processes, logs, file systems, and resource utilization.
  • Experience defining monitoring coverage for distributed platforms and integration services.
  • Experience with REST APIs, JSON, webhooks, and system-to-system integrations.
  • Experience with scripting or automation using Python, shell scripting, or PowerShell.
  • Ability to troubleshoot issues across application, infrastructure, platform, and integration layers.
  • Strong documentation skills for runbooks, monitoring standards, and support procedures.


Preferred Qualifications
  • Experience with IBM Cloud Pak for Watson AIOps.
  • Experience with IBM Netcool OMNIbus, including ObjectServer, probes, and gateways.
  • Experience with Netcool Impact, including event enrichment and policy logic.
  • Experience with Prometheus and Grafana.
  • Experience monitoring Kafka, including consumer lag, topic health, and broker health.
  • Experience with ServiceNow event, incident, or integration workflows.
  • Experience monitoring .NET applications and services.
  • Experience with distributed tracing and OpenTelemetry.
  • Experience with Git, CI/CD pipelines, and monitoring-as-code or configuration-as-code.
  • Familiarity with production change management and regulated enterprise environments.


Everforth Apex is a world-class IT services company that serves thousands of clients across the globe. When you join Everforth Apex, you become part of a team that values innovation, collaboration, and continuous learning. We offer quality career resources, training, certifications, development opportunities, and a comprehensive benefits package. Our commitment to excellence is reflected in many awards, including ClearlyRateds Best of Staffing in Talent Satisfaction in the United States and Great Place to Work in the United Kingdom and Mexico.

Everforth Apex uses a virtual recruiter as part of the application process. Click for more details. By applying for this job, you agree to receive calls, AI-generated calls, text messages, or emails from Everforth Apex and its affiliates, and contracted partners. Frequency varies for text messages. Message and data rates may apply. Carriers are not liable for delayed or undelivered messages. You can reply STOP to cancel and HELP for help. You can access our privacy policy at

Everforth Apex Benefits Overview: Everforth Apex offers a range of supplemental benefits, including medical, dental, vision, life, disability, and other insurance plans that offer an optional layer of financial protection. We offer an ESPP (employee stock purchase program) and a 401K program which allows you to contribute typically within 30 days of starting, with a company match after 12 months of tenure. Everforth Apex also offers a HSA (Health Savings Account on the HDHP plan), a SupportLinc Employee Assistance Program (EAP) with up to 8 free counseling sessions, a corporate discount savings program and other discounts. In terms of professional development, Everforth Apex hosts an on-demand training program, provides access to certification prep and a library of technical and leadership courses/books/seminars once you have 6+ months of tenure, and certification discounts and other perks to associations that include CompTIA and IIBA. Everforth Apex has a dedicated customer service team for our Consultants that can address questions around benefits and other resources, as well as a certified Career Coach. You can access a full list of our benefits, programs, support teams and resources within our 'Welcome Packet' as well, which an Everforth Apex team member can provide.

Everforth Apex Systems is an equal opportunity employer. We do not discriminate or allow discrimination on the basis of race, color, religion, creed, sex (including pregnancy, childbirth, breastfeeding, or related medical conditions), age, sexual orientation, gender identity, national origin, ancestry, citizenship, genetic information, registered domestic partner status, marital status, disability, status as a crime victim, protected veteran status, political affiliation, union membership, or any other characteristic protected by law. Everforth Apex will consider qualified applicants with criminal histories in a manner consistent with the requirements of applicable law.

If you require an accommodation under the Americans with Disabilities Act to participate in an interview with a virtual recruiter or to use our website for a search or application, please contact our Benefits Department at or . Please note that this contact information is strictly to be used for medical ADA accommodations and that no other inquiries will be answered.

UnitedHealthcare creates and publishes the Transparency in Coverage Machine-Readable Files on behalf of Everforth Apex Systems.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: apexsan
  • Position Id: BHJOB2374_3036349
  • Posted 2 days ago

Company Info

About Apex Systems

Part of the Commercial Segment of ASGN Incorporated, Apex Systems is a leading global technology services company specializing in customizable industry-specific solutions that drive better results and transform businesses for over 25 years.

Delivering Value and Innovation

Apex Systems partners with global and Fortune 500 companies, leveraging cutting-edge technology through strategic alliances to drive businesses forward. These proven solutions and services combined with our unique deployment model that builds qualified, industry specific, fit-for-purpose teams fulfills our clients’ digital visions and achieves results. Our agility and obsession with providing value enables us to support an ever-evolving digital world.

About_Company_One
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Plano, Texas

Today

Easy Apply

Full-time

USD 70.00 - 73.68 per hour

Plano, Texas

Today

Easy Apply

Full-time

USD 68.68 - 73.68 per hour

Plano, Texas

Today

Easy Apply

Full-time

Plano, Texas

Today

Easy Apply

Full-time

Search all similar jobs