Associate Observability Engineer - Information Technology (Kansas City)

Kansas City, MO, US • Posted 30+ days ago • Updated 9 hours ago
Full Time
On-site
Fitment

Dice Job Match Score™

👾 Reticulating splines...

Job Details

Skills

  • Leadership
  • Accountability
  • Strategic Thinking
  • Reliability Engineering
  • DevOps
  • Team Leadership
  • Mentorship
  • Continuous Improvement
  • Application Development
  • Cyber Security
  • Service Level
  • Budget
  • MEAN Stack
  • Incident Management
  • IT Management
  • Regulatory Compliance
  • Reporting
  • Vulnerability Management
  • React.js
  • Microsoft Office
  • Critical Thinking
  • Conflict Resolution
  • Problem Solving
  • Attention To Detail
  • Facilitation
  • Team Building
  • Collaboration
  • Recruiting
  • Performance Management
  • Amazon Web Services
  • Google Cloud
  • Google Cloud Platform
  • Microsoft Azure
  • Virtualization
  • Storage
  • Communication
  • Management
  • Terraform
  • Ansible
  • Grafana
  • New Relic
  • Splunk
  • Kubernetes
  • Docker
  • Scripting
  • Software Development
  • Python
  • Bash
  • Cloud Computing
  • Computer Networking
  • SAP BASIS
  • Information Technology
  • Marketing Operations

Summary

Description

The Associate Observability Engineer will lead the strategy and execution of our enterprise-wide observability practice. This is a senior leadership role within the operations function of the Enterprise Technology & Security Services (ETSS) team, responsible for ensuring the reliability, performance, and health of our critical systems across on-premise infrastructure and multi-cloud environments (AWS, Google Cloud Platform, and Azure).

As a leader in this space, you will drive the principle that "Observability is Greater than Monitoring," focusing on building systems that are inherently transparent and accountable.

You will be instrumental in maturing our practice from reactive monitoring to proactive, data-driven observability, enabling teams to innovate with confidence and speed.

This role requires a blend of deep technical expertise, strategic thinking, and proven team leadership to implement and operate observability capabilities at enterprise scale.

Key Responsibilities:

+

Strategy & Vision: Define and implement the vision and strategy for observability across the enterprise, aligning with Site Reliability Engineering (SRE) and DevOps principles.

+

Platform Implementation: Lead the design, implementation, and management of a unified observability platform, leveraging tools such as Prometheus, Grafana, and the ELK Stack to process logs, metrics, traces, and events.

+

Team Leadership: Manage and mentor a team of engineers, fostering a culture of technical excellence, continuous improvement, and operational ownership.

+

Stakeholder Collaboration: Partner with application development, platform engineering, and cybersecurity teams to establish Service Level Objectives (SLOs) and error budgets for critical services.

+

Incident Management: Drive efforts to reduce Mean Time to Detect (MTTD) and Mean Time to Recover (MTTR) by enhancing alerting, automating incident response, and creating standardized runbooks.

+

Technical Leadership: Serve as the subject matter expert on observability for multi-cloud (AWS, Google Cloud Platform, Azure), on-premises infrastructure, and networking, providing guidance and establishing best practices.

+

Automation: Champion and implement automation for monitoring, alerting, and compliance reporting to reduce manual effort and ensure consistency.

+

Governance: Develop and enforce standards for logging, metrics, and tracing to ensure high-quality, actionable telemetry data across all systems.

Qualifications

Required Qualifications:
  • Bachelor's degree in technology, business, or related field and 14 years of relevant experience.
  • Applicable years of experience may be substituted for the degree requirement.
  • Expert knowledge in cloud technologies, automation scripts and infrastructure management.
  • Expert knowledge of security systems and policies and experience in vulnerability management and remediation.
  • Expert knowledge in Python and React software development.
  • Advanced computer skills (e.g., Microsoft Office Suite).
  • Excellent written and verbal communication skills.
  • Ability to lead projects and delegate work tasks to team members.
  • Ability to oversee the execution of work and resolve issues in a team environment.
  • Demonstrated critical thinking skills and ability to work methodically and analytically in a problem-solving environment.
  • Strong attention to detail, facilitation, team building and collaboration.

Strongly Preferred Qualifications:
  • Minimum of ten years of professional experience in an infrastructure engineering, SRE, or a similar role.
  • Proven experience managing technical teams, including hiring, performance management, and career development.
  • Expertise in designing, implementing, and operating observability and monitoring solutions at scale.
  • Deep technical knowledge of at least one major cloud platform (AWS, Google Cloud Platform, or Azure) and experience with multi-cloud environments.
  • Strong understanding of on-premise infrastructure, including virtualization, storage, and networking.
  • Excellent communication skills with the ability to present complex technical information to all levels of management and staff.

Preferred Qualifications:
  • Experience with Infrastructure as Code (IaC) tools like Terraform or Ansible.
  • Hands-on experience with modern observability tools (e.g., Prometheus, Grafana, Datadog, New Relic, Splunk).
  • Familiarity with containerization technologies such as Kubernetes and Docker.
  • Demonstrated ability in scripting or software development (e.g., Python, Go, Bash).
  • Experience in a large enterprise environment with complex, hybrid infrastructure.
  • Professional certifications in cloud platforms or networking.

This job posting will remain open a minimum of 72 hours and on an ongoing basis until filled.

EEO/Disabled/Veterans

Job Information Technology

Primary Location US-MO-Kansas City

Other Locations United States

Schedule: Full-time

Travel: No

Req ID: 260034

Job Hire Type Experienced #LI-MJ #COR N/A
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10394681
  • Position Id: 3c5052911ccfd385189fbd0b93301d82
  • Posted 30+ days ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

California

Today

Full-time

USD 126,000.00 - 204,500.00 per year

No location provided

Today

Full-time

USD 85,100.00 - 169,800.00 per year

No location provided

Today

Full-time

USD 119,800.00 - 234,700.00 per year

California

Today

Full-time

USD 90,000.00 - 147,500.00 per year

Search all similar jobs