Observability Engineer

Overview

On Site
Depends on Experience
Contract - W2
Contract - Independent
Contract - 6 Months

Skills

ONEAGENT
APPMON
DQL
DPL
SMARTSCAPE
DYNATRACE

Job Details

We're seeking a highly skilled and experienced Observability Engineer to join our team. In this critical role, you'll be responsible for the end-to-end monitoring of our diverse infrastructure and applications, ensuring the stability and performance of our entire technology ecosystem. You'll lead efforts in designing, implementing, and maintaining robust monitoring solutions, while also providing essential training and guidance to our operations staff.


Key Responsibilities:

  • Design, configure, and maintain comprehensive monitoring and alerting solutions using platforms like Dynatrace, SolarWinds, and Splunk to ensure optimal performance and availability across all systems (50% of time).

  • Collaborate closely with technology owners, development teams, and architects to define and instrument observability solutions, gathering requirements for effective monitoring and alerting strategies (25% of time).

  • Provide expert technical guidance, training, and escalation support to the Enterprise Operations Center staff, fostering their growth and ensuring efficient handling of monitoring-related issues (10% of time).

  • Develop and document clear processes and procedures related to application monitoring software and alert response, empowering our teams to react swiftly and effectively to incidents (10% of time).

  • Proactively monitor and respond to incidents in our production environment, minimizing downtime and ensuring business continuity (5% of time).

  • Lead and mentor offshore operations teams from both management and technical perspectives.

  • Research and propose improvements to our observability solutions, creating detailed plans for engineering approval.

  • Actively monitor the Operations teams' queue to ensure timely and appropriate handling of all requests.

  • Work hand-in-hand with Engineering to understand new initiatives and prepare our operations teams to seamlessly support them.


Technical Expertise:

  • Deep understanding of, and hands-on experience with, the Dynatrace platform are essential.

  • Proven experience with monitoring and alerting tools such as SolarWinds and Splunk.

  • Strong grasp of infrastructure monitoring (Server, Network, Database, Identity, Cloud Services).

  • Experience with application monitoring, including iOS and Android mobile apps, internal and external web applications, and custom services.

  • Familiarity with various scripting languages for automation and tool integration.


Key Competencies:

  • Leadership: Demonstrated ability to lead and mentor operations teams, both onshore and offshore.

  • Communication: Excellent verbal and written communication skills, capable of conveying complex technical information clearly.

  • Collaboration: Strong interpersonal skills with the ability to work effectively with cross-functional teams.

  • Problem-Solving: Proactive and analytical approach to identifying and resolving complex technical issues.

  • Accountability & Initiative: Takes ownership of tasks and drives solutions forward.

  • Continuous Learning: Committed to staying current with emerging technologies and best practices in observability.

  • Attention to Detail & Data Analysis: Meticulous in approach, with strong skills in gathering and analyzing data for informed decision-making.


Qualifications:

  • Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent practical experience.

  • Preferred Certifications: ITIL 4, Dynatrace Certified Professional, AWS Certified Cloud Practitioner, Microsoft Certified: Azure Fundamentals, Splunk Core Certified Power User.


Working Conditions:

  • Work is primarily performed in an office setting.

  • May require extended hours or weekend work for major incident/outage support and off-business hours activities.

About Axiom Global Technologies, Inc.