Technical Operations Center Engineer DYNATRACE

Overview

Remote
Depends on Experience
Contract - W2
Contract - 12 Month(s)
No Travel Required

Skills

Splunk
SCOM
SolarWinds or other performance monitoring tools
PowerShell
Ruby
Perl
Dynatrace
incident Management
Event Monitoring/Event Management

Job Details

Vega Consulting is hiring! TOC Engineer with a strong Dynatrace experience. Candidates must eligible for Contract to hire job opportunities. Sponsorship is not available. Prefer candidates that are located on the East Coast. This role will require shift work.

This role supports the First-to-Know capability of the Technical Operations Center (TOC) and serves as the centralized focal point for observability and event management at CareFirst. Event Monitoring Engineers monitor the performance and capacity of enterprise-wide systems, applications and critical business processes using a variety of tools to identify hardware, software, and environmental anomalies. The successful candidate will proactively look for ways to improve processes, ensure events are meaningful and actionable, look for inefficiencies, and document new processes as they evolve. A great benefit to this team would be someone proficient in scripting and coding.

Responsibilities include:

  • Develop dashboards for critical business processes leveraging Dynatrace dashboards and/or Dynatrace Business Events on SaaS
  • Assist with the planning and execution of the migration from Dynatrace Managed to Dynatrace SaaS/Azure
  • Employ Dynatrace expertise developing performance monitoring tool alerts, dashboards, and data trend analysis in a monitoring tool for infrastructure and application performance monitoring.
  • Assist in the build-out the Technical Operations Center event management and systems monitoring capability leveraging Dynatrace infrastructure and application performance monitoring.
  • Develop and maintain Dynatrace end-to-end performance monitoring for applications, workloads and environments
  • Enable event management and analytics to address the triage and resolution of events
  • Develop and operate the infrastructure, processes, and automation that enables observability of critical assets across the organization in alignment with business priorities.
  • Analyze performance data and act on negative performance trends to identify root cause
  • Gather application data and diagrams, leverage expertise in recommending baseline monitoring thresholds, recommend performance monitoring KPIs and SLAs, and provide monitoring tool infrastructure recommendations
  • Integrate multiple system alerts providing one single pane of glass monitoring solution
  • Use your customer service expertise to help manage tasks and organize large amounts of data to use for instrumentation into an enterprise monitoring solution

Required Qualifications: 5+ years of experience in Dynatrace, synthetic URL monitoring, installing agents, forwarders, APIs, performance monitoring tool alerts, dashboards and data trend analysis

  • 3+ years experience leveraging Dynatrace SaaS, DQL, and Logs on Grail
    Experienced at developing Dynatrace dashboards for business processes, preferably leveraging Dynatrace Business Events
  • Experienced in migrating from Dynatrace Managed to SaaS
    Experience with recommending baseline monitoring thresholds, recommending performance monitoring KPIs and SLAs
  • Experience with gathering and organizing large amounts of data to use for instrumentation into an Enterprise monitoring solution
  • Deep knowledge of relevant enterprise log analysis framework
  • Experience with cloud monitoring in AWS or Azure
  • Familiar with Jira, ServiceNow, or similar tools
  • Excellent verbal and written communication
  • Looks for opportunities to improve and automate
  • Analytical and critical thinker\

Responsibilities include:

  • Provide eyes-on-glass monitoring using Dynatrace and other monitoring tools
  • Support a 24x7 system monitoring service to proactively identify and assess problems
  • Provide oversight, coordination, and visibility for critical business processes
  • Perform system health checks, some manual some automated
  • Identify, investigate, verify, report, communicate, and escalate critical events
  • Review device logs documentation and analysis where applicable
  • Develop runbooks and manage documentation for repeatable processes (Lifecycle Management)
  • Will follow basic triage steps, monitor production systems, and assure their high availability
  • Facilitate and coordinate the necessary IT response to system problems
  • Continuously analyze events and eliminate noise, and non-actionable event trends (Continual Service Improvement)
  • Provide event management support to service owners and IT managers
  • Author reports, trends and anomalies for KPI (Key Performance Indicators) for Event Management and Monitoring
  • Communicate to stakeholders; support and facilitate open communication between all stakeholders.

Required Qualifications: Associate of Arts/Associate of Science and 3+years of experience or equivalent combination such as bachelor s degree and 2+ years experience or no degree and at least 3 years in a NOC/TOC, Command Center roles.

  • 3+ years IT experience and understanding of performance monitoring tools
  • 3+ years Dynatrace monitoring experience
    2+ years operating in a command center in an Incident Management, or Event Monitoring/Event Management role
  • Ability to assess monitoring events and respond or escalate accordingly
  • Knowledge and experience of system and network infrastructures such as LAN and WAN network technologies,
  • server virtualization, enterprise storage area network (SAN) and backup, and database technologies
  • Strong analytical skills and able to collate and interpret data from various sources.
  • Strong communicator, both verbal and written, with a natural aptitude for collaboration
  • 3+ years experience working with Splunk, SCOM, SolarWinds or other performance monitoring tools
  • Process engineering or process management experience
  • Experience working in a ServiceNow environment
  • Experience with Jira, and project management frameworks like Agile Scrum
  • Experience with scripting languages like PowerShell, Ruby, Perl, etc.
  • Experience reporting against and managing Service Level Agreements (SLAs)

Desired Qualifications:

  • 3+ years IT experience and understanding of performance monitoring tools
  • 3+ years Dynatrace monitoring experience
  • 2+ years operating in a command center in an Incident Management, or Event Monitoring/Event Management role
  • Experience with scripting languages like PowerShell, Ruby, Perl, etc
  • 3+ years experience working with Splunk, SCOM, SolarWinds or other performance monitoring tools

If you have the required Dynatrace/TOC background, pls select "Apply Now" and a Vega Consulting Staffing Specialist will reach out to you.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.