Incident Management (IM) Operations Lead

Irvine, CA, US • Posted 16 hours ago • Updated 16 hours ago
Contract Corp To Corp
Contract Independent
54 Weeks
On-site
Depends on Experience
Fitment

Dice Job Match Score™

🔢 Crunching numbers...

Job Details

Skills

  • Incident Management
  • Telematics
  • Connected Car
  • Operations Management
  • ITIL

Summary

Job Title:       Incident Management (IM) Operations Lead

Location:       ONSITE – Irvine, CA

Duration:      Contract

 

HIRING – Incident Management (IM) Operations Lead with around 10+ years of relevant experience.

 

ROLE SUMMARY

  • We are seeking an Incident Management (IM) Operations Lead for a 24/7 operations program supporting a key player in the automotive industry's connected car space.
  • This role will lead an onsite–offshore IM function, act as the single point of accountability for all incidents and active outages which occur during onsite business hours and ensure adherence to contractual SLAs.
  • The IM Operations Lead will orchestrate end-to-end incident handling—from anomaly detection to bridge initiation, stakeholder communications, resolution, and post-incident governance—with a strong background in Telematics and Connected Vehicle systems (including Remote Services).

 

KEY RESPONSIBILITIES:

  • Lead and mentor the Incident Management team in an onsite–offshore operating model; establish clear handoffs and coverage to support 24/7 operations.
  • Own and drive the lifecycle of every major incident/outage (not a shift-based support engineer role), ensuring rapid triage, containment, and resolution.
  • Serve as the primary point of contact (POC) for all IM matters—including executive escalations—maintaining authoritative situational awareness across platforms.
  • Accountable for all IM SLAs (MTTD, MTTA, MTTR, communication SLAs); ensure measurement, reporting, and continuous improvement.
  • Should be proficient in handling the incidents ranging from high priority ones (P1, P2) to the low priority incidents (P3, P4).
  • Initiate and run incident bridges/war rooms; coordinate cross-functional responders (application, infrastructure, network, OEM partners, and third parties).
  • Oversee proactive detection through data monitoring tools (Dynatrace, Datadog, Grafana, MaxGauge) and ensure alert quality, runbooks, and signal-to-noise optimization.
  • Establish and enforce SOPs for incident declaration, severity classification, response roles and decision logs.
  • Drive disciplined stakeholder communications: timely updates to product, operations, OEM/customer contacts, leadership, and impacted regions; maintain comms cadence and channels.
  • Ensure post-incident governance: facilitate RCA, document contributing causes, corrective and preventive actions (CAPA), and circulate the RCA across teams for sign-off.
  • Partner with Problem Management to track and eradicate recurring issues; maintain problem backlog, KEDB (Known Error Database), and follow-through on remediation.
  • Define and maintain IM dashboards, KPIs, and executive reports; present weekly/monthly service reviews with trend analysis and action plans.
  • Collaborate with Product/Engineering to influence reliability roadmaps (resiliency patterns, observability, capacity, release safeguards).
  • Ensure compliance with information security, data privacy, and OEM contractual obligations during incident handling and communications.
  • Continuously refine IM playbooks, runbooks, and training; conduct simulations/game days and readiness audits across onsite–offshore teams.

 

Domain Expertise: Telematics & Connected Car

  • Hands-on knowledge of Telematics Control Unit (TCU), eSIM/OTA provisioning, backend telematics platforms, and data flows between vehicle, cloud, and mobile apps.
  • Familiarity with Remote Services (e.g., remote lock/unlock, start/stop, charge control, climate pre-conditioning), geo-services, and safety/assist features.
  • Understanding of service dependencies: identity/auth, messaging, device management, CAN bus signals, firmware/OTA update orchestration, and regional compliance.
  • Experience coordinating incidents across OEM partners, Tier-1 suppliers, cloud providers, and customer support operations.

 

Required Qualifications

  • 8 –12+ years in Operations/Service Management with 4+ years leading Incident Management in large-scale, 24/7 environments.
  • Demonstrated experience running bridges for P1/P0 incidents; proven incident commander skills and decision-making under pressure.
  • Strong background in Telematics/Connected Car domain and vehicle remote services concepts.
  • Proficiency with observability and monitoring tools: Dynatrace, Datadog, Grafana, MaxGauge (dashboards, alerting, traces, logs).
  • Expertise in IM processes: detection → triage → severity assignment → bridge initiation → stakeholder updates → resolution → post-incident RCA.
  • Working knowledge of ITIL practices (Incident, Problem, Change, Service Level Management).
  • Should have good hands on experience in using tools like JIRA, Confluence, Jenkins, XMatters.
  • Excellent communication (written/verbal), executive presence, and stakeholder management across onsite–offshore teams.
  • Ability to analyze telemetry and time-series data to drive root cause hypotheses and corrective actions.
  • Experience defining SLAs/SLOs and building KPI dashboards (MTTD/MTTA/MTTR, incident volume, recurrence, comms SLA, customer impact).

 

Tools & Technologies:

  • Dynatrace, Datadog, Grafana, MaxGauge (APM, logs, metrics, dashboards).
  • Incident & Comms: ticketing (Jira/ServiceNow), chat/bridge tools (Teams/Zoom), status pages.
  • Data: time-series analysis, log aggregation, tracing, alerting policies and noise reduction.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 91173647
  • Position Id: JB-PN2N6BBL
  • Posted 16 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Costa Mesa, California

14d ago

Easy Apply

Third Party, Contract

Depends on Experience

Torrance, California

Today

Easy Apply

Contract

USD 47.50 - 55.00 per hour

Hybrid in Chino, California

Today

Easy Apply

Contract

44 - 47

Aliso Viejo, California

Today

Full-time

USD 184,600.00 - 277,000.00 per year

Search all similar jobs