osition’s Contributions to Work Group:
- Incident Management & Cross‑Functional Collaboration - Proven experience owning and managing incident tickets through their full lifecycle, working closely with cross‑functional teams to triage issues, coordinate resolution, communicate status, and drive root cause and follow up actions.
- Cloud Platforms & API Operations (AWS) - Demonstrated experience owning and supporting AWS hosted APIs and services in production, with strong knowledge of API design, AWS core services, security, monitoring, incident management, and driving reliability and operational readiness.
- Operational Readiness & Support Enablement - Proven experience understanding end‑to‑end technical and business flows, developing clear and actionable runbooks, and ensuring support teams are fully prepared through documentation, knowledge transfer, and operational readiness activities.
Typical task breakdown:
- Own incident tickets through the full lifecycle, from initial triage to resolution and closure.
- Collaborate with engineering, platform, product, and operations teams to diagnose issues and coordinate fixes.
- Communicate incident status, impact, and resolution progress to stakeholders.
- Lead or contribute to root cause analysis and ensure follow up actions are identified and tracked.
- Ensure platform reliability through monitoring, alerting, security, and operational best practices.
- Respond to and manage production incidents impacting AWS services and APIs.
- Drive reliability, stability, and operational readiness improvements across cloud platforms.
- Understand end‑to‑end technical and business flows to support production services effectively.
- Develop, maintain, and improve clear, actionable runbooks for operational support.
- Lead knowledge transfer sessions to ensure support teams are ready for production support.