Remote • Today
Incident Management & Resolution: Act as the primary point of contact for high-priority production incidents. Drive timely resolution, perform root cause analysis (RCA), and implement preventive measures to minimize future occurrences.
Application Monitoring & Health: Proactively monitor the health, performance, and capacity of production applications using advanced monitoring tools like Splunk and New Relic. Develop and maintain dashboards, alerts, and runbooks.
Change Management: Evaluate, appro
Full-time
Depends on Experience