Role: Application Production Support Analyst / Incident Lead
Location: Bellevue, WA / Atlanta, GA / Kansas City, KS / Frisco, TX
Duration: 12+ Months
Skills: DevOps, production support, incident triage, Grafana, incident management, Unix, Splunk
Responsibilities
Incident Management & Response
- Lead and manage Major Incident (P1/P2) bridges, ensuring fast triage and restoration
- Act as the Single Point of Contact (SPOC) during major incidents
- Ensure incidents are resolved within SLA timelines with clear communication throughout the lifecycle
- Coordinate with engineering, infrastructure, DevOps, and database teams during incidents
Technical Troubleshooting
- Perform hands-on troubleshooting for microservices-based applications
- Analyze logs using Splunk, identify patterns, and isolate root causes
- Monitor application health via Grafana dashboards
- Support and debug Unix-based batch jobs, failures, and recoveries
- Query and analyze Cassandra DB for data validation and issue diagnosis
- Troubleshoot services deployed on AWS and Kubernetes (K8s)
Post-Incident & Problem Management
- Lead Root Cause Analysis (RCA) and post-incident reviews
- Track and ensure completion of corrective and preventive actions
- Identify recurring issues and partner with teams to eliminate systemic problems
Operational Excellence
- Contribute to automation and monitoring improvements to reduce MTTR
- Help refine incident processes, playbooks, and escalation models
- Support continuous improvements in observability and resilience
Requirements:
- 6-10 years of experience in Application Production Support or Incident Management
- Strong understanding of microservices architecture and distributed systems
- Splunk (advanced log analysis and querying)
- Grafana and monitoring tools
- Cassandra DB (strong querying and functional knowledge)
- Unix/Linux (batch jobs, shell scripting, troubleshooting)
- AWS (EC2, CloudWatch, core services)
- Kubernetes (K8s) and containerized environments
- Strong experience handling Major Incidents and production bridges
- Ability to work in 24x7 rotational shifts, including weekends
An early response is appreciated.
In compliance with the salary transparency law, the expected pay range for this role is $40-50. Actual compensation depends on experience and interview evaluation.
Thanks
Piyush Verma, Lead Technical Recruiter | Empower Professionals
Official Phone: x 350
100 Franklin Square Drive, Suite 104 | Somerset, NJ 08873
Certified NJ and NY Minority Business Enterprise (NMSDC)