Overview
Skills
Job Details
Application Support
Java Backend Support
Spring Boot Microservices
- Application support expertise
- Strong hands-on experience in Java and Spring Boot backend development
- Experience with Amazon EKS, Kubernetes, and containerized environments
- Proven experience supporting cloud-native microservices in production
- Familiarity with monitoring, logging, and alerting tools (App D, Grafana, CloudWatch, ELK, Splunk)
- Exposure to monitoring tools like AppDynamic
- Knowledge of CI/CD pipelines and DevOps practices. Understanding of content management
- systems and integration pipelines
- Incident management and ITIL practices
- Troubleshooting and analytical skills in complex distributed systems
- Scripting skills in Python or Bash
- Ownership of incident triage, RCA, and resolution
- Strong communication and cross-team coordination skills
- Experience with maintaining support documentation and runbooks
- Participate in on-call rotations and handle critical incident responses
- Experience mentoring support engineers
Roles & Responsibilities
Lead L2/L3 support for Java/Spring Boot microservices in a cloud-native (EKS/Kubernetes) environment Own incident triage, RCA, and resolution across distributed systems.
Oversee monitoring (AppDynamics, Grafana, ELK, CloudWatch) and ensure proactive alerting.
Drive automation via scripting (Python/Bash) and streamline support workflows.
Collaborate with DevOps on CI/CD and release readiness.
Maintain support runbooks, documentation, and ensure ITIL-aligned operations.
Mentor support engineers and ensure round-the-clock on-call coverage.
Ensure application stability, performance, and effective stakeholder communication.
Team Leadership: Guide and mentor L2/L3 support engineers; build a high-performing and collaborative team.
Incident & Crisis Management: Lead critical incident response with composure; ensure timely resolution and stakeholder communication.
Stakeholder Management: Coordinate effectively with business, development, QA, and DevOps teams to align priorities and expectations.
Resource Planning: Manage team shifts, on-call rotations, and workload distribution to ensure 24x7 coverage.
Process Governance: Enforce ITIL practices (incident, problem, and change management) and drive compliance.
Decision-Making: Make quick, informed decisions under pressure, especially during outages or high-severity issues.
Communication Skills: Provide clear, concise, and timely updates to technical and non-technical stakeholders.
Continuous Improvement: Identify support process gaps and drive automation, tooling, and documentation enhancements.
Performance Management: Track KPIs (MTTR, SLA adherence, incident volume) and drive team accountability.
Risk & Impact Assessment: Evaluate change and deployment risks to production systems and recommend mitigation plans.