Overview
Skills
Job Details
Required Skills & Qualifications:
Note - it`s a remote job
- Experience:
o L1 & L2 Support: Minimum of 10+ years of experience in a similar support role (Incident Management, Problem Management, Service Request Management, including troubleshooting complex issues in cloud environments).
o Soild hands-on experience with AWS cloud services (EC2, EKS, S3, RDS, Lambda, AWS Transfer Family, etc.), Java Spring boot microservices and cloud monitoring tools.
o Strong grounding in ITIL practices Incident, Problem, Change, and Request Management.
- Technical Expertise:
o Deep understanding of cloud-native application support in AWS.
o Expertise in application monitoring tools (e.g., AWS CloudWatch, Splunk, and Datadog).
o Strong troubleshooting and RCA skills, especially in production environments.
o Familiarity with common troubleshooting methodologies and tools for cloud infrastructure.
o Proficiency in scripting or automation using tools like Python, Bash, YAML, and Terraform is a plus.
o Understanding of Java: Ability to understand, troubleshoot, and diagnose issues in Java-based applications or platforms, especially in a cloud environment.
- Problem-Solving & Troubleshooting:
o Ability to effectively diagnose and resolve complex technical issues with cloud-based infrastructure and applications.
o Analytical mindset with a strong ability to apply logical problem-solving skills under pressure.
Soft Skills:
- Strong leadership and mentoring capabilities.
- Excellent verbal and written communication skills for working with both technical and non-technical stakeholders.
- Attention to detail and a proactive approach to identifying and solving problems.
- A strong sense of ownership and accountability for the stability and reliability of production systems.
Shift Requirement:
- This position requires the flexibility to work in rotating shifts to ensure 24/7 support
coverage, including nights, weekends, and holidays as required