Overview
Skills
Job Details
**Key Responsibilities: **
- Design and implement scalable, reliable, and efficient architectures on AWS.
- Manage and optimize Kubernetes clusters using Amazon EKS for container orchestration.
- Utilize Dynatrace for application performance monitoring, ensuring proactive identification and resolution of issues.
- Lead a team of SREs, providing mentorship, guidance, and fostering a culture of continuous improvement.
- Collaborate with development teams to integrate reliability into the software development lifecycle.
- Establish and maintain Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to measure system performance.
- Conduct post-mortems on incidents to identify root causes and implement preventive measures.
- Develop automation tools to enhance operational efficiency and reduce manual interventions.
**Qualifications:**
- Bachelor's degree in Computer Science, Engineering, or related field.
- 15+ years of proven experience as an SRE Architect or in a similar role with a strong focus on AWS services.
- In-depth knowledge of Kubernetes, specifically Amazon EKS.
- Strong proficiency in AWS services (EC2, S3, RDS, Lambda, etc.).
- Proficiency in using Dynatrace for monitoring and performance optimization.
- Strong understanding of cloud architecture principles and best practices.
- Excellent leadership skills with the ability to manage cross-functional teams effectively.
- Strong problem-solving skills and the ability to work under pressure.