Site Reliability Engineer (SRE) Architect

Overview

On Site
Depends on Experience
Full Time
Accepts corp to corp applications

Skills

SRE
Site Reliability Engineer
AWS
Architect
chicago

Job Details

**Key Responsibilities: **

- Design and implement scalable, reliable, and efficient architectures on AWS.

- Manage and optimize Kubernetes clusters using Amazon EKS for container orchestration.

- Utilize Dynatrace for application performance monitoring, ensuring proactive identification and resolution of issues.

- Lead a team of SREs, providing mentorship, guidance, and fostering a culture of continuous improvement.

- Collaborate with development teams to integrate reliability into the software development lifecycle.

- Establish and maintain Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to measure system performance.

- Conduct post-mortems on incidents to identify root causes and implement preventive measures.

- Develop automation tools to enhance operational efficiency and reduce manual interventions.

**Qualifications:**

- Bachelor's degree in Computer Science, Engineering, or related field.

- 15+ years of proven experience as an SRE Architect or in a similar role with a strong focus on AWS services.

  • In-depth knowledge of Kubernetes, specifically Amazon EKS.
  • Strong proficiency in AWS services (EC2, S3, RDS, Lambda, etc.).

- Proficiency in using Dynatrace for monitoring and performance optimization.

- Strong understanding of cloud architecture principles and best practices.

- Excellent leadership skills with the ability to manage cross-functional teams effectively.

- Strong problem-solving skills and the ability to work under pressure.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About Sharpedge Solutions