Cloud Site Reliability Engineer

Overview

On Site
Full Time

Skills

Scalability
SLA
Collaboration
Product Support
Customer Experience
Root Cause Analysis
Reliability Engineering
Cloud Computing
Amazon Web Services
Terraform
Software Engineering
Python
JavaScript
DevOps
Continuous Integration
Continuous Delivery
Computer Science

Job Details

Description

Responsibilities
  • Implement tooling to monitor systems running on AWS EKS, with a focus on performance, reliability, and scalability.
  • Ensure that architecture and deployment models are sufficient to meet SLA commitments and are prepared for future scaling challenges.
  • Leverage cloud technology and platform capabilities to provide operationally sustainable solutions that are robust and cost effective.
  • Apply software engineering best practices to resolve problems thoroughly.
  • Collaborate with product support teams to drive efficiency and enhance customer experience through self-service tools and automation.
  • Ensure timely response to incidents and support requests, collaborating effectively on solutions.
  • Conduct root cause analysis and implement preventive measures to reduce toil and customer impact.
  • Lead and participate in incident retrospectives to enhance future response efforts.
  • Participate in on-call rotations, providing critical support as needed.

Qualifications
  • A successful technical track record at reputable technology firms, particularly working with large-scale cloud applications.
  • Expertise in Site Reliability Engineering concepts and practices, including the use of observability platforms and monitoring tools.
  • Experience deploying and supporting containerized applications on cloud platforms, preferably EKS on AWS.
  • Proficiency in infrastructure as code technologies, such as Terraform.
  • Strong software engineering skills in languages like Python, JavaScript, or Go.
  • Familiarity with DevOps and CI/CD methodologies.
  • Bachelor's degree in Computer Science or a related field.