Looking for Sr. Site Reliability Engineer

Overview

On Site
Depends on Experience
Contract - W2
No Travel Required

Skills

Reliability Engineer
AWS

Job Details

Looking for Sr. Site Reliability Engineer - Atlanta, GA

Target Years of Experience: 5 years

Top 5 Must Haves:

  • Extensive AWS experience: designing, deploying, and managing scalable, reliable cloud-based infrastructure;
  • Software engineering background and experience: Python, JavaScript, Bash, etc.;
  • In-depth knowledge of Infrastructure as Code (IaC) tools such as Terraform, GitHub Actions, CloudFormation, and Ansible;
  • Strong automation and scripting skills;
  • Solid understanding of CI/CD pipelines (Jenkins).

Job Description - Key Responsibilities:

  • Lead and mentor a team of SREs, fostering a culture of collaboration, continuous learning, and operational excellence.
  • Drive the adoption of SRE best practices and ensure adherence to reliability and performance standards.
  • Design and implement highly available, scalable, and fault-tolerant systems using AWS.
  • Collaborate with software engineering teams and other SREs to influence design and architecture decisions that improve system reliability and performance.
  • Develop and maintain automation scripts and tools to streamline operations, deployments, and monitoring processes.
  • Use Infrastructure as Code (IaC) tools such as Terraform, GitHub Actions, and CloudFormation to manage infrastructure.
  • Implement and maintain robust monitoring, alerting, and logging systems using tools like Splunk, Grafana, or New Relic.
  • Lead incident response efforts, conduct root cause analysis, and implement measures to prevent recurrence.
  • Oversee the design and maintenance of CI/CD pipelines using tools like Jenkins, GitLab CI, or CircleCI.
  • Ensure seamless and efficient code deployment processes, reducing time to market and increasing system reliability.
  • Conduct performance tuning and capacity planning to ensure systems can handle growing workloads. Troubleshoot, identify, and resolve performance bottlenecks in infrastructure and applications.

About Xoriant Corporation