Site Reliability Engineer (SRE) with Strong Kubernetes || Atlanta, GA (onsite, hybrid), locals only

Overview

On Site
Accepts corp to corp applications
Contract - W2
Contract - Independent
Contract - Long term

Skills

kubernetes
SRE

Job Details

Hi,

I hope you are doing well.

Please find the job description (JD) below. If you are interested, please share the details requested below along with your updated resume.

Title: Site Reliability Engineer (SRE) with Strong Kubernetes

Location: Atlanta, GA (onsite, hybrid); locals only

We are looking for SREs with experience in Kubernetes and cloud platforms. Strong communication skills are essential, since this team will need to interact with multiple application teams and, at times, even VPs during critical issues. Expectations for this role are high, so we need top-quality candidates with both strong technical skills and excellent communication abilities.

Job Summary:

We are seeking an experienced Site Reliability Engineer (SRE) with advanced DevOps expertise to help build, scale, and maintain our infrastructure and services. You will play a critical role in ensuring high availability, performance, scalability, and security of our production systems, while enabling continuous deployment and rapid delivery of features to our customers.

Key Responsibilities:

  • Design, build, and maintain reliable, scalable, and secure cloud-based infrastructure (AWS, Azure, or Google Cloud Platform).
  • Develop and improve observability using monitoring, alerting, logging, and tracing tools (e.g., Prometheus, Grafana, ELK, Datadog).
  • Automate repetitive tasks and infrastructure using Infrastructure-as-Code (Terraform, CloudFormation, Pulumi).
  • Create and maintain CI/CD pipelines (e.g., GitHub Actions, GitLab CI, Jenkins, ArgoCD) to support fast and safe delivery.
  • Lead incident response, root cause analysis, and postmortems to ensure high uptime and rapid recovery.
  • Optimize system performance, reliability, and cost-effectiveness through proactive monitoring and tuning.
  • Collaborate with software engineering teams to define SLAs/SLOs and improve service reliability.
  • Implement and maintain security best practices across environments (e.g., secrets management, IAM, firewalls).
  • Maintain disaster recovery plans, backups, and high-availability strategies.

Qualifications:

Required:

  • 5+ years of experience as an SRE, DevOps Engineer, or similar role.
  • Proficiency in scripting and automation (Bash, Python, Go, etc.).
  • Strong experience with containerization and orchestration (Docker, Kubernetes, Helm).
  • Solid understanding of Linux systems administration and networking fundamentals.
  • Experience with cloud platforms (AWS, Azure, or Google Cloud Platform).
  • Experience with IaC tools like Terraform or CloudFormation.
  • Familiarity with GitOps and modern deployment practices.
  • Hands-on experience with observability tools (e.g., Prometheus, Grafana, Datadog).
  • Strong troubleshooting and incident response skills.

Preferred:

  • Experience in a high-traffic, microservices-based architecture.
  • Exposure to service meshes (Istio, Linkerd).
  • Certifications (AWS Certified DevOps Engineer, CKA, etc.).
  • Experience with security automation and compliance (e.g., SOC2, ISO27001).

Soft Skills:

  • Strong communication and collaboration abilities.
  • Ability to thrive in a fast-paced, agile environment.
  • Analytical mindset and proactive approach to problem-solving.
  • A passion for automation, performance, and system design.

Full Name:

Current Location (City, State):

Contact Number:

Email ID:

Visa:

LinkedIn:

DOB (MM/DD):

Total Years of Experience:

Full Education Details:

Rate:

Skill | Rating | Experience
Kubernetes | |
SRE | |
Cloud-based infrastructure (AWS, Azure, or Google Cloud Platform) | |
Excellent communication*** | |
Prometheus, Grafana, ELK, Datadog, etc. | |
Infrastructure-as-Code (Terraform, CloudFormation, Pulumi) | |
Scripting and automation (Bash, Python, Go, etc.) | |

Kapil Kumar | Senior Talent Acquisition Specialist

Amaze Systems Inc

USA: 8951 Cypress Waters Blvd, Suite 160, Dallas, TX 75019

Canada: 55 York Street, Suite 401, Toronto, ON M5J 1R7

D: +1

E: |

USA | Canada | UK | India

Amaze Systems is an Equal Opportunity Employer (EOE) and does not discriminate based on age, gender, religion, disability, marital status, or race. It also adheres to laws relating to non-discrimination on the basis of national origin and citizenship status.


About Amaze Systems Inc