Overview
Skills
Job Details
Job Title: AWS SRE (Site Reliability Engineer)
Location: Houston, TX
Client: JPMorgan Chase (JPMC)
Employment Type: W2 Contract
Experience: 6 9+ years
Job Summary:
We are seeking a skilled AWS Site Reliability Engineer (SRE) to join our team supporting JPMorgan Chase. The ideal candidate will have deep expertise in AWS, infrastructure automation, monitoring, and production support to ensure the reliability, scalability, and performance of critical systems.
Key Responsibilities:
Implement and manage scalable, highly available cloud infrastructure on AWS.
Design and develop monitoring, alerting, and reliability frameworks.
Maintain system uptime, manage incident response, and reduce mean time to recovery (MTTR).
Collaborate with DevOps, development, and security teams to implement best practices.
Automate infrastructure using Terraform, CloudFormation, or similar IaC tools.
Define and enforce Service Level Objectives (SLOs) and Service Level Indicators (SLIs).
Conduct root cause analysis and post-incident reviews.
Optimize system performance and costs across AWS environments.
Required Skills:
6+ years of experience in Cloud/DevOps/SRE roles with a strong AWS background.
Proficiency with AWS services such as EC2, Lambda, RDS, VPC, S3, CloudWatch, etc.
Hands-on with Terraform, CloudFormation, or other infrastructure-as-code tools.
Strong scripting skills in Python, Shell, or similar.
Experience with CI/CD pipelines, Git, Jenkins, or similar tools.
Expertise in monitoring/logging tools like Prometheus, Grafana, ELK, CloudWatch.
Solid understanding of containerization (Docker, EKS, Kubernetes is a plus).
Strong incident management and troubleshooting skills.
Nice to Have:
Experience working in financial or banking domain environments.
Familiarity with ITIL processes and ticketing systems like ServiceNow.
Experience with SRE principles in enterprise environments.