Overview
On Site
$60,000 - $80,000
Full Time
Skills
AWS
DevOps
SRE
IaC & EC2
RDS
CI/CD
Docker & Kubernetes
AWS CloudWatch & Grafana
Python & Bash
Linux/Unix
Security
Root Cause Analysis
Incident Management & Response
Job Details
Technical Skills:
- AWS Expertise:Deep understanding and hands-on experience with core AWS services like EC2, S3, RDS, Lambda, CloudFormation, CloudWatch, and more.
- Infrastructure as Code (IaC):Proficiency in tools like Terraform or AWS CloudFormation for automating infrastructure provisioning and management.
- CI/CD Pipelines:Experience with tools like Jenkins, AWS CodePipeline, or GitLab CI/CD for automating the software release process.
- Containerization and Orchestration:Familiarity with Docker and Kubernetes for containerizing applications and managing them at scale.
- Monitoring and Observability:Experience with tools like AWS CloudWatch, Prometheus, Grafana, or similar for monitoring system performance and troubleshooting.
- Scripting and Automation:Strong scripting skills in Python, Bash, or similar languages for automating tasks and managing infrastructure.
- System Administration:Knowledge of Linux/Unix system administration, networking, storage, and virtualization.
- Security:Understanding of security best practices in cloud environments, including IAM, Security Groups, and encryption.
Soft Skills:
- Communication and Collaboration:Excellent verbal and written communication skills to collaborate effectively with development, operations, and other teams.
- Problem-Solving and Troubleshooting:Strong analytical and problem-solving skills to diagnose and resolve issues in production systems.
- Adaptability:Ability to adapt to new technologies, methodologies, and rapidly changing environments.
- Customer-centric mindset:Understanding the impact of reliability on the user experience.
- Organizational Skills:Ability to manage multiple tasks, prioritize effectively, and meet deadlines.
SRE-Specific Skills:
- Understanding of SLIs, SLOs, and Error Budgets:Ability to define and monitor key performance indicators for system reliability.
- Incident Management and Response:Experience with incident management processes and tools for quickly resolving production issues.
- Root Cause Analysis:Ability to conduct thorough root cause analysis of incidents and implement preventative measures.
In essence, an AWS DevOps Engineer with SRE focus needs a blend of technical expertise in AWS cloud services and DevOps practices, coupled with strong problem-solving, communication, and collaboration skills to ensure the reliability, scalability, and security of production systems.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.