Senior Site Reliability Engineer

Overview

Remote
On Site
Depends on Experience
Contract - W2

Skills

Amazon Web Services
Ansible
Cloud Computing
DevOps
Docker
Google Cloud Platform
Grafana
Kubernetes
Microservices
Python
Scripting
Terraform

Job Details

Position: Senior Site Reliability Engineer

Contract:W2 Only

Key Responsibilities:
  • Ensure system reliability and performance through monitoring and alerting.
  • Automate tasks and improve system performance using scripting and tools.
  • Manage incidents, conduct root cause analysis, and implement preventive solutions.
  • Collaborate with engineers to design scalable systems and provide mentorship.
  • Optimize performance and conduct capacity planning.
  • Document processes and ensure adherence to security standards.

Qualifications:
  • Bachelor s degree in Computer Science or related field.
  • 10+ years of SRE or DevOps experience.
  • Proficiency with cloud platforms (AWS, Azure, Google Cloud Platform) and automation tools (Ansible, Terraform).
  • Strong scripting skills (Python, Bash) and experience with Docker, Kubernetes.
  • Excellent problem-solving and communication skills.

Preferred:
  • Experience with monitoring and logging tools (Prometheus, Grafana, ELK Stack).
  • Familiarity with microservices and serverless computing.
  • Knowledge of security best practices and compliance standards.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.