Overview
Remote
Up to $110,000
Full Time
Skills
Linux
GCP
Google Cloud Platform
Cloud
Python
Bash
Shell
AWS
Docker
Kubernetes
K8
Job Details
Job Title: Site Reliability Engineer
Location: Remote Work Duration: Full Time
Job Description:
- We are seeking a skilled Site Reliability Engineer (SRE) with strong expertise in Linux systems and Cloud platforms (Google Cloud Platform and AWS).
- In this role, you will ensure high availability, scalability, and performance of cloud-based infrastructure and services.
Key Responsibilities:
- Manage and monitor Linux-based systems in cloud environments
- Design and implement infrastructure solutions using Google Cloud Platform and AWS
- Automate deployment, monitoring, and incident response
- Collaborate with development and operations teams to improve system reliability
- Troubleshoot and resolve issues in production environments
Requirements:
- 3+ years of experience as an SRE
- Strong Linux administration skills
- Hands-on experience with both Google Cloud Platform and AWS
- Proficiency in scripting (Python, Bash, etc.) and infrastructure-as-code (Terraform, CloudFormation)
- Experience with monitoring tools like Prometheus, Grafana, or similar
Preferred:
- Experience with CI/CD pipelines
- Knowledge of containerization (Docker, Kubernetes)
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.