Site Reliability Engineer

  • Chicago, IL
  • Posted 2 days ago | Updated 2 days ago

Overview

On Site
$45 - $50
Contract - W2
Contract - 12 Month(s)

Skills

SRE
PYTHON
Cloud
Grafana

Job Details

Title: Site Reliability Engineer

Work Location: Chicago, IL (Onsite)
Duration: Long term
JOB DESCRIPTION:
Senior-level SRE responsible for ensuring the reliability, performance, and scalability of
platforms built on Google Cloud Platform that support a global cloud environment. The role
focuses on automation, observability, and incident response for mission-critical applications.

Minimum: 7+ years in Site Reliability Engineering or Platform Engineering
Preferred: 7+ years with enterprise-scale cloud environments
Industry: Experience in high-availability, customer-facing systems preferred
Advanced monitoring and observability (Prometheus, Grafana, New Relic, Datadog)
Incident management and post-mortem analysis
SLI/SLO definition and measurement
Chaos engineering and reliability testing
Performance tuning and capacity planning
Automation and scripting (Python, Go, Bash)
Infrastructure as Code (Terraform, Ansible)
Container orchestration (Kubernetes, Docker)
CI/CD pipeline design and implementation
Microservices architecture and distributed systems
Load balancing and traffic management
Database performance optimization
Compute: GCE, GKE, Cloud Run, App Engine
Monitoring: Cloud Operations Suite (Stackdriver), Cloud Logging, Cloud Monitoring
Networking: VPC, Cloud Load Balancing, Cloud CDN
Storage: Cloud Storage, Persistent Disks, Cloud SQL
Security: IAM, VPC Security, Cloud KMS
Cloud Trace and Cloud Profiler for APM
Cloud Deployment Manager and Cloud Build
Anthos for hybrid/multi-cloud management
Error Reporting and Cloud Debugger
BigQuery for log analysis and metrics

OPTIONAL SKILLS & SKILL PROFICIENCY

Google Cloud Professional Cloud Architect
Google Cloud Professional DevOps Engineer
Certified Kubernetes Administrator (CKA)
Communication: Excellent written and verbal communication for incident response
Problem Solving: Strong analytical and troubleshooting skills
Collaboration: Experience working with development and operations teams
Documentation: Technical writing for runbooks and procedures
On-call: Comfortable with 24/7 on-call responsibilities
Agile: Experience with Agile/Scrum methodologies


About VensIT Corp