Site Reliability Engineer (SRE) Google Cloud Platform/Kubernetes/Dynatrace

  • Chicago, IL
  • Posted 10 hours ago | Updated 10 hours ago

Overview

On Site
Depends on Experience
Contract - W2
Contract - 12 Month(s)
100% Travel

Skills

Site Reliability Engineer
GCP
Kubernetes
Dynatrace
log monitoring tools
Splunk
RCA
JIRA tickets

Job Details

Job Description:

We are seeking an experienced Site Reliability Engineer (SRE) to join our team in a production support capacity. The ideal candidate has hands-on experience with Google Cloud Platform, Kubernetes, and Dynatrace, and is familiar with log monitoring tools such as Splunk or Sumo Logic. The role demands strong skills in incident response, dashboard monitoring, RCA preparation, and client coordination.


Key Responsibilities:

  1. Manage day-to-day production support activities including incident resolution, root cause analysis (RCA), and dashboard monitoring.

  2. Apply a deep understanding of tools such as Google Cloud Platform, Kubernetes, and Dynatrace, and create alerts and dashboards as needed.

  3. Represent the SRE team in client calls and update stakeholders on ongoing issues and resolutions.

  4. Review and update JIRA tickets with the latest findings and RCA, and ensure adherence to SOPs.

  5. Assist in debugging production issues, coordinating with external vendors, and supporting new feature rollouts.

  6. Work closely with on-site teams to provide log insights (via Splunk or Sumo Logic) and measure front-end performance metrics.
