SRE Manager

Remote • Posted 4 hours ago • Updated 3 hours ago
Contract W2
Remote
$55 - $60/hr
Company Branding Image
Fitment

Dice Job Match Score™

🧠 Analyzing your skills...

Job Details

Skills

  • SRE
  • DevOps
  • AWS
  • AZURE
  • GCP
  • LINUX
  • IAC
  • CI/CD

Summary

Position: SRE Manager

Location: 100% Remote

Duration:  3 Months + Contract to Hire( CTH )

Client: RB Global( We have SOW with this client )

Work Authorization:

 

 

SRE Manager to lead a team of reliability engineers responsible for the uptime, performance, and efficiency of the customer-facing platforms. You’ll set SLOs and error budgets, build great incident and change practices, and coach engineers to automate everything that can be automated.

Responsibilities

  • Lead & grow the team:Hire, coach, and develop SREs; set goals and establish a blameless, data-driven culture.
  • Own reliability strategy:Define and socialize SLOs/SLIs and error budgets with product/engineering; enforce guardrails and tradeoffs.
  • Operate the platform:Oversee availability, latency, capacity planning, and change management across [AWS/Azure/Google Cloud Platform] and Kubernetes.
  • Incident management:Run on-call and escalation programs (SEV1/2), coordinate response, and ensure high-quality, blameless postmortems with clear follow-ups.
  • Observability:Standardize logs/metrics/traces and dashboards; reduce alert noise; drive adoption of APM/monitoring tools ([Datadog/Dynatrace/PrometheGrafana/New Relic]).
  • Automation & resilience:Champion infra-as-code, CI/CD, chaos/game days, load testing, and toil reduction.
  • Security & compliance partnership:Work with Security, Compliance, and Finance on least-privilege, secrets management, cost efficiency, and audit readiness.
  • Stakeholder alignment:Partner with Product, App Eng, Data, and Support to prioritize reliability work and communicate risk/status to leadership.

Requirements

  • 8+ years in software/platform/reliability engineering, including 2–4 years leadingSRE/DevOps/Platform teams.
  • Proven experience operating large-scale services on [AWS/Azure/Google Cloud Platform]with Kubernetes and containers.
  • Strong fundamentals in Linux, networking, and distributed systems.
  • Hands-on with IaC(Terraform/CloudFormation/Bicep), CI/CD (GitHub Actions/CircleCI/Azure DevOps), and one scripting language (Python/Go/Bash).
  • Deep understanding of observability(metrics, logs, traces) and alerting best practices.
  • Track record running on-callprograms and driving measurable reliability improvements.
  • Excellent communication and stakeholder management; comfortable presenting trade-offs and data to executives.

 

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 90967474
  • Position Id: NEEJP00019296
  • Posted 4 hours ago

Company Info

About Whiz Global LLC

Whiz Global LLC is currently accepting resumes for a variety of positions. Please review the database of positions that we are seeking to fill and contact us for additional information about any specific opportunity.


Careers
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

It looks like there aren't any Similar Jobs for this job yet.

Search all similar jobs