SRE Manager

Overview

On Site
Depends on Experience
Accepts corp to corp applications
Contract - W2
Contract - Independent
Able to Provide Sponsorship

Skills

Electronic Commerce
Retail
ITIL
Shopify
e-Commerce

Job Details

Job Title: SRE Manager

Experience Required: 10+ Years

Location: San Ramon, CA. (Day 1 onsite)

Note: Manage L1, L2, and L3 support for eCommerce platforms (e.g., Shopify, Blue Yonder, or similar).

Job Description

We are seeking an experienced Site Reliability Engineering (SRE) Manager with a strong background in e-commerce/retail to lead reliability initiatives, manage production application support, and collaborate with onshore and offshore teams. The candidate will be responsible for monitoring, automating, and improving the reliability, performance, and availability of critical applications.

Key Responsibilities

  • Oversee production application support and coordinate with 24x7 offshore teams.
  • Define and implement monitoring requirements, service levels, and reliability goals.
  • Manage L1, L2, and L3 support for eCommerce platforms (e.g., Shopify, Blue Yonder, or similar).
  • Utilize monitoring tools (New Relic, PagerDuty, AppDynamics, Dynatrace, Splunk, Datadog, CloudWatch, ELK, Prometheus) for alerting, logging, dashboarding, and reporting.
  • Apply SRE principles (logs, metrics, SLIs, SLAs, ITIL processes, incident/change management, CAB, deployments, risk mitigation).
  • Lead critical incident (P1) calls, coordinate RCA activities, and communicate effectively with stakeholders.
  • Build and maintain SOPs, runbooks, and ITSM workflows (JIRA/ServiceNow/BMC Remedy).
  • Generate weekly/monthly status reports (WSR/MSR) for leadership review.

Required Skills & Experience

  • Proven experience as an SRE Lead/Manager in e-commerce/retail domain.
  • Strong production support expertise across applications/services.
  • Hands-on experience with monitoring/alerting tools (preferably New Relic & PagerDuty).
  • Familiarity with ITSM platforms and tools like Postman.
  • Strong knowledge of SRE principles, ITIL framework, and incident/change management.
  • Excellent communication and stakeholder management skills.
  • Ability to collaborate across global, cross-functional teams and time zones.

Soft Skills

  • Clear communicator (written & verbal) with ability to explain technical concepts.
  • Collaborative, innovative, and proactive mindset.
  • Strong organizational skills with the ability to prioritize in a fast-paced environment.
  • Can-do attitude with a focus on ownership and accountability.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About Learn Beyond Consulting LLC