Career Opportunity:
Job Title: SRE Program Manager
About CodeForce 360
Making a career choice is amongst the most critical choices one can make, and it’s important for the choice to be calculated with factors such as a company’s run of success since its inception and more. But, when you come across a company that has reputation proven with nothing but an illustrious run of success since the day it began, you don’t need to think of anything else. That’s precisely what some of our employees and prospective employees think when they came across CodeForce 360.
Position Overview
SRE Program Manager
Requirements:
Experience: Minimum 7–10 years in program management, with at least 3–5 years leading large-scale technology initiatives in SRE, DevOps, or Cloud Infrastructure environments.
Technical Literacy: Strong understanding of core SRE principles (SLIs/SLOs, error budgets, toil reduction) and modern infrastructure stacks like Kubernetes, Terraform, and major cloud providers (AWS/Google Cloud Platform/Azure).
Leadership: Proven ability to influence without authority in a matrixed organization, building trust with both senior executives and individual engineers.
Methodology: Expertise in Agile/ Project management or Lean program management frameworks, adapted for infrastructure and operational work rather than just feature development.
Key responsibilities:
- Cross-Functional Strategy: Lead the outreach and alignment between SRE and different business units to define Service Level Objectives (SLOs) and error budgets that balance innovation with stability.
- Integrated Program Planning: Own the end-to-end delivery plan for SRE initiatives—such as platform hardening, disaster recovery testing, and observability rollouts—sequencing workstreams across infrastructure and application teams.
- Dependency & Risk Management: Proactively identify and resolve technical and process-based dependencies between SRE, Product Engineering, Security, and external vendors.
- Governance & Reporting: Establish a lightweight but rigorous operating cadence, including weekly execution reviews and executive status reporting. Use data-driven dashboards to provide visibility into operational readiness and risk.
- Incident Management: Coordinate cross-team incident response for major outages and lead blameless incident review cultures to ensure learnings are turned into actionable roadmap items.
- Intake and Prioritization Ownership: Establish and manage a structured intake process across SRE initiatives, ensuring clear prioritization aligned to business impact, risk, and strategic objectives. Partner closely with business and engineering leaders to continuously refine priorities.
- Execution & Accountability Driver: Act as the central point of coordination to ensure initiatives move forward with clarity and urgency. Proactively track deliverables, remove blockers, and hold cross-functional teams accountable to timelines and outcomes.
- Cross-functional Orchestration: Operate effectively across a highly matrixed environment, coordinating across Product, Engineering, SRE, Architecture, and Business stakeholders to ensure alignment and sustained momentum on key initiatives.
- Outcome Narrative & Executive Communication: Translate technical work into clear business outcomes. Build and maintain executive-level reporting that highlights progress, value delivered, risks, and key wins across the SRE portfolio.
- Operating cadence & Governance: Drive a consistent execution rhythm (weekly, monthly) across initiatives, ensuring progress is measurable, visible, and tied to defined success metrics.
- Portfolio Visibility & Impact Tracking: Create and maintain a centralized view of all SRE initiatives, including status, dependencies, risks, and realized impact (e.g., reliability improvements, incidents reduction, operational efficiency).
How to Apply
Job ID: JPC - 228982
For more information, please contact below:
Bhushan Reddy
Qualified individuals will be contacted for an interview.