Systems Reliability Team Manager - SRTM 26-02522

Hybrid in Oakland, CA, US • Posted 7 hours ago • Updated 7 hours ago
Contract Corp To Corp
Contract W2
Contract Independent
No Travel Required
Hybrid
$120+

Job Details

Skills

  • Vendor Relationships
  • Software Development Methodology
  • Systems Management
  • Vendor Management
  • Product Management
  • Product Strategy
  • Production Support
  • Regulatory Compliance
  • Continuous Integration
  • Data Security
  • Documentation
  • Software Development
  • Storage
  • Performance Metrics
  • Reliability Engineering
  • Risk Management
  • Incident Management
  • Information Systems
  • Leadership
  • Management
  • Privacy
  • Communication
  • Configuration Management
  • Continuous Delivery
  • Continuous Improvement
  • Accountability
  • Auditing
  • Business Strategy
  • Change Management
  • Collaboration
  • Roadmaps

Summary

Job Title: Systems Reliability Team Manager

Location: Oakland, CA
Work Model: Hybrid preferred (currently 2 days onsite). Remote candidates may be considered to expand the candidate pool.
Duration: 9 Months


Job Overview

We are seeking a Systems Reliability Team Manager to lead a team responsible for supporting system reliability, incident resolution, and product delivery coordination within a digital experience and technology organization.

This leadership role combines product management responsibilities with Software Development Life Cycle (SDLC) oversight. The position operates at the front end of the development lifecycle, translating business needs into clear requirements for technology teams responsible for building and maintaining enterprise applications.

The manager will oversee a team responsible for production support, data incident troubleshooting, and system defect resolution, while ensuring platform reliability, stability, and continuous improvement.

In addition to incident and operational oversight, the role is responsible for platform roadmap planning, policy compliance, portfolio delivery coordination, vendor management, and digital product strategy.

Experience with pension or retirement administration systems is preferred but not required.


Key Responsibilities

Leadership & Strategy

  • Lead and manage the Systems Reliability team, providing direction for operational priorities and platform reliability initiatives.

  • Translate organizational strategy into operational priorities and technical deliverables.

  • Define and manage the platform reliability roadmap, ensuring alignment with enterprise strategy and funding priorities.

  • Establish business priorities for technology delivery and system improvements.

  • Oversee the ongoing performance, stability, and evolution of the enterprise platform.


Platform Reliability & Operations

Direct operational activities related to platform reliability, including:

  • Change management processes

  • Release management policies

  • Incident management and response procedures

  • Availability and system performance monitoring

  • Platform performance metrics and KPIs

Ensure accountability for system uptime, stability, and operational performance.


Vendor & Product Strategy

  • Manage vendor relationships and contract performance for core technology platforms.

  • Oversee product strategy initiatives focused on continuous process and system improvements.

  • Coordinate platform delivery initiatives across the broader technology portfolio.


Qualifications

  • Bachelor's degree or equivalent experience.

  • Experience with portfolio governance, policy development, and enterprise systems management.

  • Demonstrated ability to translate organizational strategy into operational standards, processes, and accountability frameworks.


Knowledge, Skills, and Abilities

The candidate must demonstrate the ability to ensure consistent standards across teams while aligning operational practices with enterprise policies.

Platform & Systems Knowledge

Strong understanding of enterprise platform components including:

  • APIs and integration frameworks

  • Messaging systems

  • Data storage systems

  • Environment management

  • Release management processes


Incident & Reliability Management

  • Experience implementing incident management processes and post-incident analysis practices.

  • Ability to ensure continuous learning and improvement following incidents.


Observability & Monitoring

Experience ensuring proper use of observability tools, including:

  • System logs

  • Metrics monitoring

  • Distributed tracing

  • Alert configuration and tuning


DevOps & Delivery Practices

  • Experience supporting change and configuration management aligned with governance and audit requirements.

  • Understanding of CI/CD pipelines and automated deployment processes.

  • Ability to apply appropriate risk management practices during deployments.


Security & Compliance

  • Understanding of basic security practices and vulnerability remediation processes.

  • Ability to ensure compliance with security and operational policies.


Documentation & Collaboration

  • Ensure consistent documentation standards including:

    • Operational runbooks

    • Architecture diagrams

    • Knowledge base documentation

  • Facilitate collaboration and communication between:

    • Design teams

    • Engineering teams

    • Quality control teams

    • Cross-functional stakeholders


Special Conditions

The selected candidate must exercise strict confidentiality and discretion when managing sensitive information encountered during the course of work.

Sensitive information may include:

  • Employee and member records

  • Health-related information

  • Financial data

  • Strategic plans

  • Proprietary or confidential organizational information

The candidate must ensure that such information is only shared with individuals who have a legitimate business need and must follow all organizational policies and applicable regulations related to data protection and privacy.

This includes the secure handling of physical and digital records and proper use of information systems to prevent unauthorized access or disclosure.

Unauthorized disclosure of confidential work-related information is considered a violation of organizational policies and expectations.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10468931
  • Position Id: SRTM 26-02522