Sr. Manager, Reliability Engineering and Operations

Overview

On Site
USD 149,000.00 - 239,000.00 per year
Full Time

Skills

Creative Problem Solving
Finance
Performance Metrics
Collaboration
IT Operations
Design Architecture
Roadmaps
Trading
Software Development Methodology
Thought Leadership
Software Development
Reliability Engineering
Cloud Computing
IaaS
DevOps
Leadership
Performance Management
Talent Management
Budget
Capacity Management
Management
Continuous Integration
Continuous Delivery
Bitbucket
Bamboo
GitHub
Jenkins
Splunk
Grafana
Programming Languages
Python
Terraform
NoSQL
Database
MongoDB
IBM WebSphere MQ
RabbitMQ
Apache Kafka

Job Details

Your Opportunity

At Schwab, you're empowered to make an impact on your career. Here, innovative thought meets creative problem solving, helping us "challenge the status quo" and transform the finance industry together.

We believe in the importance of in-office collaboration and fully intend for the selected candidate for this role to work on site in the specified location(s).

As a Sr. Manager in Client Trading Experience Technology (CTET), you will lead the System Availability Engineering (SAvE) teams for the Trading platform, playing a critical role in ensuring the availability of Trading systems and guiding the development, automation, tooling and realization of SRE best practices. You will be responsible for:
  • Identifying tactical and strategic opportunities to improve service health, performance, reliability and telemetry across the Trading platform.
  • Leading the team with a data-driven mindset, focusing on key performance metrics such as MTTD, MTTR and availability, in close collaboration with the Trading development and IT Operations teams.
  • Leading the design, architecture and implementation of an availability and resiliency roadmap that delivers modernized tooling and metrics.
  • Working closely with the development team to define a sustainable operating model for Trading applications, focusing on platform scale, availability, fault tolerance and performance.
  • Leading automation and infrastructure-as-code (IaC) practices so that teams follow patterns that ensure repeatability, consistency and portability.
  • Identifying toil and technical debt, developing a comprehensive plan to address them and leading the team through its execution.
  • Driving a shift-left mindset and influencing architectural decisions to ensure resiliency and scale from the outset of the software development process.
  • Being a hands-on technical leader who leads the team from the front and inspires thought leadership within the team.

What you have

Required Qualifications:
  • 10+ years of software development and site reliability engineering experience supporting production applications in any public cloud environment, PCF and IaaS.
  • 7+ years in DevOps engineering leadership, focusing on complementing production operations with automation and tooling initiatives.
  • 7+ years of people leadership supporting highly technical individuals, including performance management, talent development, driving efficiencies and talent engagement.
  • 5+ years of experience defining, driving and implementing operational best practices (SLOs, SLIs, error budgets, error monitoring, capacity planning, blameless postmortems and toil management).
  • 5+ years of experience with CI/CD tools and with logging, observability and telemetry solutions (Bitbucket, Bamboo, GitHub, Jenkins, Datadog, Splunk, Prometheus, Grafana, etc.).
  • Proficiency in programming languages and tools for automating repeatable processes and building infrastructure-as-code (IaC) solutions (Python, CloudFormation, Terraform).
  • Knowledge of NoSQL databases (Aerospike, MongoDB preferred).
  • Knowledge of IBM MQ, RabbitMQ and Kafka.

In addition to the salary range, this role is also eligible for bonus or incentive opportunities.