Senior Site Reliability Engineer - Google Cloud Platform

Overview

On Site
USD 150,000.00 - 175,000.00 per year
Full Time

Skills

Creative Problem Solving
Finance
Pivotal
Scalability
Teamwork
Reliability Engineering
Collaboration
Scrum
Management
Instrumentation
Capacity Management
Continuous Integration
Continuous Delivery
Orchestration
Workflow
Real-time
Product Development
Scripting
Dashboard
Software Development Methodology
Software Development
Process Improvement
System Administration
Microsoft Windows
Virtual Machines
SaaS
Migration
Google Cloud
Google Cloud Platform
Computer Networking
DNS
Dragon NaturallySpeaking
DHCP
Firewall
Routing
High Availability
Linux
Microsoft Windows Administration
.NET
Windows PowerShell
Java
Python
Bash
SQL
Oracle
MongoDB
Database
Actimize
RabbitMQ
IBM WebSphere MQ
Apache Kafka
Splunk
AppDynamics
Computer Science
Financial Services
Agile
Problem Solving

Job Details

Your Opportunity

At Schwab, you're empowered to make an impact on your career. Here, innovative thought meets creative problem solving, helping us challenge the status quo and transform the finance industry together.

We believe in the importance of in-office collaboration and fully intend for the selected candidate for this role to work on site in the specified location.

As a Senior Reliability Engineer, you'll play a pivotal role in shaping the reliability and scalability of our mission-critical applications, collaborating across teams to deliver solutions that matter. You'll be part of a culture that values integrity, teamwork, and personal growth-where your expertise drives real change and your career journey is supported every step of the way.

What You'll Do
  • Evangelize the Site Reliability Engineering (SRE) mindset and solve problems through systematization.
  • Identify opportunities to build innovative tools and address unique operational challenges for large-scale, enterprise applications.
  • Create scripts to automate operational tasks and integrate solutions into infrastructure.
  • Collaborate with Engineering, Scrum, and Operations teams to provide technical expertise and support key initiatives for system availability and reliability.
  • Triage alerts, diagnose/resolve critical issues, and manage implementation of changes.
  • Develop tools, frameworks, and instrumentation to validate and increase rollout success for applications.
  • Coordinate capacity planning and develop CI/CD orchestration systems to streamline software delivery.
  • Troubleshoot mission-critical application workflows in real time and incorporate feedback into product development.
  • Participate in on-call support rotations.

What you have

Required Qualifications
  • 6-8 years of enterprise-level administration and support experience.
  • 6-8 years of experience in automation scripting, dashboard creation for proactive monitoring, and alert setup.
  • 6-8 years practicing SDLC (Software Development Lifecycle) and process improvements.
  • Hands-on experience with enterprise systems administration, monitoring, and deployment.
  • Experience with Windows 2019/2022 and Linux (VM-hosted).
  • Cloud application configuration, deployment, support, and migration (Google Cloud Platform/PCF is a plus).
  • Knowledge of IP networking (DNS, DHCP, firewalls, IP routing).
  • Familiarity with large-scale distributed systems and high-availability architecture.
  • Linux and Windows system administration, troubleshooting, and tuning.
  • Development experience in .Net, PowerShell, Java, Python, or Bash.
  • Knowledge of SQL, Oracle, or MongoDB databases.
  • Working knowledge of Actimize.
  • Experience with message brokers (Solace, RabbitMQ, IBM MQ, Kafka).
  • Experience with Splunk, AppDynamics, or similar tools.
  • Bachelor's degree in computer science or related discipline.

Preferred Qualifications
  • Financial services industry experience.
  • Experience with Agile methodologies.

Personal Attributes
  • Strong customer orientation and proactive ownership of projects/issues.
  • Extreme sense of ownership in resolving problems in distributed environments.
  • Resolve to dig deeper into technical issues.
  • Self-starter with confidence to independently resolve issues and deliver results.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.