Senior Software Engineer: Site Reliability Engineering

• Posted 1 day ago • Updated 1 day ago
Full Time
On-site
Fitment

Dice Job Match Score™

✨ Finding the perfect fit...

Job Details

Skills

  • Financial Services
  • Banking
  • Payments
  • Optimization
  • Scalability
  • Immigration
  • Onboarding
  • Testing
  • Budget
  • Scripting
  • GitHub
  • Provisioning
  • Configuration Management
  • Management
  • Vulnerability Management
  • Migration
  • Data Centers
  • Collaboration
  • DevOps
  • Software Development
  • Documentation
  • FOCUS
  • Amazon Web Services
  • Microsoft Azure
  • Terraform
  • Ansible
  • Continuous Integration
  • Continuous Integration and Development
  • Continuous Delivery
  • Windows PowerShell
  • Python
  • Golang
  • Linux
  • POSIX
  • Microsoft Windows Administration
  • Computer Networking
  • Firewall
  • Regulatory Compliance
  • Payment Card Industry
  • Computer Science
  • Information Technology
  • Cloud Computing
  • Google Cloud Platform
  • Google Cloud
  • SQL
  • NoSQL
  • Database
  • Grafana
  • Reliability Engineering
  • Service Level
  • Root Cause Analysis
  • FAR
  • Finance
  • Fraud
  • Sustainability
  • Innovation
  • Recruiting
  • Performance Management
  • Promotions
  • Training
  • Military
  • Law

Summary

At Jack Henry, we're more than a technology company, we're a force for good in financial services. We're redefining how community banks and credit unions connect with the people they serve. Our mission is rooted in people inspired innovation, empowering financial institutions to deliver seamless, secure, and human centered experiences. We deliver cutting-edge solutions that are paving the way for the next generation of digital banking and payments, but our true impact begins with our associates. If you're ready to help transform an industry and grow with a company that values purpose, collaboration, and excellence then we'd love to meet you. Our

Enterprise Information Technology (EIT) organization is expanding, and we are seeking a Senior Site Reliability Engineer to help drive a major architectural modernization. In this role, you will move beyond traditional infrastructure maintenance to build a proactive, engineering-led ecosystem across our large-scale multi-cloud and co-location footprint.

Working closely with cross-functional IT teams and business units, you will help design and implement standards for our hybrid cloud datacenter model. A primary focus of this position will be the architectural redesign, optimization, and migration of legacy on-premises workloads into Google Cloud Platform (Google Cloud Platform), ensuring everything we build adheres to rigorous SRE principles.

Mission & Impact:

Everything as Code: Drive repository-led management across our public and private cloud environments to establish consistency and eliminate manual configuration drift.

Engineering over Toil: Heavily leverage Infrastructure as Code (IaC) to automate repetitive tasks, build smooth "paved roads" for product teams, and develop self-healing systems.

SLO-Driven Architecture: Help shift our operational focus from traditional component monitoring to user-facing symptoms, defining meaningful SLOs and error budgets.

Modernization & Migration: Lead the technical execution of re-architecting and redeploying on-premises services into Google Cloud Platform, ensuring scalability, performance, and long-term reliability.

This position may be worked remotely within the United States, with the exception of California.

This position is ineligible for immigration sponsorship and support. Please do not apply if at any time you will need immigration support now or in the future (i.e., H-1B, PERM). All positions, regardless of location, may require an onsite interview or in-person onboarding requirement to verify your identity.

What you'll be responsible for:

Service Reliability and Performance: Drive the reliability and performance of both public cloud (production, testing, and development) and internal server infrastructure environments.

SRE Practice Implementation: Design and implement robust Site Reliability Engineering practices, including defining and monitoring Service Level Objectives (SLOs) and Service Level Indicators (SLIs), focusing on proactive system health and error budgets.

Automation and Toil Reduction: Ruthlessly eliminate manual, repetitive work (toil) through automation. Develop and maintain automation scripts and tooling to streamline operations across the hybrid datacenter model (on-premises and public cloud).

Implement "Everything as Code": Treat the cloud and on-prem operational environment as a software project by using Infrastructure as Code (IaC) with tools like Terraform, Ansible, GitHub for provisioning and configuration.

Configuration and State Management: Design and maintain rigorous configuration management processes to guarantee the consistency and desired state of the hybrid datacenter infrastructure, leveraging tools like Ansible.

Observability and Alerting: Establish and manage comprehensive monitoring and alerting systems to provide deep visibility into the health and performance of services. Build systems that are self-healing and advocate for themselves.

Post-Mortems and Root Cause Analysis (RCA): Lead blameless post-mortems and RCAs for critical incidents, focusing on system-level improvements to prevent recurrence and enhance overall reliability.

Security and Compliance Automation: Develop and implement strategies for efficient patch and vulnerability management across all environments. Automate security remediation efforts to ensure timely vulnerability mitigation and compliance (e.g., CIS, NIST, PCI).

Cloud Evolution and Migration: Support the company's strategic growth into public cloud services (Google Cloud Platform, Azure) and play a key role in the migration and redesign of services from on-premises data centers to Google Cloud Platform, ensuring adherence to SRE principles throughout the transition.

Cross-Functional Collaboration: Partner closely with DevOps and development teams to embed reliability best practices throughout the software development lifecycle, ensuring seamless integration and operation of hybrid datacenter services.

Documentation: Maintain comprehensive and actionable documentation for SRE processes, operational runbooks, and configurations.

May perform other duties as assigned.

What you'll need to have:

Minimum 6 years of experience in cloud and hybrid datacenter operations with a focus on Infrastructure as Code (IaC) and Site Reliability Engineering.

Proficiency with Google Cloud Platform (preferred), AWS, and/or Azure.

Proficient in using GitOps, Terraform and Ansible in a CI/CD (continuous integration and continuous delivery) pipeline.

Experience using PowerShell, Python, or GoLang.

Solid understanding of Linux (POSIX) and Windows System administration as well as networking and firewalls.

Understanding of security best practices and compliance standards such as CIS, NIST and PCI.

Ability to participate in an on-call rotation every 7-8 weeks.

What would be nice for you to have:

Bachelor's degree in Computer Science Information Technology, Engineering.

Relevant industry certifications. Google Associate Cloud Engineer or Google Cloud Architect preferred.

Proficient in ArgoCD and GitOps.

Familiarity with SQL and NoSQL databases.

Experience with Open Telemetry tooling and alerting such as Prometheus, Grafana, ELK Stack, et al.

Experience with Site Reliability Engineering (SRE) principles, including but not limited to Service Level Objectives (SLO) and Service Level Indicators (SLI), TOIL Reduction, Automation, and Root Cause Analysis.

If you got this far, we hope you're feeling excited about this opportunity. Even if you don't feel you meet every single requirement on this posting, we still encourage you to apply. We're looking for passionate, driven individuals who align with our mission and can bring unique perspectives to our team.

Why Jack Henry?

At Jack Henry, we live by the motto: "Do the right thing, do whatever it takes, and have fun." It's more than a tagline, it's the foundation of our culture. We recognize that our associates are the key to our success, and we're deeply committed to their wellbeing. That's why we offer comprehensive benefits designed to support your physical, mental, and financial health so you can thrive both personally and professionally.

We're also leading the way in technology modernization, helping financial institutions evolve with speed, security, and flexibility. Our strategy focuses on delivering secure data access, mitigating fraud, and enabling seamless integration. Empowering our teams to build innovative solutions that meet the evolving needs of accountholders.

Culture of Commitment

Ask our associates why they love Jack Henry, and many will tell you it is because our culture is exceptional. We do great things together. Our culture empowers us to rise to challenges, seek new opportunities, and support one another through change. It's this shared commitment that drives our success. We're proud to foster an environment where inclusion, sustainability, and community impact are more than values, they're how we operate. Visit our Corporate Sustainability site to learn more about our culture and commitment to our people, customers, community, environment, and shareholders.

Equal Employment Opportunity

At Jack Henry, we know we are better together. We value, respect, and protect the uniqueness each of us brings. Innovation flourishes by including all voices and makes our business - and our society - stronger. Jack Henry is an equal opportunity employer and we are committed to providing equal opportunity in all of our employment practices, including selection, hiring, performance management, promotion, transfer, compensation, benefits, education, training, social, and recreational activities to all persons regardless of race, religious creed, color, national origin, ancestry, physical disability, mental disability, genetic information, pregnancy, marital status, sex, gender, gender identity, gender expression, age, sexual orientation, and military and veteran status, or any other protected status protected by local, state or federal law.

No one will be subject to, and Jack Henry prohibits, any form of discipline, reprisal, intimidation, or retaliation for good faith reports or complaints of discrimination of any kind, pursuing any discrimination claim, or cooperating in related investigations.

Requests for full corporate job descriptions may be requested through the interview process at any time.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 90922487
  • Position Id: 24290431
  • Posted 1 day ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

New York, New York

Today

Full-time

USD 89,000.00 - 178,000.00 per year

New York, New York

Today

Full-time

USD 156,364.00 - 279,957.00 per year

Jersey City, New Jersey

Today

Full-time

USD 133,000.00 - 185,000.00 per year

New York, New York

Today

Full-time

USD 111,000.00 - 218,000.00 per year

Search all similar jobs