Lead Site Reliability Engineer - Applications/Domains

Overview

On Site
Full Time

Skills

Finance
Insurance
Brand
Microsoft TFS
Customer Experience
Employment Authorization
Financial Services
IT Operations
FOCUS
Professional Development
Continuous Integration
Continuous Delivery
Build Automation
Budget
Service Level
Problem Management
Continuous Improvement
Software Design
Scalability
GitHub
Dynatrace
Problem Solving
Conflict Resolution
Reliability Engineering
Terraform
Scripting
Python
Grafana
Cloud Computing
Google Cloud Platform
Google Cloud
Microsoft Azure
Orchestration
Kubernetes
Effective Communication
Amazon Web Services
DevOps
Management
Workflow
Collaboration
Teamwork
Taxes
Health Care
FSA
Military
Law

Job Details

Overview

Who we are

Collaborative. Respectful. A place to dream and do. These are just a few words that describe what life is like at Toyota. As one of the world's most admired brands, Toyota is growing and leading the future of mobility through innovative, high-quality solutions designed to enhance lives and delight those we serve. We're looking for talented team members who want to Dream. Do. Grow. with us.

An important part of the Toyota family is Toyota Financial Services (TFS), the finance and insurance brand for Toyota and Lexus in North America. While TFS is a separate business entity, it is an essential part of this world-changing company, delivering on Toyota's vision to move people beyond what's possible. At TFS, you will help create a best-in-class customer experience in an innovative, collaborative environment.

To save you time when applying, please note that Toyota does not offer sponsorship of job applicants for employment-based visas or any other work authorization for this position at this time.

Who we're looking for
Toyota Financial Services is building a new Site Reliability Engineering (SRE) team for Domain Applications, and we are seeking a Lead SRE to ensure the reliability, performance, and availability of the applications within each domain.

As a Lead SRE for applications, you will work with development engineers, product owners, SRE Infrastructure, production engineers, and Technology Operations Center personnel, with a primary focus on improving observability, automation, overall system health, reliability, and uptime.

What you'll be doing

  • Design, code, and maintain automation to streamline operations, reduce manual tasks, and improve system efficiency to enable a robust application environment.
  • Work with observability engineers to enable actionable insights into application and infrastructure health and performance.
  • Foster a collaborative team culture and support professional development.
  • Ensure scalable and repeatable code deployments with CI/CD pipelines using GitHub and Harness, and repeatable infrastructure deployments with infrastructure as code (IaC) using Terraform.
  • Build automation and operational runbooks primarily using Python scripting.
  • Manage container orchestration platforms and related cloud-native services.
  • Drive reliability improvements through Service Level Objectives (SLOs), error budgets and Service Level Agreements (SLAs) aligned with business goals.
  • Design & implement observability improvements using Dynatrace & CloudWatch.
  • Lead major incident response, coordinate with stakeholders through resolution, and drive problem management to prevent recurrence.
  • Conduct blameless post-incident reviews and drive continuous improvement.
  • Collaborate cross-functionally to embed SRE principles into application design and operation so that applications meet their reliability goals.
  • Participate in architectural reviews, providing input on reliability and scalability.


What you bring

  • Experience with DevOps tools like GitHub, Harness & Dynatrace.
  • Experience building self-healing systems and automated remediation workflows.
  • 5+ years of experience in Site Reliability Engineering, DevOps, or a related field.
  • Demonstrated problem-solving ability and command of key SRE/DevOps concepts and tools, with a proven track record of achieving high system reliability and performance.
  • Strong experience with Terraform for AWS IaC.
  • Proficiency in scripting and automation with Python, and familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack).
  • Deep knowledge of container orchestration (Kubernetes/EKS).
  • Deep understanding of cloud platforms (e.g., AWS, Google Cloud Platform, Azure).
  • Effective communication skills, with the ability to convey complex technical concepts to diverse audiences.


Added Bonus if you have

  • AWS certifications (DevOps Engineer, Solutions Architect, etc.).
  • Familiarity with GitOps, secrets management, and infrastructure monitoring best practices.


What we'll bring

During your interview process, our team can fill you in on all the details of our industry-leading benefits and career development opportunities. A few highlights include:

  • A work environment built on teamwork, flexibility, and respect
  • Professional growth and development programs to help advance your career, as well as tuition reimbursement
  • Team Member Vehicle Purchase Discount
  • Toyota Team Member Lease Vehicle Program (if applicable)
  • Comprehensive health care and wellness plans for your entire family
  • Toyota 401(k) Savings Plan featuring a company match, as well as an annual retirement contribution from Toyota regardless of whether you contribute
  • Paid holidays and paid time off
  • Referral services related to prenatal services, adoption, childcare, schools and more
  • Tax Advantaged Accounts (Health Savings Account, Health Care FSA, Dependent Care FSA)
  • Relocation assistance (if applicable)


Belonging at Toyota

Our success begins and ends with our people. We embrace all perspectives and value unique human experiences. Respect for all is our North Star. Toyota is proud to have 10+ different Business Partnering Groups across 100 different North American chapter locations that support team members' efforts to dream, do and grow without questioning that they belong.

Applicants for our positions are considered without regard to race, ethnicity, national origin, sex, sexual orientation, gender identity or expression, age, disability, religion, military or veteran status, or any other characteristics protected by law.

Have a question, need assistance with your application or do you require any special accommodations? Please send an email to .

About Toyota Motor North America