SRE Manager

  • Sunnyvale, CA
  • Posted 60+ days ago | Updated 6 hours ago

Overview

On Site
Full Time

Skills

Business-to-business
Operational Excellence
Mentorship
Incident Management
Root Cause Analysis
Software Architecture
Management
Scalability
Computer Science
Reliability Engineering
Software Engineering
Leadership
DevOps
Terraform
Kubernetes
Managed Services
Amazon Web Services
Amazon EC2
Amazon S3
Amazon RDS
Remote Desktop Services
IaaS
Microservices
High Availability
Load Balancing
Instrumentation
Computer Networking
Virtual Private Cloud
Programming Languages
Python
Golang
Grafana
Splunk
Communication
Collaboration
Network
Cloud Computing
Fortinet
Military

Job Details

Job Description

At Fortinet, we strive to provide a supportive, collaborative environment where people are empowered to do the best work of their careers.

Our team members enjoy solving complex problems, and obsess over getting the details right. We love what we do and are proud of our work to secure clouds and container environments for thousands of B2B customers worldwide.

We are looking for a highly skilled Site Reliability Engineering (SRE) Manager to lead our SRE team in building scalable, reliable, and secure infrastructure that ensures the highest levels of availability and performance.

Job Summary:

As an SRE Manager, you will be responsible for leading a team of Site Reliability Engineers who design, build, and maintain resilient systems. You will play a critical role in enhancing system reliability, improving incident response, automating operations, and driving best practices in infrastructure management. The ideal candidate will have a strong background in software engineering, cloud infrastructure, and operational excellence.

Key Responsibilities:
  • Lead, mentor, and grow a team of Site Reliability Engineers.
  • Develop and implement strategies to improve system reliability, observability, and automation.
  • Establish and maintain SLIs, SLOs, and SLAs to ensure high availability and performance.
  • Drive incident response, root cause analysis, and postmortem processes.
  • Collaborate with software engineering teams to improve application architecture and resiliency.
  • Manage cloud-based infrastructure (AWS) and ensure best practices for security and scalability.
  • Collaborate with cross-functional teams, including developers, security, and product teams.
  • Stay updated with industry trends and introduce new tools and methodologies to enhance reliability and efficiency.

Required Qualifications:
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
  • 7+ years of experience in Site Reliability Engineering, DevOps, or Software Engineering roles.
  • 3+ years of experience in a leadership or managerial role within an SRE or DevOps team.
  • Extensive experience with Infrastructure as Code (Terraform, etc.), as well as supporting tooling (Atlantis, ArgoCD, etc.).
  • Extensive experience with Kubernetes and supporting tooling (Helm, operators, etc.).
  • Extensive experience with a variety of cloud-managed services and providers.
    • AWS: EKS, EC2, S3, RDS, Secrets Manager, etc.
  • Experience building production-quality cloud infrastructure that enables reliable and rapid deployment of microservices with effective monitoring and built-in high availability and/or fault tolerance.
  • Strong cross-team communication skills.
  • Experience with the building blocks of large-scale systems, including load balancing, distributed/cloud computing, containers, instrumentation, and monitoring.
  • Knowledge of cloud networking, including VPC configuration and cross-cloud connectivity.
  • Familiarity with one or more programming languages (Python, Golang, etc.).
  • Deep understanding of observability tools (Prometheus, Grafana, Splunk, ELK Stack).
  • Excellent communication and collaboration abilities.

About Us

Fortinet (NASDAQ: FTNT) secures the largest enterprise, service provider, and government organizations around the world. Fortinet empowers its customers with intelligent, seamless protection across the expanding attack surface and the power to take on ever-increasing performance requirements of the borderless network - today and into the future. Only the Fortinet Security Fabric architecture can deliver security without compromise to address the most critical security challenges, whether in networked, application, cloud or mobile environments. Fortinet ranks number one in the most security appliances shipped worldwide and more than 500,000 customers trust Fortinet to protect their businesses.

We are committed to providing reasonable accommodations for all qualified individuals with disabilities. If you require assistance or accommodation due to a disability, please contact us at

Fortinet is an equal opportunity employer. We value diversity in our company, and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, age, military/veteran status or any other applicable legally protected characteristics in the location in which the candidate is applying.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.