Software Engineer - Site Reliability Engineering

Remote in Foster City, CA, US • Posted 5 hours ago • Updated 5 hours ago
Full Time
On-site
USD $140,000.00 - 230,000.00 per year
Fitment

Dice Job Match Score™

🤯 Applying directly to the forehead...

Job Details

Skills

  • Reporting
  • Collaboration
  • Software Engineering
  • Systems Architecture
  • Root Cause Analysis
  • Business Continuity Planning
  • Disaster Recovery
  • Reliability Engineering
  • Cloud Computing
  • Amazon Web Services
  • Google Cloud
  • Google Cloud Platform
  • Microsoft Azure
  • Terraform
  • Ansible
  • Management
  • Orchestration
  • Kubernetes
  • Computer Networking
  • Storage
  • Database
  • Scripting
  • Python
  • C
  • C++
  • Java
  • Regulatory Compliance
  • Robotics
  • Machine Learning (ML)
  • LinkedIn
  • Innovation
  • Artificial Intelligence
  • Recruiting

Summary

Zoox is seeking a Site Reliability Engineer to help ensure the availability, performance, and resilience of the services that power the development and operation of our autonomous vehicles. In this role, you will own the full lifecycle of our services-from designing fault-tolerant, maintainable systems to deploying, operating, and continuously improving them in production. As a robotics company, Zoox embraces automation at every layer of our infrastructure, and you'll help drive that ethos forward. You'll work hands-on with systems that process massive volumes of data and support compute-intensive pipelines running on both CPUs and GPUs.

In this role, you will:

  • Architect and optimize scalable systems: You will design, implement, and continuously improve highly reliable infrastructure, directly impacting the success and safety of Zoox's autonomous vehicle platform.
  • >
  • Build proactive monitoring solutions: You will develop advanced monitoring, alerting, and reporting tools to ensure potential issues are identified and resolved before they affect production.
  • >
  • Collaborate across engineering: You will partner closely with software engineering teams to elevate our system architecture, streamline deployment processes, and drive automation initiatives.
  • >
  • Lead incident resolution: You will conduct thorough root cause analyses on production issues and rapidly deploy corrective actions to maintain a resilient and stable environment.
  • >
  • Ensure business continuity: You will safeguard the company's operations by designing and implementing robust disaster recovery plans to keep the Zoox fleet running smoothly under any circumstances.
  • >

Qualifications

  • SRE & Distributed Systems Experience: 5+ years of experience in site reliability engineering or a similar role, with a strong, objective background in managing large-scale distributed systems.
  • >
  • Cloud & Infrastructure as Code (IaC): Proven experience operating within major cloud platforms (AWS, Google Cloud Platform, or Azure) and utilizing IaC tools like Terraform, Ansible, Salt, or CloudFormation.
  • >
  • Container Orchestration: Technical expertise in deploying, managing, and scaling systems using container orchestration technologies such as Kubernetes.
  • >
  • Core Infrastructure Knowledge: Deep, foundational understanding of networking protocols, storage solutions, and database technologies.
  • >
  • Programming Proficiency: Strong, demonstrable programming and scripting skills in languages such as Python, Go, C/C++, or Java.
  • >

Bonus Qualifications

  • Experience in the automotive or autonomous vehicle industry.
  • >
  • Knowledge of security best practices and compliance requirements.
  • >

$140,000 - $230,000 a year

About Zoox

Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We're looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.

Follow us on LinkedIn

Accommodations

If you need an accommodation to participate in the application or interview process please reach out to [email protected] or your assigned recruiter.

A Final Note:

You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 80183258
  • Position Id: 89b15223295566b01e375b9a07c5e95a
  • Posted 5 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Palo Alto, California

Today

Full-time

USD 171,000.00 - 260,000.00 per year

Menlo Park, California

Today

Full-time

USD 200,000.00 - 287,500.00 per year

Menlo Park, California

Today

Full-time

USD 137,773.00 - 194,585.00 per year

Menlo Park, California

Today

Full-time

USD 160,000.00 - 210,000.00 per year

Search all similar jobs