reliability engineer Jobs in california

Refine Results
181 - 200 of 206 Jobs

Lead Site Reliability Engineer II, Production Engineering

Cisco Systems, Inc.

San Francisco, California, USA

Full-time

Who We Are Cisco ThousandEyes is a Digital Experience Assurance platform that empowers organizations to deliver flawless digital experiences across every network - even the ones they don't own. Powered by AI and an unmatched set of cloud, internet and enterprise network telemetry data, ThousandEyes enables IT teams to proactively detect, diagnose, and remediate issues - before they impact end- user experiences. ThousandEyes is deeply integrated across the entire Cisco technology portfolio and

Sr. Linux Site Reliability Engineer, IT Manufacturing Site Reliability Engineering

Tesla Motors

Fremont, California, USA

Full-time

We are seeking an enthusiastic SRE to join our dynamic IT Manufacturing Site Reliability Engineering (ITMFG-SRE) team at Tesla. Our team is responsible for building and managing an ecosystem of applications and platforms essential to manufacturing. As a Linux SRE, this role requires experience with hardware, software, networking, and automation to implement scalable solutions for manufacturing sites globally. You'll play a key role in maintaining, optimizing and scaling our infrastructure to sup

Senior Lead Site Reliability Engineer - Remote

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Would you enjoy improving stability and safety of one of the largest global networks? \n Would you enjoy hands-on network operations work on a global scale to improve our operational efficiency? \n Join our Platform Security Engineering Team \n The Platform Security Engineering team is a group of engineers that support and secure Akamai's global network and Linode cloud systems. Our systems provide data security, server integrity, network access, and secure communications infrastructure. This is

Sr. Site Reliability Engineer, Compute SRE

Roblox

San Mateo, California, USA

Full-time

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences- all created by our global community of developers and creators. At Roblox, we're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device.We're on a mission to connect a billion people with op

Sr. Site Reliability Engineer, Bare Metal, Infrastructure

Tesla Motors

Remote or Austin, Texas, USA

Full-time

Tesla cloud as a service seeks a high impact Site Reliability Engineer (SRE) to support our bare-metal provisioning platform at scale. You'll provide direct support to internal customers, resolve complex provisioning issues, and escalate systemic problems to engineering. Your focus: ensuring reliable, automated delivery of bare-metal infrastructure using Kubernetes, Metal , and industry standard tooling across diverse hardware from Supermicro, HPE, and Dell. Responsibilities Provide frontline s

Sr. Site Reliability Engineer, Integration Tools

Tesla Motors

Palo Alto, California, USA

Full-time

The Integration Platforms team develops and operates critical technology to support our ever-expanding customer fleet from prototype to production. As an SRE on this team, you will ensure the reliability, scalability, and performance of our on-vehicle, desktop-based, and web-based systems, collaborating closely with software engineers to design, build, and operate these systems across multiple regions. Join us and you will work alongside world-class software and data engineers on some of the new

Sr. Site Reliability Engineer, Dojo

Tesla Motors

Palo Alto, California, USA

Full-time

We are seeking an experienced Site Reliability Engineer (SRE) to join our team responsible for ensuring the reliability, performance of our Dojo cluster infrastructure. The successful candidate will be responsible for providing exceptional customer response and support, managing third-party systems, and collaborating with various teams to ensure seamless operations. If you have a passion for troubleshooting, automation, and collaboration, we encourage you to apply. Responsibilities Respond to c

Sr. Site Reliability Engineer- C++ - Remote

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Are you a Site Reliability Engineer who loves solving complex problems? \n Do you relish the opportunity to create solutions and make an impact? \n Join our innovative Security Technology Group \n Our Security Edge team designs and develops software that runs one of the world's largest distributed systems. This network allows us to solve problems at a scale that few others can approach. We take pride in helping our customers protect their web sites and transactions over the Internet. \n Partner

Site Reliability Engineer - SRE - Remote position (Required Windows IIS, .NET) - W2 only

Infinite Computer Solutions (ICS)

Remote

Full-time

Building and maintaining GitLab pipelines for .NET applications Hands-on experience with Dynatrace for application performance monitoring and integration of services.Working knowledge of Windows IIS (Internet Information Services) for hosting and troubleshooting.Good understanding and usage of Kafka.Expertise in PowerShell scripting for automation and deployment tasks.Actively involved in identifying and resolving vulnerabilities and security issues.

Senior SRE Engineer (R3386)

Shield AI Inc

San Diego, California, USA

Full-time

Founded in 2015, Shield AI is a venture-backed defense technology company with the mission of protecting service members and civilians with intelligent, autonomous systems. Its products include Hivemind Enterprise-EdgeOS, Pilot, Commander, and Forge-as well as V-BAT and Sentient Vision Systems (wide-area motion imaging software). With offices in San Diego, Dallas, Washington, D.C., Abu Dhabi (UAE), Kyiv (Ukraine), and Melbourne (Australia), Shield AI's technology actively supports U.S. and allie

Site Reliability Engineer Graduate (TikTok Product - USDS) - 2025 Start (BS/MS)

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A224009 Apply to this job Share this listing: Responsibilities About the Team Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed and fault-tolerant systems. Product SREs help ensure the reliability and uptime for the services underpinning the TikTok product. Our team pays great attention to optimizing existing systems, working closely with cross functional

SRE Engineer

Synergis

Remote

Full-time

Job Title: SRE Engineer Job Location: Remote Type: Direct Hire * Status Required *must have prior experience in a SAAS Based Software Company and a startup / or small company environment Synergis client, a software organization focused on an AI powered, unified platform for data discovery, observability, and governance. The Site Reliability Engineer will design and implement automations on their Cloud Infrastructure SRE Engineer Background and Scope Ensure the organization has security policies

FedRAMP Site Reliability Engineer - Early Career (; Boulder, CO or Raleigh, NC ONLY)

Splunk Inc.

Colorado, USA

Full-time

Description This is a US-based position. Candidates must be able to support FedRAMP High. This role is based in the Boulder, CO office or Raleigh, NC office, and will require relocation. Splunk is here to build a safer and more resilient digital world. The world's leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. While customers love our technology, it's our people that make Splunk stand out as an amazing career destinati

Site Reliability Engineer L4/L5 - Live Cloud Platform SRE

Netflix, Inc.

Remote

Full-time

Netflix is one of the world's leading entertainment services, with 283 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time. Netflix has been changing how people watch shows and movies, enabling on-demand access to thousands of movies and TV shows. Recently, Netflix has expanded its entertainment

Application Support Tech Lead Analyst

Citi

Remote or Jersey City, New Jersey, USA

Full-time

Citibank, N.A. seeks an Application Support Tech Lead Analyst for its Jersey City, NJ location. Duties: Adopt new technologies and promote Site Reliability Engineer culture. Design, develop, and deploy various technology tools and Citi internal applications. Leverage Generative AI technologies to enhance operational efficiency and support capabilities. Participate in the development of a GenAI-powered chatbot aimed at streamlining tasks such as troubleshooting, query optimization, application-s

Site Reliability Engineer (Starshield) - Top Secret Clearance

SpaceX

Hawthorne, California, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER (STARSHIELD) - TOP SECRET CLEARANCE Starshield leverages SpaceX's Starlink technology and launch capability to support national security efforts. While Starlink is designed for consumer and c

Site Reliability Engineer II - Remote

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you enjoy collaborating with teams to solve complex challenges? Do you have a passion for cutting edge technologies and tackling system problems? Join our highly skilled Site Reliability team Our team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We specialize in creating solutions that manage Compute platform, focusing on identity and access management (IAM) solutions. We do this while maintaining Akamai's mission to ma

Site Reliability Engineer II - Cloud Networking - Remote

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you enjoy collaborating with teams to solve complex challenges? \n Do you have a passion for cutting edge technologies and tackling system problems? \n Join our highly skilled Site Reliability team \n Our team designs, develops, and manages applications and infrastructure that support Akamai's products and services. We specialize in creating solutions that help improve observability and enforce SLAs across all internal teams. We do all of this while maintaining Akamai's mission to make life b

Sr. Site Reliability Engineer (Starshield) - Top Secret Clearance

SpaceX

Hawthorne, California, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SR. SITE RELIABILITY ENGINEER (STARSHIELD) - TOP SECRET CLEARANCE Starshield leverages SpaceX's Starlink technology and launch capability to support national security efforts. While Starlink is designed for consumer a

Sr. Site Reliability Engineer, Energy Software

Tesla Motors

Palo Alto, California, USA

Full-time

Tesla is looking for a Site Reliability Engineer to build, enhance, and scale the infrastructure that underpins our Energy IoT applications. These applications provide real-time monitoring, optimization, and control for Tesla's industry-leading energy products, including Powerwall, Megapack, Solar Roof, Supercharger, Wall Connector, Autobidder, and Virtual Power Plants. We are a high-impact team that values curiosity, learning, mentorship, open discourse, and making disciplined decisions by weig