Lead Site Reliability Engineer Jobs in California

41 - 60 of 71 Jobs

Senior Site Reliability Engineer - Observability (FedRAMP IL5)

Splunk Inc.

North Carolina, USA

Full-time

Description Join us as we pursue our ground-breaking vision to make machine data accessible, usable, and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk, we are committed to our work, customers, having fun, and most significantly to each other's success. The Splunk Observability Cloud provides full-fidelity monitoring and fixing across infrastructure, applications, and user inter

Sr. Site Reliability Engineer, Dojo

Tesla Motors

Palo Alto, California, USA

Full-time

We are seeking an experienced Site Reliability Engineer (SRE) to join our team responsible for ensuring the reliability and performance of our Dojo cluster infrastructure. The successful candidate will be responsible for providing exceptional customer response and support, managing third-party systems, and collaborating with various teams to ensure seamless operations. If you have a passion for troubleshooting, automation, and collaboration, we encourage you to apply. Responsibilities Respond to c

Senior Site Reliability Engineer, Observability, FedRAMP

Splunk Inc.

California, USA

Full-time

Description Splunk, a Cisco company, is building a safer and more resilient digital world with an end-to-end full stack platform made for a hybrid, multi-cloud world. Leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. Our customers love our technology, but it's our caring employees that make Splunk stand out as an amazing career destination. No matter where in the world or what level of the organization, we approach our wor

Sr. Site Reliability Engineer, Compute SRE

Roblox

San Mateo, California, USA

Full-time

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators. At Roblox, we're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We're on a mission to connect a billion people with op

Senior Site Reliability Engineer, Test Platform- REMOTE

Cisco Systems, Inc.

Remote or San Francisco, California, USA

Full-time

At Cisco Meraki, we create magic through the energy and passion of our employees, who shape our dynamic community and empower us to solve problems for our customers. This magic unfolds when technology becomes intuitive, functions as intended, and when every individual is valued. By providing our employees with the autonomy to make an impact, we strive to fulfill our mission of simplifying technology so our customers can focus on what matters most to them, whether it's their students, patients, cu

Sr. Site Reliability Engineer, Integration Tools

Tesla Motors

Palo Alto, California, USA

Full-time

The Integration Platforms team develops and operates critical technology to support our ever-expanding customer fleet from prototype to production. As an SRE on this team, you will ensure the reliability, scalability, and performance of our on-vehicle, desktop-based, and web-based systems, collaborating closely with software engineers to design, build, and operate these systems across multiple regions. Join us and you will work alongside world-class software and data engineers on some of the new

Senior Azure SRE

Kforce Technology Staffing

Remote or Tampa, Florida, USA

Contract

RESPONSIBILITIES: Kforce has a client in Tampa, FL that is seeking a highly skilled Senior Infrastructure Engineer to drive the design, automation, and optimization of cloud infrastructure supporting the firm's core technologies and applications. Acting as a key technical expert, you'll ensure our platforms are scalable, resilient, and aligned with strategic IT initiatives. Responsibilities: * Design and automate infrastructure management to improve system reliability, scalability, and performa

Sr. Site Reliability Engineer, Bare Metal, Infrastructure

Tesla Motors

Remote or Austin, Texas, USA

Full-time

Tesla cloud as a service seeks a high impact Site Reliability Engineer (SRE) to support our bare-metal provisioning platform at scale. You'll provide direct support to internal customers, resolve complex provisioning issues, and escalate systemic problems to engineering. Your focus: ensuring reliable, automated delivery of bare-metal infrastructure using Kubernetes, Metal , and industry standard tooling across diverse hardware from Supermicro, HPE, and Dell. Responsibilities Provide frontline s

SRE Specialist

Fortinet

Sunnyvale, California, USA

Full-time

Job Description Fortinet has an exciting opportunity for an experienced SRE Specialist to join our FortiGuard operation team. We are managing the consumer-facing services with high traffic volumes around the world. Service Reliability and Security is our top priority. This is a unique opportunity to join an established team of experienced professionals to work on some of the most innovative technology and network security products on the market. Job Responsibilities: Design and deployment of

Senior Site Reliability Engineer-FedRAMP (FULLY REMOTE)

Splunk Inc.

California, USA

Full-time

Description Join us as we pursue our ground-breaking vision to make machine data accessible, usable, and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk, we are committed to our work, customers, having fun, and most significantly to each other's success. The Splunk Observability Cloud provides full-fidelity monitoring and fixing across infrastructure, applications, and user interf

Principal Site Reliability Engineer (Safety) - Nashville, TN Hybrid

Oracle Corporation

Remote

Full-time

Job Description We offer unique opportunities for smart, hands-on engineers with the expertise and passion to solve difficult architecture, engineering, and process problems. Our customers run their businesses on our cloud, and our mission is to provide them with the most secure cloud services. Our ideal candidate is a site reliability or devops engineer with expertise and passion in finding and improving how services are deployed and operated. If this is you, joining Oracle Cloud Infrastructur

Senior SRE / 5 days ONSITE/ Irvine

Motion Recruitment Partners, LLC

Los Angeles, California, USA

Full-time

Headquartered in the United States, this global provider of reliable networking devices and smart home products is consistently ranked as the world's top provider of Wi-Fi devices. The company is committed to delivering innovative products that enhance people's lives through faster, more reliable connectivity. They serve customers in over 170 countries and continue to grow their global footprint. They are seeking an onsite DevOps Engineer to join their team in Irvine, where you'll work with cutti

Sr. Site Reliability Engineer - Top Secret Clearance

SpaceX

Hawthorne, California, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SR. SITE RELIABILITY ENGINEER - TOP SECRET CLEARANCE As a Senior Site Reliability Engineer, you will architect, develop, and test key aspects of the infrastructure for an in-house solution for analysis, simulation, pr

Sr. DevOps/Site Reliability Engineer

MTW Recruit

Remote

Full-time

No 3rd party inquiries will be processed. This is a 100% remote role in Eastern Standard Time zone - preference to EST and CST zones. Seeking a talented DevOps/Site Reliability Engineer (SRE) with expertise in Kubernetes, Terraform, Azure, and observability tools like DataDog to deploy and manage scalable, reliable infrastructure. Requirements: Minimum of 7 years of DevOps experience, preferably with SRE background. Terraform and Kubernetes are absolutely required. Strong problem-solving, communicat

Senior Site Reliability Engineer - GPU Clusters

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for phenomenal people like you to help us

Sr. Site Reliability Engineer, Energy Software

Tesla Motors

Palo Alto, California, USA

Full-time

Tesla is looking for a Site Reliability Engineer to build, enhance, and scale the infrastructure that underpins our Energy IoT applications. These applications provide real-time monitoring, optimization, and control for Tesla's industry-leading energy products, including Powerwall, Megapack, Solar Roof, Supercharger, Wall Connector, Autobidder, and Virtual Power Plants. We are a high-impact team that values curiosity, learning, mentorship, open discourse, and making disciplined decisions by weig

Senior Site Reliability Engineer with Kubernetes - W2 - Remote in EST hours (Posted by SAM)

Global Force USA

Remote

Contract

Requirements: 4+ years of experience working within a cloud engineer/SRE role. Expert knowledge of a cloud service provider. Expert knowledge and hands-on production experience in Kubernetes (bare metal or managed) cluster setup and management required. Experience with infrastructure as code (IaC) tools like Terraform, Pulumi. Experience with Kubernetes deployment tools like Helm, ArgoCD, Flux. Strong awareness of networking and internet protocols. Understanding of identity and access management (IAM). Ex

Sr. Site Reliability Engineer (Starshield) - Top Secret Clearance

SpaceX

Hawthorne, California, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SR. SITE RELIABILITY ENGINEER (STARSHIELD) - TOP SECRET CLEARANCE Starshield leverages SpaceX's Starlink technology and launch capability to support national security efforts. While Starlink is designed for consumer a

Sr. Spclst, Cloud Engineering

Merck & Company Inc

Remote or Rahway, New Jersey, USA

Full-time

Job Description We are looking for an experienced and enthusiastic Senior Site Reliability Engineer to join our Agile Planning Product Team. As part of the DevXOps Product Line, you will enable product teams to deliver value faster to the business by improving platform services that accelerate agile development projects. You will design scalable solutions, create CI/CD pipelines, and implement automation to enhance reliability and efficiency. As a Senior Reliability Engineer, you will: Become

Site Reliability Engineer (Information Technology)

SpaceX

Hawthorne, California, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER (INFORMATION TECHNOLOGY) SpaceX is looking for an experienced Site Reliability Engineer with working knowledge of Kubernetes, Linux and related containerized technologies. This employee will b