Site Reliability Engineer Jobs in California

121 - 140 of 319 Jobs

Senior Site Reliability Engineer, Product - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A215600 Responsibilities Team Intro: The Product Engineering team monitors and maintains the availability of TikTok, including services such as video playback, content discovery/recommendations, live streaming, and customer service feedback. In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that

Data Ingestion SRE, Data Platform - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A218312 Responsibilities Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the data platform area, you will have the opportunity to manage the services and infrastructures in one of the largest data platforms in the world that directly supports the TikTok a

Data Ingestion SRE, Data Platform - USDS

TikTok

Los Angeles, California, USA

Full-time

Location : Los Angeles Employment Type : Regular Job Code : A259491 Responsibilities Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the data platform area, you will have the opportunity to manage the services and infrastructures in one of the largest data platforms in the world that directly supports the TikT

Internship, Site Reliability Engineer, Applications Engineering (Fall 2025)

Tesla Motors

Fremont, California, USA

Full-time

Consider before submitting an application: This position is expected to start around September 2025 and continue through the Fall term (approximately December 2025) or into Spring 2026 if available and there is an opportunity to do so. We ask for a minimum of 12 weeks, full-time and on-site, for most internships. Our internship program is for students who are actively enrolled in an academic program. Entry-level candidates seeking employment after graduation and not returning to school should a

Site Reliability Engineer, Trust & Safety - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : VGGP Responsibilities Team Intro: The Trust and Safety (TnS) engineering team of the US Tech Service department at TikTok is fast-growing and responsible for building machine learning models and systems to identify and defend against internet abuse and fraud on our platform. Our mission is to protect billions of users and publishers across the globe every day. We embrace the state-of-the-art machine learnin

Network Site Reliability Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

The Enterprise Network Support and SRE team is looking to add a seasoned Technical SRE lead to help actualize the SRE vision for our network infrastructure. We are looking for an engineer who is passionate about the network and making its operation seamless with a focus on user experience. This role will offer several opportunities to solve problems by being hands-on with troubleshooting, focused on network automation, observability, documentation, and excellence in operations. This Network SRE

Senior Site Reliability Engineer

Salesforce

San Francisco, California, USA

Full-time

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Job Category Software Engineering Job Details About Salesforce We're Salesforce, the Customer Company, inspiring the future of business with AI + Data + CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too - driving you

Principal AI Infrastructure SRE Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA has been reinventing computer graphics, PC gaming, and accelerated computing for 30 years. It is a unique legacy of innovation that's fueled by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, generative AI, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best tal

Sr. Site Reliability Engineer

Adobe Systems

San Jose, California, USA

Full-time

Our Company Changing the world through digital experiences is what Adobe's all about. We give everyone, from emerging artists to global brands, everything they need to design and deliver exceptional digital experiences! We're passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen. We're on a mission to hire the very best and are committed to creating exceptional employee experiences wher

Senior Engineer - Data Warehouse Site Reliability Engineering (SRE) (ship required)

Oracle Corporation

Pleasanton, California, USA

Full-time

Job Description The candidate for this position must meet the US-Gov requirements - should be a and resident in the US. We are looking for senior engineers with experience in supporting data warehousing products. As a member of the Product development organization, the focus will be on working with development teams, providing timely support to customers, and identifying and implementing process automation for the cloud BI product. BS or higher degree in Computer Science / Engineering or equivalent 3+ y

Staff Site Reliability Engineer, Cell Software

Tesla Motors

Remote or Austin, Texas, USA

Full-time

Tesla is re-thinking how batteries are made from the ground up. We're designing new factories, new equipment, new processes and new software to rapidly scale battery manufacturing, globally. The primary bottleneck to Tesla's future expansion (and the transition to sustainable transport and energy storage) is our ability to produce and procure batteries - that's why we're innovating in-house, with our collection of world-class engineers, to redefine the industry. Software, data and automation all

Site Reliability Engineer (AWS, Linux Administration)

Introlligent Inc.

Elk Grove, California, USA

Contract

Title: SRE (Site Reliability Engineer) Location: Elk Grove, CA (Hybrid) Duration: 6 months Minimum Qualifications: Strong sense of ownership, customer service, and integrity demonstrated through clear communication. Experience with deploying, supporting and monitoring new and existing services, platforms, and application stacks. Proficiency with Python, Bash scripts, Go, REST APIs, and any object-oriented programming a must. Proficiency in using monitoring and observability tools such as Prometheus

Linux Systems Engineer (SRE)

Introlligent Inc.

Elk Grove, California, USA

Contract, Third Party

**ONLY LOCAL CANDIDATES** Minimum Qualifications: Strong sense of ownership, customer service, and integrity demonstrated through clear communication. Experience with deploying, supporting and monitoring new and existing services, platforms, and application stacks. Proficiency with Python, Bash scripts, Go, REST APIs, and any object-oriented programming a must. Proficiency in using monitoring and observability tools such as Prometheus, Grafana, Splunk, etc. Extensive knowledge with Enterprise Linux (R

Staff Site Reliability Engineer, Cell Software

Tesla Motors

Remote or Fremont, California, USA

Full-time

Tesla is re-thinking how batteries are made from the ground up. We're designing new factories, new equipment, new processes and new software to rapidly scale battery manufacturing, globally. The primary bottleneck to Tesla's future expansion (and the transition to sustainable transport and energy storage) is our ability to produce and procure batteries - that's why we're innovating in-house, with our collection of world-class engineers, to redefine the industry. Software, data and automation all

Sr. Linux Site Reliability Engineer, IT Manufacturing Site Reliability Engineering

Tesla Motors

Fremont, California, USA

Full-time

We are seeking an enthusiastic SRE to join our dynamic IT Manufacturing Site Reliability Engineering (ITMFG-SRE) team at Tesla. Our team is responsible for building and managing an ecosystem of applications and platforms essential to manufacturing. As a Linux SRE, this role requires experience with hardware, software, networking, and automation to implement scalable solutions for manufacturing sites globally. You'll play a key role in maintaining, optimizing and scaling our infrastructure to sup

Principal Site Reliability Engineer

General Motors

Remote

Full-time

Job Description Remote : Reporting where work can/needs to be performed / collaboration should happen. If the person lives within 50 miles of such a location, they are expected to come in three times a week. If they do not live within 50 miles of any of those locations, they don't need to report in. The rapid adoption of advanced software in vehicles marks a new era for automakers and consumers, bringing both advantages and challenges. As part of Site Reliability Engineering (SRE) at General mot

Site Reliability Engineer - Observability (FedRAMP IL5)

Splunk Inc.

Virginia, USA

Full-time

Description Join us as we pursue our ground-breaking vision to make machine data accessible, usable, and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk, we are committed to our work, customers, having fun, and most significantly to each other's success. The Splunk Observability Cloud provides full-fidelity monitoring and fixing across infrastructure, applications, and user inter

Lead Site Reliability Engineer, Observability - Remote

Cisco Systems, Inc.

Remote

Full-time

Application window is open until further notice. The Meraki cloud supports millions of customer devices from 10 data centers around the world. Meraki's customer base has grown by a factor of 2-3 every year, serving billions of HTTP requests per day globally. Our customers depend on our products to run their critical infrastructure of network switches, security appliances, wireless APs and security cameras. As SREs at Meraki, we are responsible for building and growing the cloud that supports t

FedRAMP Site Reliability Engineer - Early Career (; Boulder, CO or Raleigh, NC ONLY)

Splunk Inc.

Colorado, USA

Full-time

Description This is a US-based position. Candidates must be able to support FedRAMP High. This role is based in the Boulder, CO office or Raleigh, NC office, and will require relocation. Splunk is here to build a safer and more resilient digital world. The world's leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. While customers love our technology, it's our people that make Splunk stand out as an amazing career destinati

Senior Site Reliability Engineer, HPC and LSF

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA is the leader in AI, machine learning and datacenter acceleration. NVIDIA is expanding that leadership into datacenter networking with Ethernet switches, NICs and DPUs. NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is a "learning machine" th