Lead Site Reliability Engineer Jobs in California

61 - 76 of 76 Jobs

Sr. Site Reliability Engineer, Compute SRE

Roblox

San Mateo, California, USA

Full-time

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators. At Roblox, we're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We're on a mission to connect a billion people with op

Principal Site Reliability Engineer Cloud Identity & Trust (SPIFFE/SPIRE)

Pinnacle Software Solutions

San Jose, California, USA

Contract

Job Title: Cloud Compute Platform Architect Experience Level: Mid-Senior San Jose, CA Job Summary We are seeking an experienced Cloud Compute Platform Architect to design and manage a cutting-edge cloud compute platform based on SPIFFE. The ideal candidate will perform Site Reliability Engineering (SRE) responsibilities, including deployment, capacity management, observability, and performance tuning. You will collaborate closely with our Security Architecture team to define attestation methods fo

Sr. Site Reliability Engineer (Starshield) - Top Secret Clearance

SpaceX

Hawthorne, California, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SR. SITE RELIABILITY ENGINEER (STARSHIELD) - TOP SECRET CLEARANCE Starshield leverages SpaceX's Starlink technology and launch capability to support national security efforts. While Starlink is designed for consumer a

Director, Site Reliability Engineering

Walmart Inc.

Remote or Bentonville, Arkansas, USA

Full-time

Position Summary What you'll do Are you passionate about pioneering cutting-edge technology leveraging GenAI and big data to revolutionize Walmart's customer service experiences? Do you dream of working on innovative systems that make a significant impact on hundreds of millions of customers across the globe? We are seeking a visionary and hands-on Director of Site Reliability Engineering (SRE) to lead and scale a world-class SRE organization. This leader will be responsible for building a hig

Sr. Site Reliability Engineer, Energy Software

Tesla Motors

Palo Alto, California, USA

Full-time

Tesla is looking for a Site Reliability Engineer to build, enhance, and scale the infrastructure that underpins our Energy IoT applications. These applications provide real-time monitoring, optimization, and control for Tesla's industry-leading energy products, including Powerwall, Megapack, Solar Roof, Supercharger, Wall Connector, Autobidder, and Virtual Power Plants. We are a high-impact team that values curiosity, learning, mentorship, open discourse, and making disciplined decisions by weig

Senior Site Reliability Engineer - AI Research Clusters

NVIDIA

Santa Clara, California, USA

Full-time

NVIDIA is the leader in AI, machine learning and datacenter acceleration. NVIDIA is expanding that leadership into datacenter networking with ethernet switches, NICs and DPUs. NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is a "learning machine" th

Site Reliability Engineer, Connected Warfare

Anduril Industries

Costa Mesa, California, USA

Full-time

Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century's most innovative companies to the defense industry, Anduril is changing how military systems are designed, built and sold. Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data streams into a realtime, 3D command and c

Sr. Linux Site Reliability Engineer, IT Manufacturing Site Reliability Engineering

Tesla Motors

Fremont, California, USA

Full-time

We are seeking an enthusiastic SRE to join our dynamic IT Manufacturing Site Reliability Engineering (ITMFG-SRE) team at Tesla. Our team is responsible for building and managing an ecosystem of applications and platforms essential to manufacturing. As a Linux SRE, this role requires experience with hardware, software, networking, and automation to implement scalable solutions for manufacturing sites globally. You'll play a key role in maintaining, optimizing and scaling our infrastructure to sup

Principal AI Infrastructure SRE Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA has been reinventing computer graphics, PC gaming, and accelerated computing for 30 years. It is a unique legacy of innovation that's fueled by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, generative AI, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best tal

Senior Staff Software Engineer, Reliability Engineering

Airbnb

Remote

Full-time

Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every country across the globe. Every day, hosts offer unique stays and experiences that make it possible for guests to connect with communities in a more authentic way. The Community You Will Join: We are a community based on connection and belonging - a community that was born in 2007 when two hosts welc

Senior Staff Machine Learning Engineer - DevOps/Site Reliability Engineer

ServiceNow, Inc.

Santa Clara, California, USA

Full-time

Company Description It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today - ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500. Our intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work. But thi

Senior Machine Learning Ops Engineer, Global SRE

TikTok

San Jose, California, USA

Full-time

Location: San Jose Employment Type: Regular Job Code: A04380 Responsibilities MLOps - Global SRE team is responsible for the stability of machine learning systems under the Global Monetization Products and Technology organization, to ensure the stable and efficient operations of machine learning models from data preparation, development, training, deployment, serving and so on. Responsibilities 1) Responsible for setting SLOs of online machine lear

Senior Engineer - Data Warehouse Site Reliability Engineering (SRE) (ship required)

Oracle Corporation

Pleasanton, California, USA

Full-time

Job Description The candidate for this position must qualify the US-Gov requirements - should be a and resident in the US. We are looking for senior engineers with experience in supporting data warehousing products. As a member of the Product development organization, focus will be on working with development teams, providing timely support to customers and identify/implementing process automation, for cloud BI product. BS or higher degree in Computer Science / Engineering or equivalent 3+ y

(USA) Director, Software Engineering

Walmart Inc.

Remote or Bentonville, Arkansas, USA

Full-time

Position Summary What you'll do Are you passionate about leveraging cutting-edge technology to revolutionize the associate and Candidate experience? Do you aspire to play a technical leadership role that shapes the future of Global People Systems? If you possess exceptional Workday and OpenStack technical expertise, along with strong people leadership skills, this opportunity is for you. Join our forward-thinking People Technology engineering organization as Director of Software Engineering, w

Senior, Software Engineer | Backend | Personalization | Sunnyvale

Walmart Inc.

Remote or Bentonville, Arkansas, USA

Full-time

Position Summary What you'll do **Immigration sponsorship is not available in this role.** Walmart Global Tech is looking to hire a seasoned Backend Senior Software Engineer for the Personalization team. If you are an engineer who comes with both strong back-end & front-end development background & are passionate about solving problems at scale, feel free to apply! About Team: Our team works closely with our US stores and eCommerce business to better serve customers by empowering team members, st

IT - Oracle CPQ Developer (Big Machines)

Infobahn Softworld Inc.

Brea, California, USA

Contract, Third Party

Job Title - Oracle CPQ Developer Location - Brea, California Job Description: CPQ Developer (Oracle iQuotes / BigMachines): Developer will join the Global SFDC/CRM/CPQ Team as an individual contributor to support the CRM applications for the Field Sales, Sales Operations, Marketing, Sales Management, Product Management, Legal, and Finance business functions. The ideal candidate must have extensive experience in the Quote to Cash process with advanced experience in complex projects and config