Reliability engineering Jobs in California

81 - 100 of 269 Jobs

IT Litigation Support

Contact Government Services, LLC

San Francisco, California, USA

Full-time

IT Litigation Support Employment Type: Full Time, Mid level Department: Information Technology Contact Government Services is looking for a Litigation Support Technician to work at the United States Attorney's Office. As a Litigation Support Technician for CGS, you will be responsible for providing technical and analytical assistance involving Litigation Support of the United States Attorney's Office. CGS brings motivated, highly skilled, and creative people together to solve the government'

Datacenter Resiliency Architect - New College Grad 2025

NVIDIA Corporation

Santa Clara, California, USA

Full-time

Today, NVIDIA is tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. As an NVIDIAN, you'll be immersed in a diverse, encouraging environment where everyone is inspired to do their best work. Come join the team and see how we can make a lasting impact on the world

Site Reliability Engineer, Compute - USDS

TikTok

San Jose, California, USA

Full-time

Location: San Jose. Employment Type: Regular. Job Code: A94176. Responsibilities: Site Reliability Engineering (SRE) at TikTok combines software and systems engineering to build and run large-scale, massively distributed, and fault-tolerant systems. In our team, you'll have the opportunity to manage the complex challenges of scale, while using expertise in coding, algorithms, complexity analysis, and large-scale system design. We embrace a culture of di

Software Engineer, Map ML Platform

Nuro Inc.

Mountain View, California, USA

Full-time

Who We Are Nuro exists to better everyday life through robotics. Founded in 2016, Nuro has spent eight years developing autonomous driving (AD) technology and commercializing AD applications. The Nuro Driver is our world-class autonomous driving system that combines AD hardware with our generalized AI-first self-driving software. Built to learn and improve through data, the Nuro Driver is one of the few driverless autonomous technologies on public roads today. Nuro has raised over $2B in capit

Senior System Software Engineer, AI Solutions Engineering

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA is hiring senior system software engineers in its Infrastructure, Planning and Process Team (IPP), to accelerate AI adoption across various engineering workflows within the company. IPP is a global organization within NVIDIA. The group works with various other teams within NVIDIA such as Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence and Driverless Cars to cater to their infrastructure needs. These cloud services provide almost half a million automated jobs

Sr Site Reliability Engineer (App Service Team)

Palo Alto Networks

Santa Clara, California, USA

Full-time

Company Description Our Mission At Palo Alto Networks everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we're looking for innovators who are as committed to shaping the future of cybersecurity as we are. Who We Are We take our mission of

Senior Datacenter Resiliency Architect

NVIDIA Corporation

Santa Clara, California, USA

Full-time

We are now seeking a Senior Datacenter Resiliency (RAS) Architect! NVIDIA is a learning machine that constantly evolves by seeking exciting opportunities that matter to the world, and that only we can solve. We attract the world's best people, so we can achieve our highest aim: building a company that lets us do our life's work, at the highest level of our craft. NVIDIA is seeking a Resiliency Architect to support the development and validation of GPU (graphics processing unit) hardware and s

Principal DevOps Engineer (Cortex)

Palo Alto Networks

Santa Clara, California, USA

Full-time

Company Description Our Mission At Palo Alto Networks everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we're looking for innovators who are as committed to shaping the future of cybersecurity as we are. Who We Are We take our mission of

Senior Site Reliability Engineer, AI Infrastructure

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you! NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for over 30 years. It's a unique legacy of innovation that's fueled by phenomenal technology and outstanding people. Today, we're tapping into the unlimited potential of

Director, Site Reliability Engineering

Walmart Inc.

Remote or Bentonville, Arkansas, USA

Full-time

Position Summary What you'll do Are you passionate about pioneering cutting-edge technology leveraging GenAI and big data to revolutionize Walmart's customer service experiences? Do you dream of working on innovative systems that make a significant impact on hundreds of millions of customers across the globe? We are seeking a visionary and hands-on Director of Site Reliability Engineering (SRE) to lead and scale a world-class SRE organization. This leader will be responsible for building a hig

Reliability Engineer, Thermal & Mechanical, Energy Products

Tesla Motors

Palo Alto, California, USA

Full-time

As a Mechanical Reliability Engineer focusing on Tesla's energy products, specifically Megapack, you will play a key role in designing reliability into Tesla's industrial energy storage systems ensuring the products meet the highest standards of reliability. This role follows the reliability lifecycle of the product from concept to design, validation testing/analysis, manufacturing, and field operation to design-in, confirm, and grow exceptional reliability at every stage. You will investigate f

Tech Lead Manager (Machine Learning Focused) - TikTok Search Algorithms (Ranking, Relevance, Understanding, User Engagement, NLP)

TikTok

San Jose, California, USA

Full-time

Location: San Jose. Employment Type: Regular. Job Code: S0113. Responsibilities. About the Team: On the TikTok Search Team, you will have the opportunity to develop and apply cutting-edge machine learning technologies in real-time large-scale systems, which serve billions of search requests every day. Via advanced NLP and multi-modal models, our projects impact and improve the search experience for hundreds of millions of users globally. We embrace a cu

Sr Principal FinOps/DevOps Engineer (Cortex)

Palo Alto Networks

Santa Clara, California, USA

Full-time

Company Description Our Mission At Palo Alto Networks everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we're looking for innovators who are as committed to shaping the future of cybersecurity as we are. Who We Are We take our mission of

Site Reliability Engineer

McKesson Corporation

Remote or Columbus, Ohio, USA

Full-time

McKesson is an impact-driven, Fortune 10 company that touches virtually every aspect of healthcare. We are known for delivering insights, products, and services that make quality care more accessible and affordable. Here, we focus on the health, happiness, and well-being of you and those we serve - we care. What you do at McKesson matters. We foster a culture where you can grow, make an impact, and are empowered to bring new ideas. Together, we thrive as we shape the future of health for patien

Senior Machine Learning Engineer - TikTok Search Algorithm (Ranking, Relevance, Understanding, User Engagement)

TikTok

San Jose, California, USA

Full-time

Location: San Jose. Employment Type: Regular. Job Code: JHU52. Responsibilities. About the Team: On the TikTok Search Team, you will have the opportunity to develop and apply cutting-edge machine learning technologies in real-time large-scale systems, which serve billions of search requests every day. Via advanced NLP and multi-modal models, our projects impact and improve the search experience for hundreds of millions of users globally. We embrace a cu

Senior DevOps Engineer

Relativity Space

Long Beach, California, USA

Full-time

At Relativity Space, we're building rockets to serve today's needs and tomorrow's breakthroughs. Our Terran R vehicle will deliver customer payloads to orbit, meeting the growing demand for launch capacity. But that's just the start. Achieving commercial success with Terran R will unlock new opportunities to advance science, exploration, and innovation, pioneering progress that reaches beyond the known. Joining Relativity means becoming part of something where autonomy, ownership, and impact ex

Software Engineer, Enterprise Application - USDS

TikTok

San Jose, California, USA

Full-time

Location: San Jose. Employment Type: Regular. Job Code: A171267. Responsibilities. About the Team: Join our innovative and rapidly expanding Enterprise Applications team. We are looking for a talented Software Engineer with a strong background in both Software Engineering (SWE) and Site Reliability Engineering (SRE). This role is crucial in building, maintaining, and enhancing scalable and reliable enterprise applications. You will collaborate with cros

Senior Machine Learning Engineering Manager

Adobe Systems

San Jose, California, USA

Full-time

Our Company Changing the world through digital experiences is what Adobe's all about. We give everyone, from emerging artists to global brands, everything they need to design and deliver exceptional digital experiences! We're passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen. We're on a mission to hire the very best and are committed to creating exceptional employee experiences wher

Mainframe Automation Lead, Systems Administration

Kyndryl

Dallas, Texas, USA

Full-time

Who We Are At Kyndryl, we design, build, manage and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward - always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers and our communities. The Role As a Systems Administrator at Kyndryl, you'll tackle complex challenges across diverse platforms and services. You'll play a key role in modern

Senior Service Engineer

Robert Half

Remote or Redmond, Washington, USA

Contract

Description We're supporting a high-impact managed services project with a well-known technology company based in the Redmond, WA area. This role will focus on backend lab and infrastructure support, ensuring smooth, secure, and automated systems for internal teams working in a fast-paced development environment. Responsibilities: Manage and maintain Azure IaaS environments, including virtual machine provisioning, storage solutions, networking, and policy management. Administer and optimize Wind