Reliability engineering Jobs in San Jose, CA

61 - 80 of 174 Jobs

Senior Site Reliability Engineer - Data Infrastructure

TikTok

San Jose, California, USA

Full-time

Location: San Jose. Employment Type: Regular. Job Code: A33665. Responsibilities: Our data infrastructure Site Reliability Engineering (SRE) team is a pioneer in innovation. We seamlessly merge software development and infrastructure operations to design, build, and manage large-scale, highly distributed systems. We take pride in overseeing one of the industry's most extensive cloud infrastructures. As software development evolves, building systems fro

Senior Backend Software Engineer, Global E-commerce User Growth - Search Engine Optimization

TikTok

San Jose, California, USA

Full-time

Location: San Jose. Employment Type: Regular. Job Code: A148209B. Responsibilities: The e-commerce industry has seen tremendous growth in recent years and has become a hotly contested space amongst leading Internet companies, and its future growth cannot be underestimated. With millions of loyal users globally, we believe TikTok is an ideal platform to deliver a brand new and better e-commerce experience to our users. The growth engineering team owns t

Site Reliability Engineer Graduate (TikTok Product - USDS) - 2025 Start (BS/MS)

TikTok

San Jose, California, USA

Full-time

Location: San Jose. Employment Type: Regular. Job Code: A224009. Responsibilities. About the Team: Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed and fault-tolerant systems. Product SREs help ensure the reliability and uptime for the services underpinning the TikTok product. Our team pays great attention to optimizing existing systems, working closely with cross functional

Site Reliability Engineer, Product - USDS

TikTok

San Jose, California, USA

Full-time

Location: San Jose. Employment Type: Regular. Job Code: 5B7P. Responsibilities. About the team: The Product Engineering team monitors and maintains the availability of TikTok, including services such as video playback, content discovery/recommendations, live streaming, and customer service feedback. In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that

Tech lead Site Reliability Engineer, Edge - USDS

TikTok

San Jose, California, USA

Full-time

Location: San Jose. Employment Type: Regular. Job Code: A104498. Responsibilities: Site Reliability Engineering combines software and system engineering with system operations to build and run large-scale, massively distributed infrastructure. Our Edge SREs ensure infrastructure services are reliable, fault-tolerant, efficiently scalable and cost-effective. We dive deep into the stack, including network, hardware, OS, and applications, to quickly resol

Lead Reliability Validation Engineer, Energy & Charging Products

Tesla Motors

Palo Alto, California, USA

Full-time

As a Lead Validation Engineer focusing on manufacturing reliability of Tesla Energy & Charging products, you will play a key role in ensuring the reliability designed into Tesla's industrial and residential products is maintained throughout product manufacturing. This role follows the reliability lifecycle of the product from concept to design, development testing/analysis, manufacturing, and field operation to design-in, confirm, and grow exceptional reliability at every stage. You will partner

Principal Software Engineer - Enterprise AI Platform

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.

Database Administrator

Kforce Technology Staffing

Remote or Oakland, California, USA

Contract

RESPONSIBILITIES: Kforce's notable Utilities client is looking for a Database Administrator to join their team. This role is 100% remote. A virtual desktop will be provided; you will need to use your personal CPU. Project Description: * Applies expert judgment to support complex projects, collaborating on schedules, resources, and risks * Independently handles critical tasks while following database standards and security protocols * Works with team members to maintain system reliability and en

Sr. Linux Site Reliability Engineer, IT Manufacturing Site Reliability Engineering

Tesla Motors

Fremont, California, USA

Full-time

We are seeking an enthusiastic SRE to join our dynamic IT Manufacturing Site Reliability Engineering (ITMFG-SRE) team at Tesla. Our team is responsible for building and managing an ecosystem of applications and platforms essential to manufacturing. As a Linux SRE, this role requires experience with hardware, software, networking, and automation to implement scalable solutions for manufacturing sites globally. You'll play a key role in maintaining, optimizing and scaling our infrastructure to sup

Senior Site Reliability Engineer, Product - USDS

TikTok

San Jose, California, USA

Full-time

Location: San Jose. Employment Type: Regular. Job Code: A215600. Responsibilities. Team Intro: The Product Engineering team monitors and maintains the availability of TikTok, including services such as video playback, content discovery/recommendations, live streaming, and customer service feedback. In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that

Network Site Reliability Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

The Enterprise Network Support and SRE team is looking to add a seasoned Technical SRE lead to help actualize the SRE vision for our network infrastructure. We are looking for an engineer who is passionate about the network and making its operation seamless with a focus on user experience. This role will offer several opportunities to solve problems by being hands-on with troubleshooting, focused on network automation, observability, documentation, and excellence in operations. This Network SRE

Staff Engineer, Autonomy Developer Productivity & Infrastructure (Autonomy)

Rivian

Palo Alto, California, USA

Full-time

About Rivian Rivian is on a mission to keep the world adventurous forever. This goes for the emissions-free Electric Adventure Vehicles we build, and the curious, courageous souls we seek to attract. As a company, we constantly challenge what's possible, never simply accepting what has always been done. We reframe old problems, seek new solutions and operate comfortably in areas that are unknown. Our backgrounds are diverse, but our team shares a love of the outdoors and a desire to protect it

Associate Vice President, Hardware Systems Engineering - Cloud Platform BU

Marvell Semiconductor Inc.

Santa Clara, California, USA

Full-time

About Marvell Marvell's semiconductor solutions are the essential building blocks of the data infrastructure that connects our world. Across enterprise, cloud and AI, automotive, and carrier architectures, our innovative technology is enabling new possibilities. At Marvell, you can affect the arc of individual lives, lift the trajectory of entire industries, and fuel the transformative potential of tomorrow. For those looking to make their mark on purposeful and enduring innovation, above and be

Senior Frontend Software Engineer, Marketplace

Roblox

San Mateo, California, USA

Full-time

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators. At Roblox, we're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We're on a mission to connect a billion people with op

Internship, Software Engineer Opticaster, Energy Engineering (Fall 2025)

Tesla Motors

Palo Alto, California, USA

Full-time

Consider before submitting an application: This position is expected to start August/September 2025 and continue through the entire Fall term (i.e. through December 2025/January 2026) or into Winter/Spring 2026 if available. We ask for a minimum of 12 weeks, full-time and on-site, for most internships. International Students: If your work authorization is through CPT, please consult your school on your ability to work 40 hours per week before applying. You must be able to work 40 hours per week

Distinguished Engineer, Machine Learning - Safety

Roblox

San Mateo, California, USA

Full-time

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators. At Roblox, we're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We're on a mission to connect a billion people with op

Senior Azure DevOps Engineer - San Jose, CA

Elekta Inc.

San Jose, California, USA

Full-time

Want to join a team with a mission to improve and save lives? We continually look for motivated and skilled individuals who are interested in supporting our customers - healthcare professionals who use our products to help patients and their communities. We currently have the following opportunity available - please contact us for more details! We don't just build technology. We

CDN Site Reliability Engineer (SRE) L5

Netflix, Inc.

Los Gatos, California, USA

Full-time

Netflix is one of the world's leading entertainment services, with 283 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time. How do you spark joy in hundreds of millions of people? It starts with a vision - that technology can give voice to stories around the world. In delivering those much-loved

Site Reliability Engineer, Trust & Safety - USDS

TikTok

San Jose, California, USA

Full-time

Location: San Jose. Employment Type: Regular. Job Code: VGGP. Responsibilities. Team Intro: The Trust and Safety (TnS) engineering team of US Tech Service department at TikTok is fast growing and responsible for building machine learning models and systems to identify and defend internet abuse and fraud on our platform. Our mission is to protect billions of users and publishers across the globe every day. We embrace the state-of-the-art machine learnin

Senior DGX Cloud Software Engineer - Infrastructure Automation and Distributed Systems

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

We are seeking Software Engineers with previous experience building and running private and public clouds at production scale. As part of the DGX Cloud team, you'll have the opportunity to support our customers' journeys in AI training and inference development by building the platforms, tools, and services that defend the operational capacity of our bare-metal, accelerated compute infrastructure and codify reliability best-practices in the broader DGX Cloud platform ecosystem. What you'll be d