Lead Site Reliability Engineer Jobs in San Jose, CA

Senior Site Reliability Engineer

Circles Inc.

Remote or San Francisco, California, USA

Full-time

Circle is a financial technology company at the epicenter of the emerging internet of money, where value can finally travel like other digital data - globally, nearly instantly and less expensively than legacy settlement systems. This ground-breaking new internet layer opens up previously unimaginable possibilities for payments, commerce and markets that can help raise global economic prosperity and enhance inclusion. Our infrastructure - including USDC, a blockchain-based dollar - helps busines

Sr. Site Reliability Engineer, Compute SRE

Roblox

San Mateo, California, USA

Full-time

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators. At Roblox, we're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We're on a mission to connect a billion people with op

Senior Site Reliability Engineer - GPU Clusters

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for phenomenal people like you to help us

Sr. Site Reliability Engineer, Energy Software

Tesla Motors

Palo Alto, California, USA

Full-time

Tesla is looking for a Site Reliability Engineer to build, enhance, and scale the infrastructure that underpins our Energy IoT applications. These applications provide real-time monitoring, optimization, and control for Tesla's industry-leading energy products, including Powerwall, Megapack, Solar Roof, Supercharger, Wall Connector, Autobidder, and Virtual Power Plants. We are a high-impact team that values curiosity, learning, mentorship, open discourse, and making disciplined decisions by weig

Senior Site Reliability Engineer, Test Platform- REMOTE

Cisco Systems, Inc.

Remote or San Francisco, California, USA

Full-time

At Cisco Meraki, we create magic through the energy and passion of our employees, who shape our dynamic community and empower us to solve problems for our customers. This magic unfolds when technology becomes intuitive, functions as intended, and when every individual is valued. By providing our employees with the autonomy to make an impact, we strive to fulfill our mission of simplifying technology so our customers can focus on what matters most to them, whether it's their students, patients, cu

Senior SRE Engineer

M&T BANK CORPORATION

Remote or Buffalo, New York, USA

Full-time

Job Overview: We are looking for a highly motivated SR SRE Engineer with a strong background in Observability to join our growing team. This role requires a seasoned professional to guide our team in building, scaling, and maintaining observability solutions that help ensure our systems and services are highly available, performant, and secure. Responsibilities: Lead the development and implementation of observability tools and practices across multiple platforms, including monitoring, logging

Staff Site Reliability Engineer

Block Inc

California, USA

Full-time

Block is one company built from many blocks, all united by the same purpose of economic empowerment. The blocks that form our foundational teams - People, Finance, Counsel, Hardware, Information Security, Platform Infrastructure Engineering, and more - provide support and guidance at the corporate level. They work across business groups and around the globe, spanning time zones and disciplines to develop inclusive People policies, forecast finances, give legal counsel, safeguard systems, nurture

SRE Manager

Fortinet

Sunnyvale, California, USA

Full-time

Job Description At Fortinet, we strive to provide a supportive, collaborative environment where people are empowered to do the best work of their careers. Our team members enjoy solving complex problems, and obsess over getting the details right. We love what we do and are proud of our work to secure clouds and container environments for thousands of B2B customers worldwide. We are looking for a highly skilled Site Reliability Engineering (SRE) Manager to lead our SRE team in building scalabl

Principal Site Reliability Engineer - Storage

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you enjoy collaborating with teams to solve complex challenges? Do you have a passion for cutting edge technologies and tackling distributed system problems? Join our highly skilled Storage Team! We design, deploy, and manage applications and infrastructure that supports Akamai's internal and customer-facing cloud storage platforms. We do this while maintaining Akamai's mission to make life better for billions of people, billions of times a day. Partner with the best In this role, you'll

Staff Site Reliability Engineer, Cell Software

Tesla Motors

Remote or Austin, Texas, USA

Full-time

Tesla is re-thinking how batteries are made from the ground up. We're designing new factories, new equipment, new processes and new software to rapidly scale battery manufacturing, globally. The primary bottleneck to Tesla's future expansion (and the transition to sustainable transport and energy storage) is our ability to produce and procure batteries - that's why we're innovating in-house, with our collection of world-class engineers, to redefine the industry. Software, data and automation all

Senior Site Reliability Engineer - Azure - Remote

UnitedHealth Group

Remote or Eden Prairie, Minnesota, USA

Full-time

For those who want to invent the future of health care, here's your opportunity. We're going beyond basic care to health programs integrated across the entire continuum of care. Join us to start Caring. Connecting. Growing together. The Sr Site Reliability Engineer will architect, develop, and maintain Optum Serve's cloud environment in both the commercial and government cloud. The role will work closely with software engineers, architects, and DevOps engineers to architect and maintain a secur

Sr. Site Reliability Engineer: Splunk Cloud Services

Splunk Inc.

Colorado, USA

Full-time

Description Sr. Site Reliability Engineer: Splunk Cloud Services Job Description Join us as we pursue our exciting vision to make machine data accessible, usable and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk, we're committed to our work, customers, having fun and most significantly to each other's success. Learn more about Splunk careers and how you can become a part of our

Sr. Linux Site Reliability Engineer, IT Manufacturing Site Reliability Engineering

Tesla Motors

Fremont, California, USA

Full-time

We are seeking an enthusiastic SRE to join our dynamic IT Manufacturing Site Reliability Engineering (ITMFG-SRE) team at Tesla. Our team is responsible for building and managing an ecosystem of applications and platforms essential to manufacturing. As a Linux SRE, this role requires experience with hardware, software, networking, and automation to implement scalable solutions for manufacturing sites globally. You'll play a key role in maintaining, optimizing and scaling our infrastructure to sup

Senior Site Reliability Engineer - Observability (FedRAMP IL5)

Splunk Inc.

North Carolina, USA

Full-time

Description Join us as we pursue our ground-breaking vision to make machine data accessible, usable, and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk, we are committed to our work, customers, having fun, and most significantly to each other's success. The Splunk Observability Cloud provides full-fidelity monitoring and fixing across infrastructure, applications, and user inter

Senior Site Reliability Engineer, Observability, FedRAMP

Splunk Inc.

California, USA

Full-time

Description Splunk, a Cisco company, is building a safer and more resilient digital world with an end-to-end full stack platform made for a hybrid, multi-cloud world. Leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. Our customers love our technology, but it's our caring employees that make Splunk stand out as an amazing career destination. No matter where in the world or what level of the organization, we approach our wor

Principal AI Infrastructure SRE Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA has been reinventing computer graphics, PC gaming, and accelerated computing for 30 years. It is a unique legacy of innovation that's fueled by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, generative AI, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best tal

Senior Azure SRE

Kforce Technology Staffing

Remote or Tampa, Florida, USA

Contract

RESPONSIBILITIES: Kforce has a client in Tampa, FL that is seeking a highly skilled Senior Infrastructure Engineer to drive the design, automation, and optimization of cloud infrastructure supporting the firm's core technologies and applications. Acting as a key technical expert, you'll ensure our platforms are scalable, resilient, and aligned with strategic IT initiatives. Responsibilities: * Design and automate infrastructure management to improve system reliability, scalability, and performa

Sr. Site Reliability Engineer, Bare Metal, Infrastructure

Tesla Motors

Remote or Austin, Texas, USA

Full-time

Tesla cloud as a service seeks a high impact Site Reliability Engineer (SRE) to support our bare-metal provisioning platform at scale. You'll provide direct support to internal customers, resolve complex provisioning issues, and escalate systemic problems to engineering. Your focus: ensuring reliable, automated delivery of bare-metal infrastructure using Kubernetes, Metal , and industry standard tooling across diverse hardware from Supermicro, HPE, and Dell. Responsibilities Provide frontline s

Senior Site Reliability Engineer-FedRAMP (FULLY REMOTE)

Splunk Inc.

California, USA

Full-time

Description Join us as we pursue our ground-breaking vision to make machine data accessible, usable, and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk, we are committed to our work, customers, having fun, and most significantly to each other's success. The Splunk Observability Cloud provides full-fidelity monitoring and fixing across infrastructure, applications, and user interf

Principal Site Reliability Engineer (Safety) - Nashville, TN Hybrid

Oracle Corporation

Remote

Full-time

Job Description We offer unique opportunities for smart, hands-on engineers with the expertise and passion to solve difficult architecture, engineering, and process problems. Our customers run their businesses on our cloud, and our mission is to provide them with the most secure cloud services. Our ideal candidate is a site reliability or devops engineer with expertise and passion in finding and improving how services are deployed and operated. If this is you, joining Oracle Cloud Infrastructur