Lead Site Reliability Engineer Jobs in San Jose, CA
21 - 40 of 67 Jobs

Principal Site Reliability Engineer - Enterprise AI Platform

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA has been redefining computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. As

Senior Site Reliability Engineer - Data Infrastructure

TikTok

San Jose, California, USA

Full-time

Location: San Jose. Employment Type: Regular. Job Code: A33665. Responsibilities: Our data infrastructure Site Reliability Engineering (SRE) team is a pioneer in innovation. We seamlessly merge software development and infrastructure operations to design, build, and manage large-scale, highly distributed systems. We take pride in overseeing one of the industry's most extensive cloud infrastructures. As software development evolves, building systems fro

Senior Staff Site Reliability Engineer (Cortex Observability)

Palo Alto Networks

Santa Clara, California, USA

Full-time

Company Description. Our Mission: At Palo Alto Networks, everything starts and ends with our mission: being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we're looking for innovators who are as committed to shaping the future of cybersecurity as we are. Who We Are: We take our mission of

Sr Site Reliability Engineer (App Service Team)

Palo Alto Networks

Santa Clara, California, USA

Full-time

Company Description. Our Mission: At Palo Alto Networks, everything starts and ends with our mission: being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we're looking for innovators who are as committed to shaping the future of cybersecurity as we are. Who We Are: We take our mission of

Senior Site Reliability Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer. At NVIDIA, you'll be part of the team shaping the future of computing and guaranteeing the smooth operation of our brand-new technologies. Our mission is to leverage AI's power to build outstanding and pioneering solutions that have a significant impact on the world. What you'll be doing: Own the solutions you build, collaborating with cross-functional teams to successfully implement them. Collaborate with various teams

Senior Site Reliability Engineer, Product - USDS

TikTok

San Jose, California, USA

Full-time

Location: San Jose. Employment Type: Regular. Job Code: A215600. Responsibilities: Team Intro: The Product Engineering team monitors and maintains the availability of TikTok, including services such as video playback, content discovery/recommendations, live streaming, and customer service feedback. In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that

Senior Site Reliability Engineer

Circles Inc.

Remote or San Francisco, California, USA

Full-time

Circle is a financial technology company at the epicenter of the emerging internet of money, where value can finally travel like other digital data - globally, nearly instantly and less expensively than legacy settlement systems. This ground-breaking new internet layer opens up previously unimaginable possibilities for payments, commerce and markets that can help raise global economic prosperity and enhance inclusion. Our infrastructure - including USDC, a blockchain-based dollar - helps busines

Senior Site Reliability Engineer - Global SRE, Monetization Technology

TikTok

San Jose, California, USA

Full-time

Location: San Jose. Employment Type: Regular. Job Code: A180769. Responsibilities: TikTok is one of the fastest growing apps in the world, and we're seeking Site Reliability Engineers (SREs) to join our monetization technology team. The monetization technology team works on building and running large-scale, globally distributed, fault-tolerant ads systems. SREs keep the systems up and running with the highest level of availability, ensuring our users h

Staff Site Reliability Engineer, Cell Software

Tesla Motors

Remote or Fremont, California, USA

Full-time

Tesla is re-thinking how batteries are made from the ground up. We're designing new factories, new equipment, new processes and new software to rapidly scale battery manufacturing, globally. The primary bottleneck to Tesla's future expansion (and the transition to sustainable transport and energy storage) is our ability to produce and procure batteries - that's why we're innovating in-house, with our collection of world-class engineers, to redefine the industry. Software, data and automation all

Senior Site Reliability Engineer (Cortex Cloud Security Posture Management)

Palo Alto Networks

Santa Clara, California, USA

Full-time

Company Description. Our Mission: At Palo Alto Networks, everything starts and ends with our mission: being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we're looking for innovators who are as committed to shaping the future of cybersecurity as we are. Who We Are: We take our mission of

Principal Site Reliability Engineer (Cortex Cloud Security Posture Management)

Palo Alto Networks

Santa Clara, California, USA

Full-time

Company Description. Our Mission: At Palo Alto Networks, everything starts and ends with our mission: being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we're looking for innovators who are as committed to shaping the future of cybersecurity as we are. Who We Are: We take our mission of

Senior Site Reliability Engineer, HPC and LSF

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA is the leader in AI, machine learning and datacenter acceleration. NVIDIA is expanding that leadership into datacenter networking with Ethernet switches, NICs and DPUs. NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is a "learning machine" th

Staff Site Reliability Engineer, Fleetnet

Tesla Motors

Remote or Palo Alto, California, USA

Full-time

We are a product-focused global team creating the next generation of server-side infrastructure and code to support the growing suite of Tesla products and services. We are looking for seasoned SREs with domain expertise in areas related to developing infrastructure as a service, Kubernetes, GitOps, K8s Operator development, and platform security. The Fleetnet SRE team is part of the Vehicle Software division and is embedded with our backend application, data platform and navigation development

Senior Site Reliability Engineer

Generac Power Systems Inc

Remote or Denver, Colorado, USA

Full-time

We are Generac, a leading energy technology company committed to powering a smarter world. Over the 60 plus years of Generac's history, we've been dedicated to energy innovation. From creating the home standby generator market category, to our current evolution into an energy technology solutions company, we continue to push new boundaries.

Senior Site Reliability Engineer

General Motors

Remote

Full-time

Job Description: Develop and design software applications for a driverless technology company. Duties may include: Build out and improve observability systems, tools and the related codebase. Contribute code, perform code reviews, and create technical designs that improve performance and reliability of observability systems using software and systems engineering skills. Partner with other Software Engineering teams to better understand use-cases and guide the engineers to use the existing tools eff

Senior Site Reliability Engineer

GlobalLogic Inc.

Remote

Full-time

Job Description: Design, deploy, and scale our Prometheus architecture to handle 100+ million active series and beyond. Deploy and operate large, high-performance Elasticsearch clusters holding 2000+ TB of data. Deploy and grow high-throughput data pipelines built on Kafka, handling hundreds of thousands of events per second. Design and build an alerting system that allows engineering teams to construct alerts from multiple data sources and alerting workflows. Write libraries and APIs that give engin

Principal Site Reliability Engineer

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you enjoy collaborating with teams to solve complex challenges? Do you have a passion for cutting-edge technologies? Join our Compute Team! Our team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We do this while maintaining Akamai's mission at the forefront of what we do: make life better for billions of people, billions of times a day. Partner with the best: As a Principal Site Reliability Engineer in the Virtualizatio

Senior Site Reliability Engineer

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Are you excited by the prospect of working with innovative security products? Do you enjoy creating innovative and strategic solutions to solve complex problems? Join Guardicore (now Akamai Enterprise Security Group)! Guardicore is changing the way organizations protect their data centers and clouds. Our team boasts some of the most talented and experienced cyber security and data center. We're always looking for new people to inspire us and make us bett

Principal Site Reliability Engineer - Remote

UnitedHealth Group

Remote or Bernards, New Jersey, USA

Full-time

Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health op

Senior Site Reliability Engineer

McKesson Corporation

Remote or Columbus, Ohio, USA

Full-time

McKesson is an impact-driven, Fortune 10 company that touches virtually every aspect of healthcare. We are known for delivering insights, products, and services that make quality care more accessible and affordable. Here, we focus on the health, happiness, and well-being of you and those we serve - we care. What you do at McKesson matters. We foster a culture where you can grow, make an impact, and are empowered to bring new ideas. Together, we thrive as we shape the future of health for patien