Site Reliability Engineer Jobs in San Jose, CA

Sr. Site Reliability Engineer - U.S. Citizen - This role sits within Optum Serves Technology Product organization

Widescope Consulting and Contracting Services

Remote

Full-time

Job Title: Sr. Site Reliability Engineer. Location: Headquarters / Telecommute. Classification (HR only): Exempt / Non-Exempt. Reports To (Title): COO Widescope Consulting and Contracting. JOB SUMMARY: The statements below are not intended to be all-inclusive of the duties and responsibilities of the position. Based on leadership decisions and business needs, all other duties as assigned will be expected for each position. Widescope Consulting and Contracting is proud to serve our nation's mi

Principal AI Infrastructure SRE Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA has been reinventing computer graphics, PC gaming, and accelerated computing for 30 years. It is a unique legacy of innovation that's fueled by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, generative AI, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best tal

Sr. Site Reliability Engineer

Adobe Systems

San Jose, California, USA

Full-time

Our Company: Changing the world through digital experiences is what Adobe's all about. We give everyone, from emerging artists to global brands, everything they need to design and deliver exceptional digital experiences! We're passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen. We're on a mission to hire the very best and are committed to creating exceptional employee experiences wher

SRE/Devops/Kubernetes/Python

Infonex Technologies, Inc.

Pleasanton, California, USA

Contract

Position: DevOps/Kubernetes - Open Position - CA. Type: Contract. Duration: 12+ months. Location: Pleasanton, CA. Job Description: Required Skills: Spark, Hadoop/CDH, H2O/Steam, MapR, Kubernetes, Docker, TensorFlow, Apache Airflow, JupyterHub, RStudio, PyTorch, ELK, OpenVINO, MySQL, GitLab, Traefik, Prometheus, Grafana, Node Manager, Alert Manager, Vault. Notes: The client currently has an on-prem environment. The client wants experience in containerization with Kubernetes, Vault, Slurm with RStudio hook all the components

Internship, Site Reliability Engineer, Applications Engineering (Fall 2025)

Tesla Motors

Fremont, California, USA

Full-time

Consider before submitting an application: This position is expected to start around September 2025 and continue through the Fall term (approximately December 2025) or into Spring 2026 if available and there is an opportunity to do so. We ask for a minimum of 12 weeks, full-time and on-site, for most internships. Our internship program is for students who are actively enrolled in an academic program. Entry-level candidates seeking employment after graduation and not returning to school should a

Senior Site Reliability Engineer

Randstad Digital

Remote or St. Louis, Missouri, USA

Contract

Job summary: Story Behind the Need: Who is Resiliency Engineering Enablement? Partner with application and infrastructure teams to define Disaster Recovery (DR) standards. Design, deploy and manage Tier 1 DR capabilities. Standardize and evangelize DR implementation patterns. Define and evangelize observability and ops excellence standards as related to DR. Define and maintain failover criteria. Define, maintain and test Technical Recovery Guides (TRG). Location: Saint Louis, Missouri. Job type: Contract

Senior Site Reliability Engineer, HPC and LSF

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA is the leader in AI, machine learning and datacenter acceleration. NVIDIA is expanding that leadership into datacenter networking with Ethernet switches, NICs and DPUs. NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is a "learning machine" th

Staff Site Reliability Engineer, Cell Software

Tesla Motors

Remote or Fremont, California, USA

Full-time

Tesla is re-thinking how batteries are made from the ground up. We're designing new factories, new equipment, new processes and new software to rapidly scale battery manufacturing, globally. The primary bottleneck to Tesla's future expansion (and the transition to sustainable transport and energy storage) is our ability to produce and procure batteries - that's why we're innovating in-house, with our collection of world-class engineers, to redefine the industry. Software, data and automation all

Sr. Linux Site Reliability Engineer, IT Manufacturing Site Reliability Engineering

Tesla Motors

Fremont, California, USA

Full-time

We are seeking an enthusiastic SRE to join our dynamic IT Manufacturing Site Reliability Engineering (ITMFG-SRE) team at Tesla. Our team is responsible for building and managing an ecosystem of applications and platforms essential to manufacturing. As a Linux SRE, this role requires experience with hardware, software, networking, and automation to implement scalable solutions for manufacturing sites globally. You'll play a key role in maintaining, optimizing and scaling our infrastructure to sup

SRE (Linux / Golang Automation)

Bayside Solutions

Remote

Contract

Site Reliability Engineer (Linux / Golang Automation). W2 Contract. Salary Range: $124,800 - $145,600 per year. Location: Remote Role - PST. Job Summary: We require a Site Reliability Engineer with a strong background and experience supporting extensive virtualization and Linux compute platforms. Requirements and Qualifications: Experience automating with Golang. Experience with Infrastructure as a Service orchestration tools (OpenStack, CloudStack, etc.). Strong experience supporting Linux and

CDN Site Reliability Engineer (SRE) L5

Netflix, Inc.

Los Gatos, California, USA

Full-time

Netflix is one of the world's leading entertainment services, with over 300 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time. How do you spark joy in hundreds of millions of people? It starts with a vision - that technology can give voice to stories around the world. In delivering those much-l

Senior Engineer - Data Warehouse Site Reliability Engineering (SRE) (US Citizenship required)

Oracle Corporation

Pleasanton, California, USA

Full-time

Job Description: The candidate for this position must meet US government requirements and should be a US citizen and resident in the US. We are looking for senior engineers with experience in supporting data warehousing products. As a member of the Product development organization, the focus will be on working with development teams, providing timely support to customers, and identifying and implementing process automation for a cloud BI product. BS or higher degree in Computer Science / Engineering or equivalent 3+ y

Site Reliability Engineer - Openstack

Fortinet

Sunnyvale, California, USA

Full-time

Job Description: Fortinet is recruiting a Site Reliability Engineer - OpenStack to join our FortiStack team. This team is responsible for the management, operation and continued development of our OpenStack-based private cloud platform. This position would represent a great fit for OpenStack specialists or IT professionals with a combination of virtualization, OpenStack, storage and networking experience. As a Site Reliability Engineer - OpenStack, you will: Play a leading role in the operation,

SRE Specialist

Fortinet

Sunnyvale, California, USA

Full-time

Job Description: Fortinet has an exciting opportunity for an intermediate SRE Specialist to join our MIS operation team. We are managing consumer-facing services with high traffic volumes around the world. Service reliability and security are our top priorities. This is a unique opportunity to join an established team of experienced professionals to work on some of the most innovative technology and network security products on the market. Job Responsibilities: Linux System Administration: Admini

SRE Specialist - System

Fortinet

Sunnyvale, California, USA

Full-time

Job Description: Fortinet has an exciting opportunity for an intermediate SRE Specialist to join our FortiGuard operation team. We are managing consumer-facing services with high traffic volumes around the world. Service reliability and security are our top priorities. This is a unique opportunity to join an established team of experienced professionals to work on some of the most innovative technology and network security products on the market. Job Responsibilities: Linux System Administration:

SRE Specialist - Infrastructure

Fortinet

Sunnyvale, California, USA

Full-time

Job Description: Fortinet has an exciting opportunity for an intermediate SRE Specialist to join our FortiGuard operation team. We are managing consumer-facing services with high traffic volumes around the world. Service reliability and security are our top priorities. This is a unique opportunity to join an established team of experienced professionals to work on some of the most innovative technology and network security products on the market. Job Responsibilities: Linux Server Administration (U

Senior Site Reliability Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer. At NVIDIA, you'll be part of the team shaping the future of computing and guaranteeing the smooth operation of our brand-new technologies. Our mission is to leverage AI's power to build outstanding and pioneering solutions that have a significant impact on the world. What you'll be doing: Own the solutions you build, collaborating with cross-functional teams to successfully implement them. Collaborate with various teams

Senior Site Reliability Engineer - DGX Cloud

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large-scale production systems with high efficiency and availability using a combination of software and systems engineering practices. This is a highly specialized discipline that demands knowledge across different systems, networking, coding, database, capacity management, continuous delivery and deployment, and open source cloud-enabling technologies like Kubernetes and OpenStack. SRE at N

Senior Site Reliability Engineer

Circles Inc.

Remote or San Francisco, California, USA

Full-time

Circle is a financial technology company at the epicenter of the emerging internet of money, where value can finally travel like other digital data - globally, nearly instantly and less expensively than legacy settlement systems. This ground-breaking new internet layer opens up previously unimaginable possibilities for payments, commerce and markets that can help raise global economic prosperity and enhance inclusion. Our infrastructure - including USDC, a blockchain-based dollar - helps busines

Staff Site Reliability Engineer, Fleetnet

Tesla Motors

Remote or Palo Alto, California, USA

Full-time

We are a product-focused global team creating the next generation of server-side infrastructure and code to support the growing suite of Tesla products and services. We are looking for seasoned SREs with domain expertise in areas related to developing infrastructure as a service, Kubernetes, GitOps, K8s Operator development, and platform security. The Fleetnet SRE team is part of the Vehicle Software division and is embedded with our backend application, data platform and navigation development