Site Reliability Engineer Jobs in San Jose, CA

Senior Site Reliability Engineer, HPC and LSF

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA is the leader in AI, machine learning and datacenter acceleration. NVIDIA is expanding that leadership into datacenter networking with Ethernet switches, NICs and DPUs. NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is a "learning machine" th

Staff Site Reliability Engineer, Cell Software

Tesla Motors

Remote or Fremont, California, USA

Full-time

Tesla is re-thinking how batteries are made from the ground up. We're designing new factories, new equipment, new processes and new software to rapidly scale battery manufacturing, globally. The primary bottleneck to Tesla's future expansion (and the transition to sustainable transport and energy storage) is our ability to produce and procure batteries - that's why we're innovating in-house, with our collection of world-class engineers, to redefine the industry. Software, data and automation all

Apigee SRE Automation Lead

Nityo Infotech Corporation

Remote

Contract

Role: Apigee SRE / Automation Lead Remote Contract Job Job Summary: We are seeking a highly skilled and experienced Apigee SRE / Automation Lead to oversee the reliability, scalability, and automation of our API infrastructure. This role demands deep expertise in Apigee platform operations, infrastructure automation, and system reliability engineering. The ideal candidate will have hands-on experience with tools like Terraform, Ansible, DoJo, and scripting, along with a strong background in Lin

CDN Site Reliability Engineer (SRE) L5

Netflix, Inc.

Los Gatos, California, USA

Full-time

Netflix is one of the world's leading entertainment services, with 283 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time. How do you spark joy in hundreds of millions of people? It starts with a vision - that technology can give voice to stories around the world. In delivering those much-loved

Technical Program Manager

Milestone Technologies, Inc.

Sunnyvale, California, USA

Contract

Technical Program Manager with integration/migration experience in Platform / Production / Site Reliability Engineering. Hybrid role - Sunnyvale, CA. 12-month W2 contract, W2 ONLY (no C2C or 3rd parties). Rate: $70 - 75/hr. - OR Please let me know what you seek. Looking for: Solid Jira experience. Strong networking background and experience with the infra stacks such as databases (Cassandra, MySQL), search platforms, Kubernetes, etc. Migration from prem to cloud a must. (any is fine, they are using Go

Senior Engineer - Data Warehouse Site Reliability Engineering (SRE) (Citizenship required)

Oracle Corporation

Pleasanton, California, USA

Full-time

Job Description: The candidate for this position must meet the US-Gov requirements - should be a citizen and resident of the US. We are looking for senior engineers with experience in supporting data warehousing products. As a member of the Product development organization, the focus will be on working with development teams, providing timely support to customers, and identifying and implementing process automation for a cloud BI product. BS or higher degree in Computer Science / Engineering or equivalent 3+ y

SRE (Linux / Golang Automation)

Bayside Solutions

Remote

Contract

Site Reliability Engineer (Linux / Golang Automation) W2 Contract Salary Range: $124,800 - $145,600 per year Location: Remote Role - PST Job Summary: We require a Site Reliability Engineer with a strong background and experience supporting extensive virtualization and Linux compute platforms. Requirements and Qualifications: Experience automating with Golang Experience with Infrastructure as a Service orchestration tools (OpenStack, CloudStack, etc.) Strong experience supporting Linux and

Site Reliability Engineer - Openstack

Fortinet

Sunnyvale, California, USA

Full-time

Job Description Fortinet is recruiting a Site Reliability Engineer- OPENSTACK to join our FortiStack team. This team is responsible for the management, operation and continued development of our Openstack-based private cloud platform. This position would represent a great fit for Openstack specialists or IT professionals with a combination of virtualization, Openstack, storage and networking experience. As a Site Reliability Engineer- OpenStack, you will: Play a leading role in the operation,

SRE Specialist - System

Fortinet

Sunnyvale, California, USA

Full-time

Job Description Fortinet has an exciting opportunity for an intermediate SRE Specialist to join our FortiGuard operation team. We are managing consumer-facing services with high traffic volumes around the world. Service Reliability and Security is our top priority. This is a unique opportunity to join an established team of experienced professionals to work on some of the most innovative technology and network security products on the market. Job Responsibilities Linux System Administration:

SRE Specialist

Fortinet

Sunnyvale, California, USA

Full-time

Job Description Fortinet has an exciting opportunity for an experienced SRE Specialist to join our FortiGuard operation team. We are managing the consumer-facing services with high traffic volumes around the world. Service Reliability and Security is our top priority. This is a unique opportunity to join an established team of experienced professionals to work on some of the most innovative technology and network security products on the market. Job Responsibilities: Design and deployment of

SRE Specialist - Infrastructure

Fortinet

Sunnyvale, California, USA

Full-time

Job Description Fortinet has an exciting opportunity for an intermediate SRE Specialist to join our FortiGuard operation team. We are managing consumer-facing services with high traffic volumes around the world. Service Reliability and Security is our top priority. This is a unique opportunity to join an established team of experienced professionals to work on some of the most innovative technology and network security products on the market. Job Responsibilities Linux Server Administration (U

Senior Site Reliability Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer. At NVIDIA, you'll be part of the team shaping the future of computing and guaranteeing the smooth operation of our brand-new technologies. Our mission is to leverage AI's power to build outstanding and pioneering solutions that have a significant impact on the world. What you'll be doing: Own the solutions you build, collaborating with cross-functional teams to successfully implement them. Collaborate with various teams

Senior Site Reliability Engineer - DGX Cloud

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large-scale production systems with high efficiency and availability using a combination of software and systems engineering practices. This is a highly specialized discipline which demands knowledge across different systems, networking, coding, database, capacity management, continuous delivery and deployment, and open source cloud enabling technologies like Kubernetes and OpenStack. SRE at N

Senior Site Reliability Engineer

Circles Inc.

Remote or San Francisco, California, USA

Full-time

Circle is a financial technology company at the epicenter of the emerging internet of money, where value can finally travel like other digital data - globally, nearly instantly and less expensively than legacy settlement systems. This ground-breaking new internet layer opens up previously unimaginable possibilities for payments, commerce and markets that can help raise global economic prosperity and enhance inclusion. Our infrastructure - including USDC, a blockchain-based dollar - helps busines

Staff Site Reliability Engineer, Fleetnet

Tesla Motors

Remote or Palo Alto, California, USA

Full-time

We are a product focused global team creating the next-generation of server-side infrastructure and code to support the growing suite of Tesla products and services. We are looking for seasoned SREs with domain expertise in areas related to developing infrastructure as a service, Kubernetes, Gitops, K8s Operator development, and platform security. The Fleetnet SRE team is part of the Vehicle Software division and is embedded with our backend application, data platform and navigation development

Sr. Site Reliability Engineer, Dojo

Tesla Motors

Palo Alto, California, USA

Full-time

We are seeking an experienced Site Reliability Engineer (SRE) to join our team responsible for ensuring the reliability and performance of our Dojo cluster infrastructure. The successful candidate will be responsible for providing exceptional customer response and support, managing third-party systems, and collaborating with various teams to ensure seamless operations. If you have a passion for troubleshooting, automation, and collaboration, we encourage you to apply. Responsibilities Respond to c

Sr. Site Reliability Engineer, Compute SRE

Roblox

San Mateo, California, USA

Full-time

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators. At Roblox, we're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We're on a mission to connect a billion people with op

Lead Observability Engineer Sumo Logic

VLink Inc

Remote

Third Party, Contract

VLink is a leading global provider of software engineering services with next-gen technologies and best-in-class talent. With offices in 7+ countries from North America and Europe to APAC, and expansion plans in the Middle East, VLink has helped SMBs and large enterprises achieve their business goals, and gained the trust of Fortune-250 companies. VLink is 'Great Place to Work Certified' and has been a consistent winner as Best Places to Work in CT. Trust, collaboration, and accountability are the th

Sr. Site Reliability Engineer, Energy Software

Tesla Motors

Palo Alto, California, USA

Full-time

Tesla is looking for a Site Reliability Engineer to build, enhance, and scale the infrastructure that underpins our Energy IoT applications. These applications provide real-time monitoring, optimization, and control for Tesla's industry-leading energy products, including Powerwall, Megapack, Solar Roof, Supercharger, Wall Connector, Autobidder, and Virtual Power Plants. We are a high-impact team that values curiosity, learning, mentorship, open discourse, and making disciplined decisions by weig

Senior Site Reliability Engineer

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Are you excited by the prospect of working with innovative security products? Do you enjoy creating innovative and strategic solutions to solve complex problems? Join Guardicore (now Akamai Enterprise Security Group)! Guardicore is changing the way organizations protect their data centers and clouds. Our team boasts some of the most talented and experienced cyber security and data center. We're always looking for new people to inspire us and make us bett