reliability engineer Jobs in san jose, ca

Refine Results
81 - 100 of 144 Jobs

Site Reliability Engineer, Infrastructure and Assurance Services - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A236046A Apply to this job Share this listing: Responsibilities The Infra SRE-Infrastructure-Assurance team extends TikTok infrastructure's operability, observability, visibility, and automation. We aim to provide holistic insights and solutions to TikTok infrastructure with minimal manual interventions. We're young and fast-growing. Our team values transparency, collaboration, hard-work and innovations. We believe in planning and l

Site Reliability Engineer - Global SRE, Monetization Technology

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : R2861 Apply to this job Share this listing: Responsibilities TikTok is one of the fastest growing apps in the world, and we're seeking Site Reliability Engineers (SREs) to join our monetization technology team. The monetization technology team works on building and running large-scale, globally distributed, fault-tolerant ads systems. SREs keep the systems up and running with the highest level of availability, ensuring our users hav

Data Site Reliability Engineer, Video Platform - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A30565 Apply to this job Share this listing: Responsibilities Team Intro This is a Site Reliability Engineer role, focusing on the data pipeline reliability for the Video Platform team in USDS. Data SREs monitor data and keep production batch and realtime processing jobs up and running with the highest level of availability, ensuring our users have the freshest, complete and correct data possible. In order to enhance collaboration a

Principal Site Reliability Engineer Cloud Identity & Trust (SPIFFE/SPIRE)

Pinnacle Software Solutions

San Jose, California, USA

Contract

Job Title: Cloud Compute Platform Architect Experience Level: Mid-SeniorSan Jose, CA Job SummaryWe are seeking an experienced Cloud Compute Platform Architect to design and manage a cutting-edge cloud compute platform based on SPIFFE. The ideal candidate will perform Site Reliability Engineering (SRE) responsibilities, including deployment, capacity management, observability, and performance tuning. You will collaborate closely with our Security Architecture team to define attestation methods fo

Staff Site Reliability Engineer, Cell Software

Tesla Motors

Remote or Fremont, California, USA

Full-time

Tesla is re-thinking how batteries are made from the ground up. We're designing new factories, new equipment, new processes and new software to rapidly scale battery manufacturing, globally. The primary bottleneck to Tesla's future expansion (and the transition to sustainable transport and energy storage) is our ability to produce and procure batteries - that's why we're innovating in-house, with our collection of world-class engineers, to redefine the industry. Software, data and automation all

Site Reliability Engineer (with strong dev background)

Artech, LLC

Remote

Contract

Summary Our organization builds and provides systems and infrastructure that fuel our core services. We are the foundation on which our software developers build the products that our customers love. We are looking for passionate and dedicated Site Reliability Engineers to continue our focus on providing our customers the highest quality Services experience. Our services have to scale globally, stay highly available, and "just work. If you love designing, engineering and running systems and inf

Senior Site Reliability Engineer

Circles Inc.

Remote or San Francisco, California, USA

Full-time

Circle is a financial technology company at the epicenter of the emerging internet of money, where value can finally travel like other digital data - globally, nearly instantly and less expensively than legacy settlement systems. This ground-breaking new internet layer opens up previously unimaginable possibilities for payments, commerce and markets that can help raise global economic prosperity and enhance inclusion. Our infrastructure - including USDC, a blockchain-based dollar - helps busines

Senior Site Reliability Engineer - Global SRE, Monetization Technology

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A180769 Apply to this job Share this listing: Responsibilities TikTok is one of the fastest growing apps in the world, and we're seeking Site Reliability Engineers (SREs) to join our monetization technology team. The monetization technology team works on building and running large-scale, globally distributed, fault-tolerant ads systems. SREs keep the systems up and running with the highest level of availability, ensuring our users h

Internship, Site Reliability Engineer, Applications Engineering (Fall 2025)

Tesla Motors

Fremont, California, USA

Full-time

Consider before submitting an application: This position is expected to start around September 2025 and continue through the Fall term (approximately December 2025) or into Spring 2026 if available and there is an opportunity to do so. We ask for a minimum of 12 weeks, full-time and on-site, for most internships. Our internship program is for students who are actively enrolled in an academic program. entry level candidates seeking employment after graduation and not returning to school should a

Sr. Site Reliability Engineer, Dojo

Tesla Motors

Palo Alto, California, USA

Full-time

We are seeking an experienced Site Reliability Engineer (SRE) to join our team responsible for ensuring the reliability, performance of our Dojo cluster infrastructure. The successful candidate will be responsible for providing exceptional customer response and support, managing third-party systems, and collaborating with various teams to ensure seamless operations. If you have a passion for troubleshooting, automation, and collaboration, we encourage you to apply. Responsibilities Respond to c

Site Reliability Engineer Graduate (TikTok Product - USDS) - 2025 Start (BS/MS)

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A224009 Apply to this job Share this listing: Responsibilities About the Team Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed and fault-tolerant systems. Product SREs help ensure the reliability and uptime for the services underpinning the TikTok product. Our team pays great attention to optimizing existing systems, working closely with cross functional

Staff Site Reliability Engineer, Incident and Disaster

Dropbox Inc

Remote

Full-time

Dropbox is a Virtual First company. For this role, we are hiring in Zones 2 and 3. Please refer to our Compensation section below to see what neighborhoods fall under each Zone. Role Description The Incident and Disaster Team aims to reduce Customer pain by speeding up incident response through standardized incident management processes and tooling as well as through incident prevention strategies such as disaster readiness , chaos testing, safer tooling, stronger controls, automated conformanc

Site Reliability Engineer

AE Business Solutions

Remote

Full-time

AEBS is seeking a Site Reliability Engineer to take on a fully-remote, contract position! The Site Reliability Engineer must be able to work central time zone hours throughout this engagement. *No C2C inquiries, please Ideal Skills/Background: 3+ years of experience implementing Infrastructure-as-Code (IAC) with Terraform 3+ years of Python development experience 3+ years of experience in designing/developing/operating applications in the Azure cloud 2+ years of experience building and running

Senior Platform Engineer - 100% remote from anywhere in the US

Calance

Remote or New York, New York, USA

Full-time

Position Summary: Our client is seeking a highly skilled and experienced Senior Platform Developer II to join their team. This pivotal role will be instrumental in building, scaling, and maintain the robust and secure infrastructure that powers our mission-critical platform. You will be a force multiplier, enabling our development teams to deliver features faster and more reliably. You will champion infrastructure-as-code principles, contribute code to platform scalability, drive automation acr

Site Reliability Engineer

Madison-Davis, LLC

Remote

Contract

Role: Drive the technical implementation of monitoring and alerting strategies across enterprise-scale applications and infrastructure.Collaborate directly with development teams to ensure each new initiative includes the correct telemetry, log tagging, and alert payloads.Act as a liaison to Level 2 and Level 3 support teams to maintain and enhance monitoring dashboards used by the enterprise command center (EMC).Standardize alert formats to ensure consistency with SRE policies and support downs

Site Reliability Engineer

Zachary Piper Solutions, LLC

Remote

Full-time

Piper Companies is seeking a Remote Site Reliability Engineer to join a leading cybersecurity and cloud consulting firm. The Site Reliability Engineer will play a key role in building and maintaining secure, scalable infrastructure while supporting automation, compliance, and operational excellence across client environments. Responsibilities of the Site Reliability Engineer include: Develop and deploy automation scripts, tooling, and infrastructure to meet client needsManage patching processes

Site Reliability Engineer

General Dynamics

Remote or Aurora, Colorado, USA

Full-time

Basic Qualifications Bachelor's degree in Computer Science, a related field or equivalent experience is required plus a minimum of 5 years of relevant experience; or Master's degree plus 3 years of relevant experience. CLEARANCE REQUIREMENTS: Department of Defense TS/SCI security clearance is required at time of hire. Applicants selected will be subject to a U.S. Government security investigation and must meet eligibility requirements for access to classified information. Due to the nature of

Site Reliability Engineer

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you enjoy collaborating with teams to solve complex challenges? Do you have a passion for cutting edge technologies and tackling system problems? Join our highly skilled Site Reliability team Our team monitors and measures the reliability of our suite of Compute products and platform. In collaboration with Engineering and Product teams, we improve the performance and reliability of the products we support. Partner with the best You will apply statistical data analysis and an understandin

Sr. Site Reliability Engineer, Compute SRE

Roblox

San Mateo, California, USA

Full-time

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences- all created by our global community of developers and creators. At Roblox, we're building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device.We're on a mission to connect a billion people with op

Principal Site Reliability Engineer

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you enjoy collaborating with teams to solve complex challenges? Do you have a passion for cutting edge technologies? Join our Compute Team! Our team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We do this while maintaining Akamai's mission at the forefront of what we do: make life better for billions of people, billions of times a day. Partner with the best As a Principal Site Reliability Engineer in the Virtualizatio