Site Reliability Engineer Jobs in California

Refine Results
61 - 80 of 306 Jobs

Senior Site Reliability Engineer

LiveRamp

San Francisco, California, USA

Full-time

LiveRamp is the data collaboration platform of choice for the world's most innovative companies. A groundbreaking leader in consumer privacy, data ethics, and foundational identity, LiveRamp is setting the new standard for building a connected customer view with unmatched clarity and context while protecting precious brand and consumer trust. LiveRamp offers complete flexibility to collaborate wherever data lives to support the widest range of data collaboration use cases-within organizations, b

Site Reliability Engineer - AI Cloud

SUPERMICRO COMPUTER INC

San Jose, California, USA

Full-time

Job Req ID: 26861 About Supermicro: Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop/ Big Data, Hyperscale, HPC and IoT/Embedded customers worldwide. We are the #5 fastest growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, pass

Site Reliability Engineer II - Real-Time

Esri

Redlands, California, USA

Full-time

Overview Join us to work collaboratively with our talented team of dynamic and passionate engineers to deliver capabilities that enable our customers to make a difference. You'll deploy and operate ArcGIS Velocity and ArcGIS Workflow Manager SaaS solutions. You will also have the opportunity to design, deploy, and operate next-generation real-time and big data GIS software-as-a-service (SaaS) capabilities for thousands of cloud users worldwide. Our teams have a broad mix of experience levels a

Site Reliability Engineer, Capcut - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A92523 Apply to this job Share this listing: Responsibilities Team Intro: CapCut is an all-in-one video editing app that empowers creators to express themselves and transform videos into creative masterpieces. In addition to its basic features, such as video editing, text, stickers, filters, colors and music, CapCut offers free advanced features, including keyframe animation, smooth slow-motion effects, chroma key, Picture-in-Pictur

Senior Site Reliability Engineer

Circles Inc.

Remote or San Francisco, California, USA

Full-time

Circle is a financial technology company at the epicenter of the emerging internet of money, where value can finally travel like other digital data - globally, nearly instantly and less expensively than legacy settlement systems. This ground-breaking new internet layer opens up previously unimaginable possibilities for payments, commerce and markets that can help raise global economic prosperity and enhance inclusion. Our infrastructure - including USDC, a blockchain-based dollar - helps busines

Staff Site Reliability Engineer, Incident and Disaster

Dropbox Inc

Remote

Full-time

Dropbox is a Virtual First company. For this role, we are hiring in Zones 2 and 3. Please refer to our Compensation section below to see what neighborhoods fall under each Zone. Role Description The Incident and Disaster Team aims to reduce Customer pain by speeding up incident response through standardized incident management processes and tooling as well as through incident prevention strategies such as disaster readiness , chaos testing, safer tooling, stronger controls, automated conformanc

Site Reliability Engineer - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A181103B Apply to this job Share this listing: Responsibilities Site Reliability Engineering(SRE) at TikTok combines software and systems engineering to build and run large-scale, massively distributed, and fault-tolerant systems. In our team, you'll have the opportunity to manage the complex challenges of scale, while using expertise in coding, algorithms, complexity analysis, and large-scale system design. We embrace a culture of

Software Engineer LMTS (Site Reliability Engineering)

Salesforce

San Francisco, California, USA

Full-time

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Job Category Software Engineering Job Details About Salesforce We're Salesforce, the Customer Company, inspiring the future of business with AI+ Data +CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too - driving you

SRE/Devops/Kubernetes/Python

Infonex Technologies, Inc.

Pleasanton, California, USA

Contract

Position: Devops/KUBERNETES -Open Position-CA Type: contract Duration: 12+ months Location: Pleasanton, CA Job Description: Required Skills: Spark Hadoop/CDH H2O/Steam MapR Kubernetes Docker Tensorflow Apache Airflow Jupyterhub Rstudio PyTorch ELK OpenVino MySql GitLab Traefik Prometheus, Grafana, Node Manager, Alert Manager Vault Notes: Currently client has on prem environment The client wants experience in containerization with Kubernetes, Vault, Slurm with Rstudio hook all the components

Senior II Site Reliability Engineer

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you enjoy collaborating with teams to solve complex challenges? \n Do you have a passion for cutting edge technologies? \n Join our Compute Team! \n Our team designs, develops, and manages applications and infrastructure that support Akamai Cloud's products and services. Our SRE teams solve reliability, security, and usability at scale for our global fleet while maintaining Akamai's mission at the forefront of what we do: make life better for billions of people, billions of times a day. \n Pa

Senior DevOps Engineer

SVMT, Inc

San Francisco, California, USA

Full-time

We re building from scratch and we need a DevOps expert to lead the way. Our current setup is in an early stage (single VMs, semi-manual Kubernetes). You ll design and implement scalable, secure systems with complete autonomy. You re a fit if you have: 4+ years DevOps/SRE experience (5+ ideal)Strong with Kubernetes, CI/CD, cloud infra, IaCStartup experience and a bias for actionBonus: B2B SaaS, AI/ML, or contract-heavy product experienceWhy You ll Love It Greenfield infra build it your wayWork w

Senior Site Reliability Engineer Staff, Classified Systems

Lockheed Martin Corporation

Sunnyvale, California, USA

Full-time

Job Description As a Site Reliability Engineer, you will: Design, implement, and maintain highly available and scalable systems and infrastructure to support classified applications and services Develop and implement reliability-focused engineering practices, such as continuous integration, continuous deployment, and continuous monitoring, while ensuring compliance with classified system requirements Collaborate with development teams to ensure that reliability and scalability are considered th

Senior Site Reliability Engineer, Core AI Infrastructure

Coinbase

Remote

Full-time

Ready to be pushed beyond what you think you're capable of? At Coinbase, our mission is to increase economic freedom in the world. It's a massive, ambitious opportunity that demands the best of us, every day, as we build the emerging onchain platform - and with it, the future global financial system. To achieve our mission, we're seeking a very specific candidate. We want someone who is passionate about our mission and who believes in the power of crypto and blockchain technology to update the

Cloud Games Site Reliability Engineer L5 - Open Connect

Netflix, Inc.

Remote

Full-time

Netflix is one of the world's leading entertainment services, with over 300 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time. The Netflix Open Connect Content Delivery Network is our in-house, custom-built network and server infrastructure responsible for streaming all of your favorite movies

Site Reliability Engineer II

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you enjoy collaborating with teams to solve complex challenges? Do you have a passion for cutting edge technologies and tackling system problems? Join our highly skilled Storage team Our team designs, deploys, and manages applications and infrastructure that supports Akamai's internal and customer-facing cloud storage platforms. We do this while maintaining Akamai's mission to make life better for billions of people, billions of times a day. Partner with the best In this role, you will c

Site Reliability Engineer (SRE) - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A259032A Apply to this job Share this listing: Responsibilities The security team is missioned to run and operate security infrastructures, platforms and technologies, as well as to support cross-functional teams to protect our users, products and infrastructures. In this team you'll have a unique opportunity to have first-hand exposure to the strategy of the company in key security initiatives, especially in deploying and maintaini

Cloud DevOps Engineer - W2 - CTH - Remote (Posted by SAM)

Global Force USA

Remote

Contract

Required Qualifications 10+ years in DevOps, SRE, or infrastructure roles, including time in senior or lead positionsStrong Linux background and experience managing large-scale systemsExperience automating deployments and operational tasksStrong skills in debugging distributed systems and understanding infrastructure failuresHands-on experience with on-prem enterprise software delivery and supportProficiency in scripting (Bash, Python, PowerShell, etc.)Strong communication and collaboration ski

Site Reliability Engineer, Hardware and Infrastructure (Starshield)

SpaceX

Hawthorne, California, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER, HARDWARE AND INFRASTRUCTURE (STARSHIELD) At SpaceX we're leveraging our experience in building rockets and spacecraft to deploy the Starshield constellation. Starshield is the world's larges

Site Reliability Engineer, Kubernetes Platform (Starshield)

SpaceX

Hawthorne, California, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER, KUBERENTES PLATFORM (STARSHIELD) At SpaceX we're leveraging our experience in building rockets and spacecraft to deploy the Starshield constellation. Starshield is the world's largest US gov

(USA) Staff, Site Reliability Engineer

Walmart Inc.

Remote or Denver, Colorado, USA

Full-time

Position Summary Are you ready to lead and innovate in the world of cloud databases? Join Walmart/VIZIO as a Staff, Site Reliability Engineer and drive the future of database reliability and scalability. With your expertise, you'll ensure seamless operations and cutting-edge solutions, making a significant impact on our technology landscape. What you'll do About the Team: The Database Reliability Engineering Team is dedicated to ensuring the dependability and performance of our database infras