Site Reliability Engineer Jobs in California

Refine Results
61 - 80 of 313 Jobs

Senior Site Reliability Engineer

Aduril Industries

Costa Mesa, California, USA

Full-time

Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century's most innovative companies to the defense industry, Anduril is changing how military systems are designed, built and sold. Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data streams into a realtime, 3D command and c

Sr Implementation Lead, SRE (CoP)

Northern Trust

Remote or Chicago, Illinois, USA

Full-time

About Northern Trust: Northern Trust, a Fortune 500 company, is a globally recognized, award-winning financial institution that has been in continuous operation since 1889. Northern Trust is proud to provide innovative financial services and guidance to the world's most successful individuals, families, and institutions by remaining true to our enduring principles of service, expertise, and integrity. With more than 130 years of financial experience and over 22,000 partners, we serve the world'

Senior Site Reliability Engineer - Connected Factory

Aduril Industries

Costa Mesa, California, USA

Full-time

Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century's most innovative companies to the defense industry, Anduril is changing how military systems are designed, built and sold. Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data streams into a realtime, 3D command and c

Senior Site Reliability Engineer

LiveRamp

San Francisco, California, USA

Full-time

LiveRamp is the data collaboration platform of choice for the world's most innovative companies. A groundbreaking leader in consumer privacy, data ethics, and foundational identity, LiveRamp is setting the new standard for building a connected customer view with unmatched clarity and context while protecting precious brand and consumer trust. LiveRamp offers complete flexibility to collaborate wherever data lives to support the widest range of data collaboration use cases-within organizations, b

Site Reliability Engineer - AI Cloud

SUPERMICRO COMPUTER INC

San Jose, California, USA

Full-time

Job Req ID: 26861 About Supermicro: Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop/ Big Data, Hyperscale, HPC and IoT/Embedded customers worldwide. We are the #5 fastest growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, pass

Site Reliability Engineer II - Real-Time

Esri

Redlands, California, USA

Full-time

Overview Join us to work collaboratively with our talented team of dynamic and passionate engineers to deliver capabilities that enable our customers to make a difference. You'll deploy and operate ArcGIS Velocity and ArcGIS Workflow Manager SaaS solutions. You will also have the opportunity to design, deploy, and operate next-generation real-time and big data GIS software-as-a-service (SaaS) capabilities for thousands of cloud users worldwide. Our teams have a broad mix of experience levels a

Site Reliability Engineer, Capcut - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A92523 Apply to this job Share this listing: Responsibilities Team Intro: CapCut is an all-in-one video editing app that empowers creators to express themselves and transform videos into creative masterpieces. In addition to its basic features, such as video editing, text, stickers, filters, colors and music, CapCut offers free advanced features, including keyframe animation, smooth slow-motion effects, chroma key, Picture-in-Pictur

Senior Site Reliability Engineer

Circles Inc.

Remote or San Francisco, California, USA

Full-time

Circle is a financial technology company at the epicenter of the emerging internet of money, where value can finally travel like other digital data - globally, nearly instantly and less expensively than legacy settlement systems. This ground-breaking new internet layer opens up previously unimaginable possibilities for payments, commerce and markets that can help raise global economic prosperity and enhance inclusion. Our infrastructure - including USDC, a blockchain-based dollar - helps busines

Staff Site Reliability Engineer, Incident and Disaster

Dropbox Inc

Remote

Full-time

Dropbox is a Virtual First company. For this role, we are hiring in Zones 2 and 3. Please refer to our Compensation section below to see what neighborhoods fall under each Zone. Role Description The Incident and Disaster Team aims to reduce Customer pain by speeding up incident response through standardized incident management processes and tooling as well as through incident prevention strategies such as disaster readiness , chaos testing, safer tooling, stronger controls, automated conformanc

Senior DevOps Engineer

SVMT, Inc

San Francisco, California, USA

Full-time

We re building from scratch and we need a DevOps expert to lead the way. Our current setup is in an early stage (single VMs, semi-manual Kubernetes). You ll design and implement scalable, secure systems with complete autonomy. You re a fit if you have: 4+ years DevOps/SRE experience (5+ ideal)Strong with Kubernetes, CI/CD, cloud infra, IaCStartup experience and a bias for actionBonus: B2B SaaS, AI/ML, or contract-heavy product experienceWhy You ll Love It Greenfield infra build it your wayWork w

Site Reliability Engineer - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A181103B Apply to this job Share this listing: Responsibilities Site Reliability Engineering(SRE) at TikTok combines software and systems engineering to build and run large-scale, massively distributed, and fault-tolerant systems. In our team, you'll have the opportunity to manage the complex challenges of scale, while using expertise in coding, algorithms, complexity analysis, and large-scale system design. We embrace a culture of

Software Engineer LMTS (Site Reliability Engineering)

Salesforce

San Francisco, California, USA

Full-time

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Job Category Software Engineering Job Details About Salesforce We're Salesforce, the Customer Company, inspiring the future of business with AI+ Data +CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too - driving you

SRE/Devops/Kubernetes/Python

Infonex Technologies, Inc.

Pleasanton, California, USA

Contract

Position: Devops/KUBERNETES -Open Position-CA Type: contract Duration: 12+ months Location: Pleasanton, CA Job Description: Required Skills: Spark Hadoop/CDH H2O/Steam MapR Kubernetes Docker Tensorflow Apache Airflow Jupyterhub Rstudio PyTorch ELK OpenVino MySql GitLab Traefik Prometheus, Grafana, Node Manager, Alert Manager Vault Notes: Currently client has on prem environment The client wants experience in containerization with Kubernetes, Vault, Slurm with Rstudio hook all the components

Senior II Site Reliability Engineer

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you enjoy collaborating with teams to solve complex challenges? \n Do you have a passion for cutting edge technologies? \n Join our Compute Team! \n Our team designs, develops, and manages applications and infrastructure that support Akamai Cloud's products and services. Our SRE teams solve reliability, security, and usability at scale for our global fleet while maintaining Akamai's mission at the forefront of what we do: make life better for billions of people, billions of times a day. \n Pa

Senior Site Reliability Engineer Staff, Classified Systems

Lockheed Martin Corporation

Sunnyvale, California, USA

Full-time

Job Description As a Site Reliability Engineer, you will: Design, implement, and maintain highly available and scalable systems and infrastructure to support classified applications and services Develop and implement reliability-focused engineering practices, such as continuous integration, continuous deployment, and continuous monitoring, while ensuring compliance with classified system requirements Collaborate with development teams to ensure that reliability and scalability are considered th

Senior Site Reliability Engineer, Core AI Infrastructure

Coinbase

Remote

Full-time

Ready to be pushed beyond what you think you're capable of? At Coinbase, our mission is to increase economic freedom in the world. It's a massive, ambitious opportunity that demands the best of us, every day, as we build the emerging onchain platform - and with it, the future global financial system. To achieve our mission, we're seeking a very specific candidate. We want someone who is passionate about our mission and who believes in the power of crypto and blockchain technology to update the

Cloud Games Site Reliability Engineer L5 - Open Connect

Netflix, Inc.

Remote

Full-time

Netflix is one of the world's leading entertainment services, with over 300 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time. The Netflix Open Connect Content Delivery Network is our in-house, custom-built network and server infrastructure responsible for streaming all of your favorite movies

Cloud DevOps Engineer - W2 - CTH - Remote (Posted by SAM)

Global Force USA

Remote

Contract

Required Qualifications 10+ years in DevOps, SRE, or infrastructure roles, including time in senior or lead positionsStrong Linux background and experience managing large-scale systemsExperience automating deployments and operational tasksStrong skills in debugging distributed systems and understanding infrastructure failuresHands-on experience with on-prem enterprise software delivery and supportProficiency in scripting (Bash, Python, PowerShell, etc.)Strong communication and collaboration ski

Site Reliability Engineer II

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you enjoy collaborating with teams to solve complex challenges? Do you have a passion for cutting edge technologies and tackling system problems? Join our highly skilled Storage team Our team designs, deploys, and manages applications and infrastructure that supports Akamai's internal and customer-facing cloud storage platforms. We do this while maintaining Akamai's mission to make life better for billions of people, billions of times a day. Partner with the best In this role, you will c

Site Reliability Engineer, Hardware and Infrastructure (Starshield)

SpaceX

Hawthorne, California, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER, HARDWARE AND INFRASTRUCTURE (STARSHIELD) At SpaceX we're leveraging our experience in building rockets and spacecraft to deploy the Starshield constellation. Starshield is the world's larges