reliability engineer Jobs in california

Refine Results
121 - 140 of 208 Jobs

Site Reliability Engineer - Recommendation Infrastructure

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A166137A Apply to this job Share this listing: Responsibilities Our Recommendation Infrastructure Team is responsible for building up and optimizing the architecture for our recommendation system to provide the most stable and best experience for our TikTok users. SREs in our team keep the systems up and running with the highest level of availability, and create highly automated systems and pipelines. What You'll Do Engage in and im

Site Reliability Engineer, Infrastructure Security

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : 3CNV Apply to this job Share this listing: Responsibilities Our Infrastructure Engineering team supports the company's fast growth by building and operating hyper-scale datacenters, managing the life cycle of server fleet, providing cloud solutions, and developing various infrastructure services and making sure they are scalable and are reliable. Responsibilities - Conduct security reviews of core corporate and production infrastruc

Site Reliability Engineer, Compute Platform

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A174647 Apply to this job Share this listing: Responsibilities Team Introduction Our Compute Platform SRE team supports all Big Data services and products across the company. We are a newly established team and waiting for talents like you to shape the team's future together. We are responsible for the reliability of all the company's major data warehouse products, services, and query engines. We serve business needs across domains

Site Reliability Engineer - Data Infrastructure

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A07367 Apply to this job Share this listing: Responsibilities Our data infrastructure Site Reliability Engineering (SRE) team is a pioneer in innovation. We seamlessly merge software development and infrastructure operations to design, build, and manage large-scale, highly distributed systems. We take pride in overseeing one of the industry's most extensive cloud infrastructures. As software development evolves, building systems fro

Site Reliability Engineer, Client Platform

Coinbase

Remote

Full-time

Ready to be pushed beyond what you think you're capable of? At Coinbase, our mission is to increase economic freedom in the world. It's a massive, ambitious opportunity that demands the best of us, every day, as we build the emerging onchain platform - and with it, the future global financial system. To achieve our mission, we're seeking a very specific candidate. We want someone who is passionate about our mission and who believes in the power of crypto and blockchain technology to update the

Site Reliability Engineer - Remote

Donnelley Financial Solutions

Remote

Full-time

Join a dynamic team at the pulse of global markets, where we deliver innovative software and service solutions for essential financial reporting and capital markets transactions. At DFIN, we are a values-driven organization that empowers you to build a fulfilling career while bringing your authentic self to work every day. Our "Win as One" mentality ensures that our team's success is directly linked to Client, Shareholder and Employee Satisfaction. Recognized by Newsweek as one of AMERICA'S MOST

Site Reliability Engineer-FedRAMP (FULLY REMOTE)

Splunk Inc.

Colorado, USA

Full-time

Description Join us as we pursue our ground-breaking vision to make machine data accessible, usable, and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk, we are committed to our work, customers, having fun, and most significantly to each other's success. The Splunk Observability Cloud provides full-fidelity monitoring and fixing across infrastructure, applications, and user inter

Senior Fullstack Product Software Engineer, Dash

Dropbox Inc

Remote

Full-time

Dropbox is a Virtual First company. For this role, we are hiring in Zones 2 and 3. Please refer to our Compensation section below to see what neighborhoods fall under each Zone. Company Description Dropbox isn't just a workplace-it's a living lab for more enlightened ways of working. We're a global community of bold visionaries and resourceful doers who are shaping the future of Dropbox-and with it the future of work. Our Virtual First model combines the autonomy of a distributed workplace with

Site Reliability Engineer

Fortinet

Sunnyvale, California, USA

Full-time

Job Description At Fortinet, we strive to provide a supportive, collaborative environment where people are empowered to do the best work of their careers. Our team members enjoy solving complex problems, and obsess over getting the details right. We love what we do and are proud of our work to secure clouds and container environments for thousands of b2b customers worldwide. Our team is growing, and we are looking for engineers with passion for automation. You will help support the Lacework p

Site Reliability Engineer, Kubernetes Platform (Starshield)

SpaceX

Hawthorne, California, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER, KUBERENTES PLATFORM (STARSHIELD) At SpaceX we're leveraging our experience in building rockets and spacecraft to deploy the Starshield constellation. Starshield is the world's largest US gov

Site Reliability Engineer, Hardware and Infrastructure (Starshield)

SpaceX

Hawthorne, California, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER, HARDWARE AND INFRASTRUCTURE (STARSHIELD) At SpaceX we're leveraging our experience in building rockets and spacecraft to deploy the Starshield constellation. Starshield is the world's larges

Automation Developer / Site Reliability Engineer

Princeton IT Services

MX

Contract

Job Title: Platform SRE Automation Developer / Site Reliability Engineer Job Location: Remote in Mexico Job Type; Full time contract Job Summary: This team's engineers support the growing consumer credit card business. The platform is built on a microservice architecture on a modern technology stack hosted in AWS public cloud and uses state of the art development practices and tooling for SDLC, with observability tools such as Datadog, Prometheus, Splunk, etc.Our engineers are responsible

Site Reliability Engineer (Entry Level - Hybrid)

Thomson Reuters U.S. Inc.

Remote or Eagan, Minnesota, USA

Full-time

Site Reliability Engineer Thomson Reuters is the leading source of intelligent information for the world's businesses and professionals, providing customers with a competitive advantage. Our team is responsible for ensuring the reliability and resiliency of our largest online products. This role executes reliability engineering, operational readiness, ongoing change planning and implementation, incident detection and resolution, and compliance for a portfolio of applications and infrastructure

Site Reliability Engineer:: San Jose, CA

VLink Inc

San Jose, California, USA

Contract

Hybrid: 2-3 days/Week onsite depending on work Role: Site Reliability Engineer Location: San Jose, CA Duration: 12 Months Job Description: Primary: Linux Administration (e.g., common commands to check/alter config)PythonGo or Java (Preferable)Debugging & Troubleshooting (Application and Infrastructure) production performance issuesKnowledge of MQKubernetes AdministrationCICD Tooling & DevOps Automation Secondary: Shell ScriptingKnowledge of ContainersHave exposure to distributed systems, e.g.,

Deployment, Site Reliability Engineer - Connected Warfare

Aduril Industries

Costa Mesa, California, USA

Full-time

Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century's most innovative companies to the defense industry, Anduril is changing how military systems are designed, built and sold. Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data streams into a realtime, 3D command and c

Site Reliability Engineer (SRE) - OSTTRA Hybrid

S&P Consultants

Remote or New York, New York, USA

Full-time

About the Role: The Role: Site Reliability Engineer (SRE) The Team: SRE is a global team that provides technical support across the suite of OSTTRA products. The SRE team works closely with a highly competent Technical Operation Centre (TOC), Development and Infrastructure teams to deliver proactive tasks to improve the supportability of our platforms. Our work helps to ensure that OSTTRA provides a high-quality service and maintains client satisfaction. The Impact: Together, we build, suppor

Site Reliability Engineer, Edge Services- USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A124549A Apply to this job Share this listing: Responsibilities Team Insight: CDN Site Reliability Engineering combines software and network engineering with system operations to build and run large-scale, massively distributed infrastructure. Our Edge SREs ensure infrastructure services are reliable, fault-tolerant, efficiently scalable and cost-effective. We dive deep into the stack, including network, OS, and applications, to qui

Sr SRE (Python & Kubernetes)

Ledgent Technology

Irvine, California, USA

Full-time

No Corp-to-Corp, No 3rd party firms . Job Title: Sr Site Reliability Engineer SRE Location: 100% onsite in Irvine, CA Employment Type: Direct-hire Compensation: $140,000 to $180,000 (based on level of experience). . Partnered with a client who is at the forefront of the future innovation hub of next-generation networking, IoT smart home products, and software services. Be a part of a pivotal time in propelling the global ventures. Join their mission in shaping a technology-driven future. They

Senior Site Reliability Engineer, Core AI Infrastructure

Coinbase

Remote

Full-time

Ready to be pushed beyond what you think you're capable of? At Coinbase, our mission is to increase economic freedom in the world. It's a massive, ambitious opportunity that demands the best of us, every day, as we build the emerging onchain platform - and with it, the future global financial system. To achieve our mission, we're seeking a very specific candidate. We want someone who is passionate about our mission and who believes in the power of crypto and blockchain technology to update the

Site Reliability Engineer Intern (US remote - Fall 2025)

Splunk Inc.

Colorado, USA

Full-time

Description Splunk, a Cisco company, is building a safer and more resilient digital world with an end-to-end full stack platform made for a hybrid, multi-cloud world. Leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. Our customers love our technology, but it's our caring employees that make Splunk stand out as an amazing career destination. No matter where in the world or what level of the organization, we approach our wor