Reliability Engineer Jobs in Seattle, WA

Refine Results
61 - 80 of 132 Jobs

Site Reliability Engineer, Recommendation Infrastructure - USDS

TikTok

Seattle, Washington, USA

Full-time

Location : Seattle Employment Type : Regular Job Code : LWR2 Apply to this job Share this listing: Responsibilities About the team The Product Engineering team monitors and maintains the availability of TikTok, including services such as video playback, content discovery/recommendations, live streaming, and customer service feedback. In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that r

Site Reliability Engineer, Edge Services - USDS

TikTok

Seattle, Washington, USA

Full-time

Location : Seattle Employment Type : Regular Job Code : A69162 Apply to this job Share this listing: Responsibilities Team Insight: CDN Site Reliability Engineering combines software and network engineering with system operations to build and run large-scale, massively distributed infrastructure. Our Edge SREs ensure infrastructure services are reliable, fault-tolerant, efficiently scalable and cost-effective. We dive deep into the stack, including network, OS, and applications, to quickl

Site Reliability Engineer, Data Platform- USDS

TikTok

Seattle, Washington, USA

Full-time

Location : Seattle Employment Type : Regular Job Code : AJP Apply to this job Share this listing: Responsibilities Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the data platform area, you will have the opportunity to manage the services and infrastructures in one of the largest data plaforms in the world. You'll need to ensure the data, servic

Senior Site Reliability Engineer, Product - USDS

TikTok

Seattle, Washington, USA

Full-time

Location : Seattle Employment Type : Regular Job Code : Y5404 Apply to this job Share this listing: Responsibilities About the team The Product Engineering team monitors and maintains the availability of TikTok, including services such as video playback, content discovery/recommendations, live streaming, and customer service feedback. In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that

Senior Site Reliability Engineer - Data Infrastructure

TikTok

Seattle, Washington, USA

Full-time

Location : Seattle Employment Type : Regular Job Code : G9596 Apply to this job Share this listing: Responsibilities Our data infrastructure Site Reliability Engineering (SRE) team is a pioneer in innovation. We seamlessly merge software development and infrastructure operations to design, build, and manage large-scale, highly distributed systems. We take pride in overseeing one of the industry's most extensive cloud infrastructures. As software development evolves, building systems from

Sr. Kubernetes Platform Site Reliability Engineer (Starlink)

SpaceX

Redmond, Washington, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SR. KUBERNETES PLATFORM SITE RELIABILITY ENGINEER (STARLINK) At SpaceX we're leveraging our experience in building rockets and spacecraft to deploy Starlink, the world's most advanced broadband internet system. Starli

Sr. Hardware / Infrastructure Site Reliability Engineer (Starlink)

SpaceX

Redmond, Washington, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SR. HARDWARE / INFRASTRUCTURE SITE RELIABILITY ENGINEER (STARLINK) At SpaceX we're leveraging our experience in building rockets and spacecraft to deploy Starlink, the world's most advanced broadband internet system.

Senior Site Reliability Engineer

General Motors

Remote

Full-time

Job Description Develop and design software applications for driverless technology company. Duties may include: Build out and improve observability systems, tools and the related codebase. Contribute code, perform code reviews, and create technical designs that improve performance and reliability of observability systems using software and systems engineering skills. Partner with other Software Engineering teams to better understand use-cases and guide the engineers to use the existing tools eff

Site Reliability Engineer, Connected Warfare

Aduril Industries

Seattle, Washington, USA

Full-time

Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century's most innovative companies to the defense industry, Anduril is changing how military systems are designed, built and sold. Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data streams into a realtime, 3D command and c

Site Reliability Engineer, Systems - Infrastructure Engineering- Seattle

TikTok

Seattle, Washington, USA

Full-time

Location : Seattle Employment Type : Regular Job Code : J9448 Apply to this job Share this listing: Responsibilities Our Infrastructure Engineering team supports the company's fast growth by building and operating hyper-scale datacenters, managing the life cycle of server fleet, providing cloud solutions, and developing various infrastructure services and making sure they are scalable and are reliable. Roles and Responsibilities - Operate basic system infrastructures like DNS, NTP, authen

Site Reliability Engineer - AML Global Recommendation - USDS

TikTok

Seattle, Washington, USA

Full-time

Location : Seattle Employment Type : Regular Job Code : A58903 Apply to this job Share this listing: Responsibilities Site Reliability Engineering (SRE) of the AML (Applied Machine Learning) team combines system engineering and the art of machine learning to develop and run a massively distributed AI/ML recommendation system for the United States and all around the world. On the SRE team, you'll have the opportunity to sharpen your expertise in coding, performance analysis, and large-scal

Infrastructure Site Reliability Engineer (Entry Level) - USDS

TikTok

Seattle, Washington, USA

Full-time

Location : Seattle Employment Type : Regular Job Code : PNH2 Apply to this job Share this listing: Responsibilities Site Reliability Engineering (SRE) at TikTok combines software and systems engineering to build and run large-scale, massively distributed, and fault-tolerant systems. In our team, you'll have the opportunity to manage the complex challenges of scale, while using expertise in coding, algorithms, complexity analysis, and large-scale system design. We embrace a culture of dive

Site Reliability Engineer (with strong dev background)

Artech, LLC

Remote

Contract

Summary Our organization builds and provides systems and infrastructure that fuel our core services. We are the foundation on which our software developers build the products that our customers love. We are looking for passionate and dedicated Site Reliability Engineers to continue our focus on providing our customers the highest quality Services experience. Our services have to scale globally, stay highly available, and "just work. If you love designing, engineering and running systems and inf

Senior Site Reliability Engineer, Trust & Safety - USDS

TikTok

Seattle, Washington, USA

Full-time

Location : Seattle Employment Type : Regular Job Code : O4910 Apply to this job Share this listing: Responsibilities Team Intro The USDS TikTok Product Engineering SRE team works with engineering and product teams to build, maintain and run large-scale, globally distributed, observable, fault-tolerant systems. SREs on this team will deliver on production ownership and be responsible for observability and automation across complex, large-scale service mesh architectures. In order to enhanc

Site Reliability Engineer (Starshield) - Top Secret Clearance

SpaceX

Redmond, Washington, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER (STARSHIELD) - TOP SECRET CLEARANCE Starshield leverages SpaceX's Starlink technology and launch capability to support national security efforts. While Starlink is designed for consumer and c

Staff Site Reliability Engineer, Incident and Disaster

Dropbox Inc

Remote

Full-time

Dropbox is a Virtual First company. For this role, we are hiring in Zones 2 and 3. Please refer to our Compensation section below to see what neighborhoods fall under each Zone. Role Description The Incident and Disaster Team aims to reduce Customer pain by speeding up incident response through standardized incident management processes and tooling as well as through incident prevention strategies such as disaster readiness , chaos testing, safer tooling, stronger controls, automated conformanc

Site Reliability Engineer

AE Business Solutions

Remote

Full-time

AEBS is seeking a Site Reliability Engineer to take on a fully-remote, contract position! The Site Reliability Engineer must be able to work central time zone hours throughout this engagement. *No C2C inquiries, please Ideal Skills/Background: 3+ years of experience implementing Infrastructure-as-Code (IAC) with Terraform 3+ years of Python development experience 3+ years of experience in designing/developing/operating applications in the Azure cloud 2+ years of experience building and running

Senior Platform Engineer - 100% remote from anywhere in the US

Calance

Remote or New York, New York, USA

Full-time

Position Summary: Our client is seeking a highly skilled and experienced Senior Platform Developer II to join their team. This pivotal role will be instrumental in building, scaling, and maintain the robust and secure infrastructure that powers our mission-critical platform. You will be a force multiplier, enabling our development teams to deliver features faster and more reliably. You will champion infrastructure-as-code principles, contribute code to platform scalability, drive automation acr

Site Reliability Engineer

Madison-Davis, LLC

Remote

Contract

Role: Drive the technical implementation of monitoring and alerting strategies across enterprise-scale applications and infrastructure.Collaborate directly with development teams to ensure each new initiative includes the correct telemetry, log tagging, and alert payloads.Act as a liaison to Level 2 and Level 3 support teams to maintain and enhance monitoring dashboards used by the enterprise command center (EMC).Standardize alert formats to ensure consistency with SRE policies and support downs

Site Reliability Engineer

Zachary Piper Solutions, LLC

Remote

Full-time

Piper Companies is seeking a Remote Site Reliability Engineer to join a leading cybersecurity and cloud consulting firm. The Site Reliability Engineer will play a key role in building and maintaining secure, scalable infrastructure while supporting automation, compliance, and operational excellence across client environments. Responsibilities of the Site Reliability Engineer include: Develop and deploy automation scripts, tooling, and infrastructure to meet client needsManage patching processes