site reliability engineer Jobs

Refine Results
381 - 400 of 1,017 Jobs

Senior Site Reliability Engineer - (Platform)

Coinbase

Remote

Full-time

Ready to be pushed beyond what you think you're capable of? At Coinbase, our mission is to increase economic freedom in the world. It's a massive, ambitious opportunity that demands the best of us, every day, as we build the emerging onchain platform - and with it, the future global financial system. To achieve our mission, we're seeking a very specific candidate. We want someone who is passionate about our mission and who believes in the power of crypto and blockchain technology to update the

Principal Site Reliability Engineer

Microsoft Corporation

Redmond, Washington, USA

Full-time

Microsoft is a company where passionate innovators come to collaborate, envision what can be and take their careers further. This is a world of more possibilities, more innovation, more openness, and the sky is the limit thinking in a cloud-enabled world. Microsoft's Azure Data engineering team is leading the transformation of analytics in the world of data with products like databases, data integration, big data analytics, messaging & real-time analytics, and business intelligence. The product

ClickHouse SRE, Data Platform -USDS

TikTok

New York, New York, USA

Full-time

Location : New York Employment Type : Regular Job Code : A33433 Apply to this job Share this listing: Responsibilities Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the data platform area, you will have the opportunity to manage the services and infrastructures in one of the largest data plaforms in the world that directly supports the TikTok a

ClickHouse SRE, Data Platform -USDS

TikTok

Seattle, Washington, USA

Full-time

Location : Seattle Employment Type : Regular Job Code : A211956 Apply to this job Share this listing: Responsibilities Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the data platform area, you will have the opportunity to manage the services and infrastructures in one of the largest data plaforms in the world that directly supports the TikTok a

ClickHouse SRE, Data Platform -USDS

TikTok

Los Angeles, California, USA

Full-time

Location : Los Angeles Employment Type : Regular Job Code : A30614 Apply to this job Share this listing: Responsibilities Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the data platform area, you will have the opportunity to manage the services and infrastructures in one of the largest data plaforms in the world that directly supports the TikTo

Site Reliability Engineer, Systems - Infrastructure Engineering

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A1965 Apply to this job Share this listing: Responsibilities Our Infrastructure Engineering team supports the company's fast growth by building and operating hyper-scale datacenters, managing the life cycle of server fleet, providing cloud solutions, and developing various infrastructure services and making sure they are scalable and are reliable. Roles and Responsibilities - Operate basic system infrastructures like DNS, NTP, authe

Site Reliability Engineer, Infrastructure Security- Seattle

TikTok

Seattle, Washington, USA

Full-time

Location : Seattle Employment Type : Regular Job Code : Z0733 Apply to this job Share this listing: Responsibilities Our Infrastructure Engineering team supports the company's fast growth by building and operating hyper-scale datacenters, managing the life cycle of server fleet, providing cloud solutions, and developing various infrastructure services and making sure they are scalable and are reliable. Responsibilities - Conduct security reviews of core corporate and production infrastruc

Senior Site Reliability Engineer, Product - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A215600 Apply to this job Share this listing: Responsibilities Team Intro: The Product Engineering team monitors and maintains the availability of TikTok, including services such as video playback, content discovery/recommendations, live streaming, and customer service feedback. In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that

Data Ingestion SRE, Data Platform -USDS

TikTok

Seattle, Washington, USA

Full-time

Location : Seattle Employment Type : Regular Job Code : A31281 Apply to this job Share this listing: Responsibilities Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the data platform area, you will have the opportunity to manage the services and infrastructures in one of the largest dataplaforms in the world that directly supports the TikTok app

Data Ingestion SRE, Data Platform -USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A218312 Apply to this job Share this listing: Responsibilities Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed services and infrastructures. As a site reliability engineer in the data platform area, you will have the opportunity to manage the services and infrastructures in one of the largest dataplaforms in the world that directly supports the TikTok a

Internship, Site Reliability Engineer, Applications Engineering (Fall 2025)

Tesla Motors

Fremont, California, USA

Full-time

Consider before submitting an application: This position is expected to start around September 2025 and continue through the Fall term (approximately December 2025) or into Spring 2026 if available and there is an opportunity to do so. We ask for a minimum of 12 weeks, full-time and on-site, for most internships. Our internship program is for students who are actively enrolled in an academic program. entry level candidates seeking employment after graduation and not returning to school should a

Site Reliability Engineer, Trust & Safety - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : VGGP Apply to this job Share this listing: Responsibilities Team Intro: The Trust and Safety (TnS) engineering team of US Tech Service department at TikTok is fast growing and responsible for building machine learning models and systems to identify and defend internet abuse and fraud on our platform. Our mission is to protect billions of users and publishers across the globe every day. We embrace the state-of-the-art machine learnin

Site Reliability Engineer, Systems - Infrastructure Engineering- Seattle

TikTok

Seattle, Washington, USA

Full-time

Location : Seattle Employment Type : Regular Job Code : J9448 Apply to this job Share this listing: Responsibilities Our Infrastructure Engineering team supports the company's fast growth by building and operating hyper-scale datacenters, managing the life cycle of server fleet, providing cloud solutions, and developing various infrastructure services and making sure they are scalable and are reliable. Roles and Responsibilities - Operate basic system infrastructures like DNS, NTP, authen

Network Site Reliability Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

The Enterprise Network Support and SRE team is looking to add a seasoned Technical SRE lead to help actualize the SRE vision for our network infrastructure. We are looking for an engineer who is passionate about the network and making its operation seamless with a focus on user experience. This role will offer several opportunities to solve problems by being hands-on with troubleshooting, focused on network automation, observability, documentation, and excellence in operations. This Network SRE

Enterprise Architect

TechWish

Manassas, Virginia, USA

Full-time

Job Title: Enterprise ArchitectLocation: Manassas, VA(Hybrid)Face 2 Face interviewJD:Reporting to Head of Service & Data Architecture, the Enterprise IT Monitoring Architect is responsible for the detailed technical design of real-time monitoring, alerting, and reporting of the enterprise infrastructure (Network, Servers, Storage, Databases, etc.)Enterprise Architecture ProfessionalEnterprise Monitoring and Observability ProfessionalOpenTelemetry ProfessionalSRE ProfessionalTOGAF ProfessionalDes

Software Engineer, SRE, Payments - USDS

TikTok

New York, New York, USA

Full-time

Location : New York Employment Type : Regular Job Code : A173999 Apply to this job Share this listing: Responsibilities Team Intro: The Global Payment team of the US Tech Service department of TikTok provides all-round payment solutions for the company's USA products, overseas commercialization, and the company's overseas travel and procurement, including channel access, product order design, user interaction, capital management, tax and exchange optimization, settlement Reconciliation an

Senior Site Reliability Engineer

Salesforce

San Francisco, California, USA

Full-time

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts. Job Category Software Engineering Job Details About Salesforce We're Salesforce, the Customer Company, inspiring the future of business with AI+ Data +CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too - driving you

Site Reliability Engineering - Sr. Software Development Engineer

Blue Origin, LLC

Seattle, Washington, USA

Full-time

Application close date: Applications will be accepted on an ongoing basis until the requisition is closed. At Blue Origin, we envision millions of people living and working in space for the benefit of Earth. We're working to develop reusable, safe, and low-cost space vehicles and systems within a culture of safety, collaboration, and inclusion. Join our team of problem solvers as we add new chapters to the history of spaceflight! This role is part of Enterprise Technology (ET), where we're deve

Principal AI Infrastructure SRE Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

NVIDIA has been reinventing computer graphics, PC gaming, and accelerated computing for 30 years. It is a unique legacy of innovation that's fueled by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, generative AI, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best tal

Site Reliability Engineer (Azure)

UBS AG - Investment Banking

Nashville, Tennessee, USA

Full-time

Your role Are you motivated to work in a complex, global environment where ideas are valued and effort is appreciated? We are looking for a DevOps/Cloud Site Reliability Engineer to join our team and help us to: deploy, monitor, and support applications in a Kubernetes multi-tenant environment and promote overall stability of the systems through monitoring, change control management, and implementation of best practices to maintain production environments support Gitlab Pipelines and Terraform