reliability engineer Jobs in san jose, ca

Refine Results
61 - 80 of 140 Jobs

Site Reliability Engineer - Recommendation Infrastructure

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A166137A Apply to this job Share this listing: Responsibilities Our Recommendation Infrastructure Team is responsible for building up and optimizing the architecture for our recommendation system to provide the most stable and best experience for our TikTok users. SREs in our team keep the systems up and running with the highest level of availability, and create highly automated systems and pipelines. What You'll Do Engage in and im

Site Reliability Engineer, Infrastructure Security

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : 3CNV Apply to this job Share this listing: Responsibilities Our Infrastructure Engineering team supports the company's fast growth by building and operating hyper-scale datacenters, managing the life cycle of server fleet, providing cloud solutions, and developing various infrastructure services and making sure they are scalable and are reliable. Responsibilities - Conduct security reviews of core corporate and production infrastruc

Site Reliability Engineer - Data Infrastructure

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A07367 Apply to this job Share this listing: Responsibilities Our data infrastructure Site Reliability Engineering (SRE) team is a pioneer in innovation. We seamlessly merge software development and infrastructure operations to design, build, and manage large-scale, highly distributed systems. We take pride in overseeing one of the industry's most extensive cloud infrastructures. As software development evolves, building systems fro

Site Reliability Engineer, Compute Platform

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A174647 Apply to this job Share this listing: Responsibilities Team Introduction Our Compute Platform SRE team supports all Big Data services and products across the company. We are a newly established team and waiting for talents like you to shape the team's future together. We are responsible for the reliability of all the company's major data warehouse products, services, and query engines. We serve business needs across domains

Technical Lead, Site Reliability Engineer, Fleetnet

Tesla Motors

Remote or Palo Alto, California, USA

Full-time

We are a small team of experts focused on creating the next-generation server-side infrastructure for Tesla. We're the invisible link connecting every Tesla product, whether it's vehicles, robots, robotaxis, chargers or even mobile apps to bring customers the best user experience possible. We're looking for strong, hands on, technical leader with domain expertise in one or more of: containers, public clouds, or private clouds. Today, over 10 million Tesla users rely on our services to safely and

Site Reliability Engineer, Edge Services- USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A124549A Apply to this job Share this listing: Responsibilities Team Insight: CDN Site Reliability Engineering combines software and network engineering with system operations to build and run large-scale, massively distributed infrastructure. Our Edge SREs ensure infrastructure services are reliable, fault-tolerant, efficiently scalable and cost-effective. We dive deep into the stack, including network, OS, and applications, to qui

Senior Site Reliability Engineer Staff, Classified Systems

Lockheed Martin Corporation

Sunnyvale, California, USA

Full-time

Job Description As a Site Reliability Engineer, you will: Design, implement, and maintain highly available and scalable systems and infrastructure to support classified applications and services Develop and implement reliability-focused engineering practices, such as continuous integration, continuous deployment, and continuous monitoring, while ensuring compliance with classified system requirements Collaborate with development teams to ensure that reliability and scalability are considered th

Site Reliability Engineer, Systems - Infrastructure Engineering

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A1965 Apply to this job Share this listing: Responsibilities Our Infrastructure Engineering team supports the company's fast growth by building and operating hyper-scale datacenters, managing the life cycle of server fleet, providing cloud solutions, and developing various infrastructure services and making sure they are scalable and are reliable. Roles and Responsibilities - Operate basic system infrastructures like DNS, NTP, authe

Site Reliability Engineer, Recommendation Infrastructure - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : JRXR2 Apply to this job Share this listing: Responsibilities About the Team The USDS TikTok Recommendations Infra SRE team works with engineering and product teams to build and run large-scale, globally distributed, observable, fault-tolerant systems. SREs on this team will deliver on production ownership and be responsible for observability and automation across complex, large-scale service mesh architectures. In order to enhance c

Senior Site Reliability Engineer, Product - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A215600 Apply to this job Share this listing: Responsibilities Team Intro: The Product Engineering team monitors and maintains the availability of TikTok, including services such as video playback, content discovery/recommendations, live streaming, and customer service feedback. In order to enhance collaboration and cross-functional partnerships, among other things, at this time, our organization follows a hybrid work schedule that

Senior Site Reliability Engineer - Data Infrastructure

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A33665 Apply to this job Share this listing: Responsibilities Our data infrastructure Site Reliability Engineering (SRE) team is a pioneer in innovation. We seamlessly merge software development and infrastructure operations to design, build, and manage large-scale, highly distributed systems. We take pride in overseeing one of the industry's most extensive cloud infrastructures. As software development evolves, building systems fro

Sr. DevOps/Site Reliability Engineer (SRE)

JKV International

Mountain View, California, USA

Contract

Job Title: Sr. DevOps/Site Reliability Engineer (SRE)Location: Mountain View, CA (Onsite)Position Type: Fulltime | Independent | H1B TransferInterview Process: Final In-Person (F2F) Interview Required About the Role:We are looking for a passionate and experienced Sr. DevOps/Site Reliability Engineer (SRE) to join our dynamic Platform Engineering team. You will work on cutting-edge cloud platforms like Azure, AWS, or Google Cloud Platform, leveraging state-of-the-art CI/CD tools to support modern

Senior Site Reliability Engineer

General Motors

Remote

Full-time

Job Description Develop and design software applications for driverless technology company. Duties may include: Build out and improve observability systems, tools and the related codebase. Contribute code, perform code reviews, and create technical designs that improve performance and reliability of observability systems using software and systems engineering skills. Partner with other Software Engineering teams to better understand use-cases and guide the engineers to use the existing tools eff

Tech lead Site Reliability Engineer, Edge - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A104498 Apply to this job Share this listing: Responsibilities Site Reliability Engineering combines software and system engineering with system operations to build and run large-scale, massively distributed infrastructure. Our Edge SREs ensure infrastructure services are reliable, fault-tolerant, efficiently scalable and cost-effective. We dive deep into the stack, including network, hardware, OS, and applications, to quickly resol

Senior Site Reliability Engineer

NVIDIA Corporation

Santa Clara, California, USA

Full-time

Join our team in Santa Clara, CA, USA as a Senior Site Reliability Engineer. At NVIDIA, you'll be part of the team shaping the future of computing and guaranteeing the smooth operation of our brand-new technologies. Our mission is to leverage AI's power to build outstanding and pioneering solutions that have a significant impact on the world. What you'll be doing: Own the solutions you build, collaborating with cross-functional teams to successfully implement them.Collaborate with various teams

Senior Site Reliability Engineer - USDS (Multiple Positions)

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A33375A Apply to this job Share this listing: Responsibilities About TikTok U.S. Data Security TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security ("USDS") is a subsidiary of TikTok in the U.S. This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep U

Site Reliability Engineer, Trust & Safety - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : VGGP Apply to this job Share this listing: Responsibilities Team Intro: The Trust and Safety (TnS) engineering team of US Tech Service department at TikTok is fast growing and responsible for building machine learning models and systems to identify and defend internet abuse and fraud on our platform. Our mission is to protect billions of users and publishers across the globe every day. We embrace the state-of-the-art machine learnin

Site Reliability Engineer - Global SRE, Monetization Technology

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : R2861 Apply to this job Share this listing: Responsibilities TikTok is one of the fastest growing apps in the world, and we're seeking Site Reliability Engineers (SREs) to join our monetization technology team. The monetization technology team works on building and running large-scale, globally distributed, fault-tolerant ads systems. SREs keep the systems up and running with the highest level of availability, ensuring our users hav

Site Reliability Engineer, Infrastructure and Assurance Services - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A236046A Apply to this job Share this listing: Responsibilities The Infra SRE-Infrastructure-Assurance team extends TikTok infrastructure's operability, observability, visibility, and automation. We aim to provide holistic insights and solutions to TikTok infrastructure with minimal manual interventions. We're young and fast-growing. Our team values transparency, collaboration, hard-work and innovations. We believe in planning and l

Data Site Reliability Engineer, Video Platform - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A30565 Apply to this job Share this listing: Responsibilities Team Intro This is a Site Reliability Engineer role, focusing on the data pipeline reliability for the Video Platform team in USDS. Data SREs monitor data and keep production batch and realtime processing jobs up and running with the highest level of availability, ensuring our users have the freshest, complete and correct data possible. In order to enhance collaboration a