reliability engineer Jobs

Refine Results
461 - 480 of 726 Jobs

Senior Staff Machine Learning Engineer - DevOps/Site Reliability Engineer

ServiceNow, Inc.

Santa Clara, California, USA

Full-time

Company Description It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today - ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500 . Our intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work. But thi

Senior Site Reliability Engineer - SAP Basis and Cloud Operations

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology-and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.

Tech Lead Manager, Site Reliability Engineer, Product - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A73877A Apply to this job Share this listing: Responsibilities The USDS TikTok Product Engineering SRE team works with engineering and product teams to build, maintain and run large-scale, globally distributed, observable, fault-tolerant systems. SREs on this team will deliver on production ownership and be responsible for observability and automation across complex, large-scale service mesh architectures. In order to enhance collab

Site Reliability Engineer (Middleware Platforms Kafka/Tibco EMS/API)

Euclid Innovations

Fort Mill, South Carolina, USA

Third Party, Contract

Job Description:We are seeking an experienced Site Reliability Engineer (SRE) with strong middleware platform expertise to enhance monitoring, reliability, and operational excellence across enterprise systems. The ideal candidate will have hands-on experience with Kafka, Tibco EMS, and API platforms, and a proven track record in monitoring automation and dashboard optimization. Key Responsibilities: Build specifications for a single pane of glass solution for middleware monitoring across compone

Sr. AWS DevOps Engineer

SYSTEM SOFT TECHNOLOGIES LLC

Denver, Colorado, USA

Contract

Onsite position - Denver, Colorado No Sponsorship No Corp to Corp DEVOPS ENGINEER REQUIREMENTS:Must possess prior and recent experience Engineering/Architecting AWS cloud platforms as a Cloud Engineer, Site Reliability Engineer, or DevOps Engineer within a professional and ideally enterprise environmentProven expertise with AWS, particularly with AWS Core Services such as but not needing all: EC2 (Elastic Compute Cloud), ECS (Elastic Container Service), EKS (Elastic Kubernetes Service), RDS (R

Software Engineer, SRE

FIS

Georgia, USA

Full-time

Job Description About FIS Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and relevant issues in financial services and technology. Our talented people empower us, and we believe in being part of a team that is open, collaborative, entrepreneurial, passionate and above all fun. Atlanta, GA Hybrid (two days in-office, three days virtual) Current and future sponsorship are not available for this position About the Tea

Site Reliability Engineer (SRE) - USDS

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A259032A Apply to this job Share this listing: Responsibilities The security team is missioned to run and operate security infrastructures, platforms and technologies, as well as to support cross-functional teams to protect our users, products and infrastructures. In this team you'll have a unique opportunity to have first-hand exposure to the strategy of the company in key security initiatives, especially in deploying and maintaini

Linux Administator/Linux Engineer/Unix,Linux Administrator

Introlligent Inc.

Elk Grove, California, USA

Third Party, Contract

Title: SRE (Site Reliability Engineer) Location: Elk Grove, CA (Hybrid) Duration: 12months Client : Apple Minimum Qualifications: Strong sense of ownership, customer service, and integrity demonstrated through clear communication.Experience with deploying, supporting and monitoring new and existing services, platforms, and application stacks.Proficiency with Python, Bash scripts, GO, REST APIS, and any object oriented programming a must.Proficiency in using monitoring and observability tools suc

Mid Level SRE

Motion Recruitment Partners, LLC

Portsmouth, New Hampshire, USA

Full-time

A tech-driven organization based in Portsmouth, NH is hiring a Mid-Level Site Reliability Engineer (SRE) to join their NetOps Team. This is a full-time, on-site role focused on building and supporting infrastructure for a high-scale network monitoring and observability platform. You will be working at a scale that is difficult to find at most companies outside of the FAANG organizations. You'll be working with Terraform modules, Helm chart deployments, and Kubernetes environments, not simply co

Space Force - Site Reliability Engineer (SRE) - Advanced Software Engineer

General Dynamics

Scottsdale, Arizona, USA

Full-time

Basic Qualifications Requires a Bachelor's degree in Systems Engineering, or a related Science, Engineering or Mathematics field. Also requires 5+ years of job-related experience, or a Master's degree plus 3 years of job-related experience. Agile experience preferred. CLEARANCE REQUIREMENTS: Department of Defense Secret security clearance is preferred at time of hire. Candidates must be able to obtain a Secret clearance within a reasonable amount of time from date of hire. Applicants selected

Senoir Compute SRE

Apple, Inc.

No location provided

Full-time

People at Apple don't just build products - they craft the kind of experience that has revolutionized entire industries. The diverse collection of our people and their ideas inspire innovation in everything we do. Imagine what you could do here! Join Apple, and help us leave the world better than we found it. The Apple Services Engineering(ASE) team builds and provides systems and infrastructure that power Apple's services (such as iCloud, iTunes, Siri, and Maps). We are the foundation on which

Compute SRE

Apple, Inc.

No location provided

Full-time

People at Apple don't just build products - they craft the kind of experience that has revolutionized entire industries. The diverse collection of our people and their ideas inspire innovation in everything we do. Imagine what you could do here! Join Apple, and help us leave the world better than we found it. The Apple Services Engineering(ASE) team builds and provides systems and infrastructure that power Apple's services (such as iCloud, iTunes, Siri, and Maps). We are the foundation on which

Senior Site Reliability Engineer

Apple, Inc.

No location provided

Full-time

The Apple Services Engineering (ASE) team is one of the most exciting examples of Apple's long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. And they do it on a massive scale, meeting Apple's high expectations with high performance to deliver a huge variety of entertainment in over 35 languages to more than 150 countries. These engineers build secure, end-to-end solutions. They develop the cust

Site Reliability/DevOps Engineer

Judge Group, Inc.

Montgomery, Ohio, USA

Contract

Location: Montgomery, OH Salary: $65.00 USD Hourly - $75.00 USD Hourly Description: Senior Site Reliability/DevOps Engineer About the Role As a Senior Site Reliability/DevOps Engineer, you will design, build, and maintain scalable infrastructure and deployment systems across cloud, on-premises, and retail environments. You'll collaborate closely with software engineering teams to automate processes, enhance system reliability, and ensure operational excellence. This role requires a strong

Site Reliability/DevOps Engineer

Kforce Technology Staffing

Blue Ash, Ohio, USA

Contract, Third Party

RESPONSIBILITIES: Kforce has a client in Blue Ash, OH that is seeking a Site Reliability/DevOps Engineer who will specialize in developing scalable methods for building, deploying, and supporting cloud, onprem and store focused enterprise services and systems. They will work closely with Software Engineers to deploy and operate solutions, automate and streamline processes, build and maintain tools for deployment, monitoring of platform, and troubleshoot and resolve issues in development, test, a

Senior Site Reliability Engineer (SRE) - iCloud

Apple, Inc.

No location provided

Full-time

The Apple Service Engineering - Edge & Messaging SRE team is looking for Site Reliability Engineers to build and run the services that hundreds of millions of customers use every day. This team provides systems that are foundational for many of Apple's services such as iCloud, iMessage, and FaceTime, and more. The best candidates will have both demonstrated Software Development skills and strong Linux / Systems / Cloud expertise. Our customers count on us to provide extraordinary availability, s

Senior Site Reliability Engineer, Observability

mongoDB, inc

New York, New York, USA

Full-time

MongoDB's mission is to empower innovators to create, transform, and disrupt industries by unleashing the power of software and data. We enable organizations of all sizes to easily build, scale, and run modern applications by helping them modernize legacy workloads, embrace innovation, and unleash AI. Our industry-leading developer data platform, MongoDB Atlas, is the only globally distributed, multi-cloud database and is available in more than 115 regions across AWS, Google Cloud, and Microsoft

Senior Site Reliability Engineer, Observability

mongoDB, inc

No location provided

Full-time

MongoDB's mission is to empower innovators to create, transform, and disrupt industries by unleashing the power of software and data. We enable organizations of all sizes to easily build, scale, and run modern applications by helping them modernize legacy workloads, embrace innovation, and unleash AI. Our industry-leading developer data platform, MongoDB Atlas, is the only globally distributed, multi-cloud database and is available in more than 115 regions across AWS, Google Cloud, and Microsoft

Site Reliability Engineer Graduate (Compute Platform) - 2026 Start (BS/MS)

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A01156 Apply to this job Share this listing: Responsibilities We are looking for talented individuals to join our team in 2026. As a graduate, you will get opportunities to pursue bold ideas, tackle complex challenges, and unlock limitless growth. Launch your career where inspiration is infinite at TikTok. Team introduction Our Compute Platform SRE team supports all Big Data services and products across the company. We are a newly e

Site Reliability Engineer | Linux Platform Development | Experienced Hire

Susquehanna International Group

Pennsylvania, USA

Full-time

Overview Susquehanna is seeking a Linux Engineer to design and build infrastructure for trading platforms, working closely with network engineers and software developers to create scalable systems. You'll optimize Linux infrastructure, including trading systems, research compute, databases, and support systems. What You'll Do: Automate and Evolve: Contribute to our library of home-grown tools, written primarily in Python and Bash, utilizing tools such as Jenkins to automate bare-metal and virtua