Reliability Engineer Jobs in New York, NY

Refine Results
41 - 60 of 112 Jobs

Site Reliability Engineer - USDS

TikTok

New York, New York, USA

Full-time

Location : New York Employment Type : Regular Job Code : A84289 Apply to this job Share this listing: Responsibilities Site Reliability Engineering(SRE) at TikTok combines software and systems engineering to build and run large-scale, massively distributed, and fault-tolerant systems. In our team, you'll have the opportunity to manage the complex challenges of scale, while using expertise in coding, algorithms, complexity analysis, and large-scale system design. We embrace a culture of di

Site Reliability Engineer - USDS (Multiple Positions)

TikTok

New York, New York, USA

Full-time

Location : New York Employment Type : Regular Job Code : A221437A Apply to this job Share this listing: Responsibilities About TikTok U.S. Data Security TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security ("USDS") is a subsidiary of TikTok in the U.S. This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep

Senior Lead Site Reliability Engineer

JPMorgan Chase & Co.

Jersey City, New Jersey, USA

Full-time

Job Description As a Senior Lead Site Reliability Engineer at JPMorgan Chase within the Infrastructure & Production Management sector of Consumer & Community Banking, you will be tasked with closely collaborating with stakeholders to establish non-functional requirements (NFRs) and set service availability targets for various applications and product lines. Your role will be crucial in incorporating these NFRs during the design and testing stages of product development, accurately evaluating cu

Senior Site Reliability Engineer, Atlas

mongoDB, inc

New York, New York, USA

Full-time

MongoDB's mission is to empower innovators to create, transform, and disrupt industries by unleashing the power of software and data. We enable organizations of all sizes to easily build, scale, and run modern applications by helping them modernize legacy workloads, embrace innovation, and unleash AI. Our industry-leading developer data platform, MongoDB Atlas, is the only globally distributed, multi-cloud database and is available in more than 115 regions across AWS, Google Cloud, and Microsoft

Site Reliability Engineer for CIAM

Barclays

Hanover, New Jersey, USA

Full-time

Job Description Purpose of the role To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning.Resolution, analysis and response to system outages and disruptions, and implement measures to prevent si

Site Reliability Engineer / NYC / On-site

Motion Recruitment Partners, LLC

New York, New York, USA

Full-time

This is an opportunity to join a fast-paced infrastructure team focused on scaling cloud-native systems that support complex AI and data workloads. This is a full-time role based in New York City, working with AWS, Kubernetes, Helm, Terraform, Datadog, and scripting in Bash and Python to ensure reliability, automation, and observability across systems. You'll be part of a cutting-edge environment operating at the intersection of fintech and AI, helping build platforms that power smarter financia

Senior Site Reliability Engineer

General Motors

Remote

Full-time

Job Description Develop and design software applications for driverless technology company. Duties may include: Build out and improve observability systems, tools and the related codebase. Contribute code, perform code reviews, and create technical designs that improve performance and reliability of observability systems using software and systems engineering skills. Partner with other Software Engineering teams to better understand use-cases and guide the engineers to use the existing tools eff

Staff Site Reliability Engineer - Federal - 3rd Shift

ServiceNow, Inc.

Remote or San Diego, California, USA

Full-time

Company Description It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today - ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500 . Our intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work. But thi

Site Reliability Engineer

Hexaware Technologies, Inc

Remote

Full-time

Position: Site Reliability Engineer 100% Remote Hiring: Contract / Fulltime Overview: SRE with deep expertise in Azure cloud-native observability implementation, Datadog, and production incident management. Also, possess strong Azure cloud-native tech stack performance tuning skills to optimize system reliability, scalability, and efficiency. Required Skills and Qualifications: Strong experience with Azure cloud-native observability tools (e.g., Azure Monitor, Log Analytics, Application Insights

SITE RELIABILITY ENGINEER

Mindbank Consulting Group

Remote

Full-time

JOB DESCRIPTION: As a Site Reliability Engineer (SRE) on our team, you will play a critical part in ensuring our systems and services reliability, scalability, and performance. You will work closely with cross-functional teams, including engineering, cloud infrastructure, and security, to deliver resilient, observable, and compliant solutions in AWS GovCloud and commercial cloud environments. This role requires applying consultative and technical expertise to support cloud initiatives with the

Site Reliability Engineer DevOps | REMOTE (ship required)

Oracle Corporation

Remote

Full-time

Job Description Are you a creative person who loves a challenge? Solve the complex puzzles you've been dreaming of as our Engineer. If you have a passion for innovation in tech, we want you on our team! Thrive in this crucial automation role. Oracle is a technology leader that's changing how the world does business. We're looking for an experienced and self-motivated person. We appreciate you taking the time to review the list of qualifications and to apply for the position. Come and join us!

Senior Site Reliability Engineer / DevOps - REMOTE (ship required)

Oracle Corporation

Remote

Full-time

Job Description Are you a creative person who loves a challenge? Solve the complex puzzles you've been dreaming of as our Engineer. If you have a passion for innovation in tech, we want you on our team! Thrive in this crucial automation role. Oracle is a technology leader that's changing how the world does business. We're looking for an experienced and self-motivated person. We appreciate you taking the time to review the list of qualifications and to apply for the position. Come and join us!

Staff Site Reliability Engineer, Incident and Disaster

Dropbox Inc

Remote

Full-time

Dropbox is a Virtual First company. For this role, we are hiring in Zones 2 and 3. Please refer to our Compensation section below to see what neighborhoods fall under each Zone. Role Description The Incident and Disaster Team aims to reduce Customer pain by speeding up incident response through standardized incident management processes and tooling as well as through incident prevention strategies such as disaster readiness , chaos testing, safer tooling, stronger controls, automated conformanc

Site Reliability Engineer

AE Business Solutions

Remote

Full-time

AEBS is seeking a Site Reliability Engineer to take on a fully-remote, contract position! The Site Reliability Engineer must be able to work central time zone hours throughout this engagement. *No C2C inquiries, please Ideal Skills/Background: 3+ years of experience implementing Infrastructure-as-Code (IAC) with Terraform 3+ years of Python development experience 3+ years of experience in designing/developing/operating applications in the Azure cloud 2+ years of experience building and running

Senior Site Reliability Engineer

GlobalLogic Inc.

Remote

Full-time

Job Description: Design, deploy, and scale our Prometheus architecture to handle 100+ million active series and beyond.Deploy and operate large, high-performance Elasticsearch clusters holding 2000+TB of data.Deploy and grow high-throughput data pipelines built on Kafka, handling hundreds of thousands of events per second.Design and build an alerting system that allows engineering teams to construct alerts from multiple data sources and alerting workflows.Write libraries and APIs that give engin

Senior Site Reliability Engineer

Centene Corporation

Michigan, USA

Full-time

You could be the one who changes everything for our 28 million members by using technology to improve health outcomes around the world. As a diversified, national organization, Centene's technology professionals have access to competitive benefits including a fresh perspective on workplace flexibility. Position Purpose: Helps lead projects that are focused on managing and maintaining optimum platform infrastructure performance, reliability, and security using SRE practices, observability tools,

Lead-Site Reliability Engineer

Kodeva LLC

Remote

Full-time

Role - LEAD/ Senior Site Reliability Engineer Remote Full time Experience Range: 8 to 15 years Mandatory Skills: Linux, AWS or Google Cloud Platform, Kubernetes, Python/Shell or any scripting. Additional Skills (good to have) : Prometheus & Grafana, Kafka Administration, Docker, Jenkins, Terraform Location: Anywhere in USA and Canada 1. Good experience of maintaining production systems on AWS and/or Google Cloud Platform as SRE 2. Hands on with Linux and Python, Shell or any scripting language.

Site Reliability Engineer only w2

Symphony Corporation

Remote

Contract

Site Reliability Engineer 6 Months Remote only W-2 The client is looking for a site reliability engineer.

Site Reliability Engineer

General Dynamics

Remote or Aurora, Colorado, USA

Full-time

Basic Qualifications Bachelor's degree in Computer Science, a related field or equivalent experience is required plus a minimum of 5 years of relevant experience; or Master's degree plus 3 years of relevant experience. CLEARANCE REQUIREMENTS: Department of Defense TS/SCI security clearance is required at time of hire. Applicants selected will be subject to a U.S. Government security investigation and must meet eligibility requirements for access to classified information. Due to the nature of

Senior Site Reliability Engineer

Generac Power Systems Inc

Remote or Denver, Colorado, USA

Full-time

We are Generac, a leading energy technology company committed to powering a smarter world. Over the 60 plus years of Generac's history, we've been dedicated to energy innovation. From creating the home standby generator market category, to our current evolution into an energy technology solutions company, we continue to push new boundaries. Over the 60 plus years of Generac's history, we've been dedicated to energy innovation. From creating the home standby generator market category, to our cu