reliability engineer Jobs in jersey city, nj

Refine Results
41 - 60 of 117 Jobs

Site Reliability Engineer - USDS (Multiple Positions)

TikTok

New York, New York, USA

Full-time

Location : New York Employment Type : Regular Job Code : A221437A Apply to this job Share this listing: Responsibilities About TikTok U.S. Data Security TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security ("USDS") is a subsidiary of TikTok in the U.S. This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep

Senior Lead Site Reliability Engineer

JPMorgan Chase & Co.

Jersey City, New Jersey, USA

Full-time

Job Description Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability. As a Principal Site Reliability Engineer at JPMorgan Chase within the Enterprise technology, Employee Platforms team, you work with your fellow stakeholders to define non-functional requirements (NFRs) and availability targets for the services in your application and product lines. You will ensure those

Senior Site Reliability Engineer, Atlas

mongoDB, inc

New York, New York, USA

Full-time

MongoDB's mission is to empower innovators to create, transform, and disrupt industries by unleashing the power of software and data. We enable organizations of all sizes to easily build, scale, and run modern applications by helping them modernize legacy workloads, embrace innovation, and unleash AI. Our industry-leading developer data platform, MongoDB Atlas, is the only globally distributed, multi-cloud database and is available in more than 115 regions across AWS, Google Cloud, and Microsoft

Site Reliability Engineer for CIAM

Barclays

Hanover, New Jersey, USA

Full-time

Job Description Purpose of the role To apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them. Accountabilities Availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning.Resolution, analysis and response to system outages and disruptions, and implement measures to prevent si

Site Reliability Engineer / NYC / On-site

Motion Recruitment Partners, LLC

New York, New York, USA

Full-time

This is an opportunity to join a fast-paced infrastructure team focused on scaling cloud-native systems that support complex AI and data workloads. This is a full-time role based in New York City, working with AWS, Kubernetes, Helm, Terraform, Datadog, and scripting in Bash and Python to ensure reliability, automation, and observability across systems. You'll be part of a cutting-edge environment operating at the intersection of fintech and AI, helping build platforms that power smarter financia

Senior Site Reliability Engineer

General Motors

Remote

Full-time

Job Description Develop and design software applications for driverless technology company. Duties may include: Build out and improve observability systems, tools and the related codebase. Contribute code, perform code reviews, and create technical designs that improve performance and reliability of observability systems using software and systems engineering skills. Partner with other Software Engineering teams to better understand use-cases and guide the engineers to use the existing tools eff

Staff Site Reliability Engineer - Federal - 3rd Shift

ServiceNow, Inc.

Remote or San Diego, California, USA

Full-time

Company Description It all started in sunny San Diego, California in 2004 when a visionary engineer, Fred Luddy, saw the potential to transform how we work. Fast forward to today - ServiceNow stands as a global market leader, bringing innovative AI-enhanced technology to over 8,100 customers, including 85% of the Fortune 500 . Our intelligent cloud-based platform seamlessly connects people, systems, and processes to empower organizations to find smarter, faster, and better ways to work. But thi

Site Reliability Engineer

Hexaware Technologies, Inc

Remote

Full-time

Position: Site Reliability Engineer 100% Remote Hiring: Contract / Fulltime Overview: SRE with deep expertise in Azure cloud-native observability implementation, Datadog, and production incident management. Also, possess strong Azure cloud-native tech stack performance tuning skills to optimize system reliability, scalability, and efficiency. Required Skills and Qualifications: Strong experience with Azure cloud-native observability tools (e.g., Azure Monitor, Log Analytics, Application Insights

Site Reliability Engineer DevOps | REMOTE (ship required)

Oracle Corporation

Remote

Full-time

Job Description Are you a creative person who loves a challenge? Solve the complex puzzles you've been dreaming of as our Engineer. If you have a passion for innovation in tech, we want you on our team! Thrive in this crucial automation role. Oracle is a technology leader that's changing how the world does business. We're looking for an experienced and self-motivated person. We appreciate you taking the time to review the list of qualifications and to apply for the position. Come and join us!

SITE RELIABILITY ENGINEER

Mindbank Consulting Group

Remote

Full-time

JOB DESCRIPTION: As a Site Reliability Engineer (SRE) on our team, you will play a critical part in ensuring our systems and services reliability, scalability, and performance. You will work closely with cross-functional teams, including engineering, cloud infrastructure, and security, to deliver resilient, observable, and compliant solutions in AWS GovCloud and commercial cloud environments. This role requires applying consultative and technical expertise to support cloud initiatives with the

Senior Site Reliability Engineer / DevOps - REMOTE (ship required)

Oracle Corporation

Remote

Full-time

Job Description Are you a creative person who loves a challenge? Solve the complex puzzles you've been dreaming of as our Engineer. If you have a passion for innovation in tech, we want you on our team! Thrive in this crucial automation role. Oracle is a technology leader that's changing how the world does business. We're looking for an experienced and self-motivated person. We appreciate you taking the time to review the list of qualifications and to apply for the position. Come and join us!

Staff Site Reliability Engineer, Incident and Disaster

Dropbox Inc

Remote

Full-time

Dropbox is a Virtual First company. For this role, we are hiring in Zones 2 and 3. Please refer to our Compensation section below to see what neighborhoods fall under each Zone. Role Description The Incident and Disaster Team aims to reduce Customer pain by speeding up incident response through standardized incident management processes and tooling as well as through incident prevention strategies such as disaster readiness , chaos testing, safer tooling, stronger controls, automated conformanc

Site Reliability Engineer

AE Business Solutions

Remote

Full-time

AEBS is seeking a Site Reliability Engineer to take on a fully-remote, contract position! The Site Reliability Engineer must be able to work central time zone hours throughout this engagement. *No C2C inquiries, please Ideal Skills/Background: 3+ years of experience implementing Infrastructure-as-Code (IAC) with Terraform 3+ years of Python development experience 3+ years of experience in designing/developing/operating applications in the Azure cloud 2+ years of experience building and running

Site Reliability Engineer

Madison-Davis, LLC

Remote

Contract

Role: Drive the technical implementation of monitoring and alerting strategies across enterprise-scale applications and infrastructure.Collaborate directly with development teams to ensure each new initiative includes the correct telemetry, log tagging, and alert payloads.Act as a liaison to Level 2 and Level 3 support teams to maintain and enhance monitoring dashboards used by the enterprise command center (EMC).Standardize alert formats to ensure consistency with SRE policies and support downs

Site Reliability Engineer only w2

Symphony Corporation

Remote

Contract

Site Reliability Engineer 6 Months Remote only W-2 The client is looking for a site reliability engineer.

Senior Site Reliability Engineer

GlobalLogic Inc.

Remote

Full-time

Job Description: Design, deploy, and scale our Prometheus architecture to handle 100+ million active series and beyond.Deploy and operate large, high-performance Elasticsearch clusters holding 2000+TB of data.Deploy and grow high-throughput data pipelines built on Kafka, handling hundreds of thousands of events per second.Design and build an alerting system that allows engineering teams to construct alerts from multiple data sources and alerting workflows.Write libraries and APIs that give engin

Site Reliability Engineer II

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Are you excited by the opportunity to monitor and produce internet solutions? Would being free to innovate Would being free to innovate excite you? Join our highly skilled security Join our highly skilled Security team Our Team develops and sells Akamai's carrier network security products to fixed and mobile network service providers. We specialize in delivering highly scalable network infrastructure and access-based security products to our customers. We collaborate to enable our customers t

Site Reliability Engineer

General Dynamics

Remote or Aurora, Colorado, USA

Full-time

Basic Qualifications Bachelor's degree in Computer Science, a related field or equivalent experience is required plus a minimum of 5 years of relevant experience; or Master's degree plus 3 years of relevant experience. CLEARANCE REQUIREMENTS: Department of Defense TS/SCI security clearance is required at time of hire. Applicants selected will be subject to a U.S. Government security investigation and must meet eligibility requirements for access to classified information. Due to the nature of

Senior Site Reliability Engineer

Generac Power Systems Inc

Remote or Denver, Colorado, USA

Full-time

We are Generac, a leading energy technology company committed to powering a smarter world. Over the 60 plus years of Generac's history, we've been dedicated to energy innovation. From creating the home standby generator market category, to our current evolution into an energy technology solutions company, we continue to push new boundaries. Over the 60 plus years of Generac's history, we've been dedicated to energy innovation. From creating the home standby generator market category, to our cu

Site Reliability Engineer (Amdocs)

Highbrow

Remote

Full-time

Key Responsibilities Design, build, and maintain scalable, reliable, and secure infrastructure across production and staging environments. Automate operational tasks and processes using code (Python, Go, Bash, etc.). Drive infrastructure as code (IaC) practices using tools like Terraform, Ansible, or similar. Monitor, troubleshoot, and improve system availability, latency, and performance. Collaborate closely with development, QA, and product teams to design scalable system architecture. Conduct