site reliability engineer Jobs in nyc, ny

Refine Results
41 - 60 of 203 Jobs

Senior Site Reliability Engineer - Observability and Telemetry Platform

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and availability using the combination of software and systems engineering practices. This is a highly specialized discipline which demands knowledge across different systems, networking, coding, database, capacity management, continuous delivery and deployment and open source cloud enabling technologies like Kubernetes and OpenStack. SRE at

Cloud Site Reliability Engineer - Azure/AWS (34084)

Myticas LLC

Remote or CA

Contract

Cloud Site Reliability Engineer - AWS & Azure Responsibilities Oversee the design and improvement of infrastructure using SRE best practices, including IaC, recovery automation, and systems that detect and resolve issues independently. Manage and fine-tune critical services across both cloud and on-prem environments: Kubernetes clusters, CI/CD pipelines, artifact registries, and custom workloads. Enhance observability through intelligent logging, metrics, tracing, and alerting. Ensuring systems

Engineering (SRE) Leader

EPAM Systems

Remote

Full-time

We are looking for a highly skilled and motivated Engineering (SRE) Leader with strong Cloud expertise to join our team. This is a unique opportunity to collaborate with dedicated professionals across the globe, drive impactful projects, and cultivate a high-performing engineering team. In this role, you will oversee people management, drive solution architecture, and ensure successful project delivery while empowering clients with cutting-edge Cloud solutions. If you are an innovative technol

Senior Site Reliability Engineer, NIM Factory

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology-and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent.

Senior Site Reliability Engineer

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you like collaborating across teams to solve complex problems? Do you have a passion for cutting edge technologies and tackling system problems? Join our critical Nameserver SRE team Our team is responsible for defining, measuring, publishing and optimizing key performance indicators of Akamai's nameserver platform. We take a holistic view of complex systems and identify the measures which matter most to customers. Partnering with multiple teams we address difficult problems that go beyond

Principal Site Reliability Engineer, AI Infrastructure

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you! NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for over 30 years. It's an outstanding legacy of innovation that's fueled by phenomenal technology and exceptional people. Today, we're tapping into the unlimited potenti

Principal Architect, Site Reliability Engineering - GeForce Now

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

NVIDIA is the world leader in accelerated computing-from gaming to data centers to AI and robotics. We are a team of trailblazers reinventing computing at the intersection of graphics, high-performance computing, and AI. If you're driven to tackle sophisticated challenges, push boundaries, and build technology that powers the future, NVIDIA is the place for you. We are looking for an expert and transformative Principal Architect for Site Reliability Engineering (SRE) to join our GeForce Now Engi

Senior Site Reliability Engineer

Johnson & Johnson

Remote or Santa Clara, California, USA

Full-time

At Johnson & Johnson, we believe health is everything. Our strength in healthcare innovation empowers us to build a world where complex diseases are prevented, treated, and cured, where treatments are smarter and less invasive, and solutions are personal. Through our expertise in Innovative Medicine and MedTech, we are uniquely positioned to innovate across the full spectrum of healthcare solutions today to deliver the breakthroughs of tomorrow, and profoundly impact health for humanity. Learn m

Cyber Recovery Site Reliability Engineer (Virtualization)

Allstate Insurance Company

Remote

Full-time

At Allstate, great things happen when our people work together to protect families and their belongings from life's uncertainties. And for more than 90 years our innovative drive has kept us a step ahead of our customers' evolving needs. From advocating for seat belts, air bags and graduated driving laws, to being an industry leader in pricing sophistication, telematics, and, more recently, device and identity protection. Job Description **We have 1 position on this team focused in Virtualizatio

Cyber Recovery Site Reliability Engineer (Networking)

Allstate Insurance Company

Remote

Full-time

At Allstate, great things happen when our people work together to protect families and their belongings from life's uncertainties. And for more than 90 years our innovative drive has kept us a step ahead of our customers' evolving needs. From advocating for seat belts, air bags and graduated driving laws, to being an industry leader in pricing sophistication, telematics, and, more recently, device and identity protection. Job Description **We have 1 position on this team focused in Networking &

Senior Site Reliability Engineer , Scalability

Cisco Systems, Inc.

Remote

Full-time

Application window is open until further notice. In Meraki SRE we build the highly scalable cloud infrastructure that supports millions of Meraki devices worldwide. Meraki's customer base has grown by a factor of 2-3 every year, serving more than 8 billion HTTP requests per day across ten data centers! Our customers depend on our products to run their critical infrastructure of network switches (now including Cisco Catalyst in addition to the Meraki switches), security appliances, wireless APs,

Cyber Recovery Site Reliability Engineer (Network Security)

Allstate Insurance Company

Remote

Full-time

At Allstate, great things happen when our people work together to protect families and their belongings from life's uncertainties. And for more than 90 years our innovative drive has kept us a step ahead of our customers' evolving needs. From advocating for seat belts, air bags and graduated driving laws, to being an industry leader in pricing sophistication, telematics, and, more recently, device and identity protection. Job Description **We have 1 position on this team focused in Network Secur

Cyber Recovery Site Reliability Engineer (Automation)

Allstate Insurance Company

Remote

Full-time

At Allstate, great things happen when our people work together to protect families and their belongings from life's uncertainties. And for more than 90 years our innovative drive has kept us a step ahead of our customers' evolving needs. From advocating for seat belts, air bags and graduated driving laws, to being an industry leader in pricing sophistication, telematics, and, more recently, device and identity protection. Job Description **We have 1 position on this team focused in Automation &

Cyber Recovery Site Reliability Engineer (Data Protection)

Allstate Insurance Company

Remote

Full-time

At Allstate, great things happen when our people work together to protect families and their belongings from life's uncertainties. And for more than 90 years our innovative drive has kept us a step ahead of our customers' evolving needs. From advocating for seat belts, air bags and graduated driving laws, to being an industry leader in pricing sophistication, telematics, and, more recently, device and identity protection. Job Description **We have 1 position on this team focused in Data Protecti

Site Reliability Engineer - Platform

Coinbase

Remote

Full-time

Ready to be pushed beyond what you think you're capable of? At Coinbase, our mission is to increase economic freedom in the world. It's a massive, ambitious opportunity that demands the best of us, every day, as we build the emerging onchain platform - and with it, the future global financial system. To achieve our mission, we're seeking a very specific candidate. We want someone who is passionate about our mission and who believes in the power of crypto and blockchain technology to update the

Problem Manager - Cloud Ops & SRE

Verint Systems Inc

Remote

Full-time

Job Description At Verint, we believe customer engagement is the core of every global brand. Our mission is to help organizations discover opportunities previously only scarcely imagined by connecting work, data, and experiences enterprise wide. We hire innovators with the passion, creativity, and drive to answer constantly shifting market challenges and deliver impactful results for our customers. Our commitment to attracting and retaining a talented, diverse, and engaged team creates a collab

Senior Software Engineer - Site Reliability Engineering (Hybrid)

Citizens Bank

Remote or Plano, Texas, USA

Full-time

Job Description 3 Days Hybrid from any of our locations in RI, Iselin NJ, MA, Pittsburgh PA, Dallas TX or Phoenix AZ Role is not relocation eligible. At Citizens, we're more than a bank! Here, you'll experience new things, create new opportunities, think beyond your role and make an impact! As a Sr. Software Engineer, you will work alongside and mentor a group of talented engineers as you pursue a broad range of initiatives. By leveraging your technical skills and thirst for innovation, you wi

Site Reliability Engineer - Remote In United States

Mindbank Consulting Group

Remote

Full-time

Mindbank Consulting Group has an immediate direct-hire opportunity for an experienced Site Reliability Engineer (SRE). This is a fully remote position; work must be performed within the United States. Salary up to $115K; excellent benefits are available. Candidates must be authorized to work for ANY employer in the United States.Mindbank does NOT offer sponsorship at any time.Mindbank does NOT partner/work with third parties. In this role, you will play a critical part in ensuring our client s

Senior Site Reliability Engineer, Core AI Infrastructure

Coinbase

Remote

Full-time

Ready to be pushed beyond what you think you're capable of? At Coinbase, our mission is to increase economic freedom in the world. It's a massive, ambitious opportunity that demands the best of us, every day, as we build the emerging onchain platform - and with it, the future global financial system. To achieve our mission, we're seeking a very specific candidate. We want someone who is passionate about our mission and who believes in the power of crypto and blockchain technology to update the

Senior Site Reliability Engineer (FedRAMP)

GlobalLogic Inc.

Remote

Full-time

Job Description: 5+ years of experience working on Linux-based infrastructure5+ years of experience developing with object-oriented programming languages like Python and RubyExperience with compliance programs (PCI-DSS, FedRAMP, SOC1/2, etc) and Security Framework/Standards (NIST SP800, CSF, etc.)The successful applicant will be performing work in a FedRAMP moderate environmentExperience with AWS and Google Cloud environmentsExperience or willingness to work in an agile environment (Scrum, Kanba