Lead Site Reliability Engineer Jobs in Raleigh, NC

21 - 37 of 37 Jobs

Senior Site Reliability Engineer - Observability (FedRAMP IL5)

Splunk Inc.

North Carolina, USA

Full-time

Description Join us as we pursue our ground-breaking vision to make machine data accessible, usable, and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk, we are committed to our work, customers, having fun, and most significantly to each other's success. The Splunk Observability Cloud provides full-fidelity monitoring and fixing across infrastructure, applications, and user inter

Senior Site Reliability Engineer, Observability, FedRAMP

Splunk Inc.

California, USA

Full-time

Description Splunk, a Cisco company, is building a safer and more resilient digital world with an end-to-end full stack platform made for a hybrid, multi-cloud world. Leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. Our customers love our technology, but it's our caring employees that make Splunk stand out as an amazing career destination. No matter where in the world or what level of the organization, we approach our wor

Senior Site Reliability Engineer, Test Platform - REMOTE

Cisco Systems, Inc.

Remote or San Francisco, California, USA

Full-time

At Cisco Meraki, we create magic through the energy and passion of our employees, who shape our dynamic community and empower us to solve problems for our customers. This magic unfolds when technology becomes intuitive, functions as intended, and when every individual is valued. By providing our employees with the autonomy to make an impact, we strive to fulfill our mission of simplifying technology so our customers can focus on what matters most to them - whether it's their students, patients, cu

Senior Azure SRE

Kforce Technology Staffing

Remote or Tampa, Florida, USA

Contract

RESPONSIBILITIES: Kforce has a client in Tampa, FL that is seeking a highly skilled Senior Infrastructure Engineer to drive the design, automation, and optimization of cloud infrastructure supporting the firm's core technologies and applications. Acting as a key technical expert, you'll ensure our platforms are scalable, resilient, and aligned with strategic IT initiatives. Responsibilities: * Design and automate infrastructure management to improve system reliability, scalability, and performa

Sr. Site Reliability Engineer, Bare Metal, Infrastructure

Tesla Motors

Remote or Austin, Texas, USA

Full-time

Tesla cloud as a service seeks a high impact Site Reliability Engineer (SRE) to support our bare-metal provisioning platform at scale. You'll provide direct support to internal customers, resolve complex provisioning issues, and escalate systemic problems to engineering. Your focus: ensuring reliable, automated delivery of bare-metal infrastructure using Kubernetes, Metal , and industry standard tooling across diverse hardware from Supermicro, HPE, and Dell. Responsibilities Provide frontline s

Senior Site Reliability Engineer-FedRAMP (FULLY REMOTE)

Splunk Inc.

California, USA

Full-time

Description Join us as we pursue our ground-breaking vision to make machine data accessible, usable, and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk, we are committed to our work, customers, having fun, and most significantly to each other's success. The Splunk Observability Cloud provides full-fidelity monitoring and fixing across infrastructure, applications, and user interf

Principal Site Reliability Engineer (Safety) - Nashville, TN Hybrid

Oracle Corporation

Remote

Full-time

Job Description We offer unique opportunities for smart, hands-on engineers with the expertise and passion to solve difficult architecture, engineering, and process problems. Our customers run their businesses on our cloud, and our mission is to provide them with the most secure cloud services. Our ideal candidate is a site reliability or devops engineer with expertise and passion in finding and improving how services are deployed and operated. If this is you, joining Oracle Cloud Infrastructur

Sr. DevOps/Site Reliability Engineer

MTW Recruit

Remote

Full-time

No 3rd party inquiries will be processed. This is a 100% remote role in the Eastern Standard Time zone, with preference for EST and CST zones. Seeking a talented DevOps/Site Reliability Engineer (SRE) with expertise in Kubernetes, Terraform, Azure, and observability tools like Datadog to deploy and manage scalable, reliable infrastructure. Requirements: Minimum of 7 years of DevOps experience, preferably with SRE background. Terraform and Kubernetes are absolutely required. Strong problem-solving, communicat

Senior Site Reliability Engineer with Kubernetes - W2 - Remote in EST hours (Posted by SAM)

Global Force USA

Remote

Contract

Requirements: 4+ years of experience working within a cloud engineer/SRE role. Expert knowledge of a cloud service provider. Expert knowledge and hands-on production experience in Kubernetes (bare metal or managed) cluster setup and management required. Experience with infrastructure as code (IaC) tools like Terraform, Pulumi. Experience with Kubernetes deployment tools like Helm, ArgoCD, Flux. Strong awareness of networking and internet protocols. Understanding of identity and access management (IAM). Ex

Sr. Spclst, Cloud Engineering

Merck & Company Inc

Remote or Rahway, New Jersey, USA

Full-time

Job Description We are looking for an experienced and enthusiastic Senior Site Reliability Engineer to join our Agile Planning Product Team. As part of the DevXOps Product Line, you will enable product teams to deliver value faster to the business by improving platform services that accelerate agile development projects. You will design scalable solutions, create CI/CD pipelines, and implement automation to enhance reliability and efficiency. As a Senior Reliability Engineer, you will: Become

AI/ML Site Reliability Engineer (SRE)

Lockheed Martin Corporation

Remote or King of Prussia, Pennsylvania, USA

Full-time

Job Description Space is a critical domain, connecting our technologies, our security and our humanity. While others view space as a destination, we see it as a realm of possibilities, where we can do more - we can innovate, invest, inspire and integrate our capabilities to transform the future. At Lockheed Martin Space, we aim to harness the full potential of space to cultivate innovation, reduce costs, and push the boundaries of what technology can achieve. We're creating future-ready solutio

Principal Application Engineer (SRE)

DISCOVER

Remote or Riverwoods, Illinois, USA

Full-time

Discover. A brighter future. With us, you'll do meaningful work from Day 1. Our collaborative culture is built on three core behaviors: We Play to Win, We Get Better Every Day & We Succeed Together. And we mean it - we want you to grow and make a difference at one of the world's leading digital banking and payments companies. We value what makes you unique so that you have an opportunity to shine. Come build your future, while being the reason millions of people find a brighter financial future

Principal Network Site Reliability Engineer - OCI (REMOTE)

Oracle Corporation

Remote

Full-time

Job Description The Oracle Cloud Infrastructure (OCI) delivers mission-critical applications for top tier enterprises around the world. Our cloud offers unmatched hyper-scale, multi-tenant services deployed in more than 40 regions worldwide. The mission of our Network Reliability Engineering team is to provide services that allow our customers to drive operational excellence in OCI networks at scale. Our customers want auto-remediation of incidents, touchless and automated operations such as up

Sr Staff Software Engineer, Reliability Engineering

Airbnb

Remote

Full-time

Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every country across the globe. Every day, hosts offer unique stays and experiences that make it possible for guests to connect with communities in a more authentic way. The Community You Will Join: We are a community based on connection and belonging - a community that was born in 2007 when two hosts welc

Senior Dev Operations Engineer SRE

Buxton Consulting

Remote

Contract

Position: Senior Dev Operations Engineer SRE. Duration: 1+ Year. Remote W2 contract. Job Description: Top 3 Must Haves: Experience setting up alerts / alarms / notifications in AWS cloud (CloudWatch / Dynatrace). Experience with AWS solutions using AWS services including Kafka, ECS, EKS. Experience with IaC (Infrastructure as Code), CDK or Terraform. TECHNICAL KNOWLEDGE AND SKILLS: 6+ years of overall IT experience. 4+ years of AWS Cloud management experience with below skill set: AWS Certified DevOps and

Senior Manager Application Integration (SRE)

DISCOVER

Remote or Riverwoods, Illinois, USA

Full-time

Discover. A brighter future. With us, you'll do meaningful work from Day 1. Our collaborative culture is built on three core behaviors: We Play to Win, We Get Better Every Day & We Succeed Together. And we mean it - we want you to grow and make a difference at one of the world's leading digital banking and payments companies. We value what makes you unique so that you have an opportunity to shine. Come build your future, while being the reason millions of people find a brighter financial future

site reliability

3S Business Corporation Inc.

US

Full-time, Part-time, Contract, Third Party

Request-ID: 6211 Sr. Site Reliability Engineer USA-Alpharetta-Lexis Client Job Description: UST Global is looking for an experienced and passionate Sr. Site Reliability Engineer to join our engineering team and help us to oversee the assessment and management of the reliability of operations that could impact a product or business. A Sr. Site Reliability Engineer oversees the assessment and management of the reliability of application operations that could impact one or more application servic