Site Reliability Engineer Jobs

Refine Results
1 - 20 of 996 Jobs

Site Reliability Engineer

iTvorks Inc

Reston, Virginia, USA

Contract

Job Title: Site Reliability Engineer Location: Reston, VA Duration: 24 Months Overall years of experience: 8+ years of related experience in their specific area with experience leading teams on projects with similar scope and complexity. Certifications: AWS Solutions Architect, Agile Certified Practitioner (ACP), or relevant cloud certifications. Job Description: We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a stro

Site Reliability Engineer

Purple Hires

Chicago, Illinois, USA

Contract

Position : Site Reliability EngineerLocation : Plano, TX/ Chandler, AZ/ Chicago, IL - Hybrid (3 days onsite)Duration : Long Term Job Summary:Primary skills:OpenShift, Rancher Kubernetes(RKE), Python and Shell Scripting, Linux, and Azure Cloud. ResponsibilitiesResponsible for reliability and support of Container Platform on-prem and external clouds (Azure /AWS /Google)Monitor and troubleshoot Container platform (Openshift), Rancher (RKE) and Azure (AKS) environment performance issues, connectivit

Site Reliability Engineer

Purple Hires

Plano, Texas, USA

Contract

Position : Site Reliability EngineerLocation : Plano, TX/ Chandler, AZ/ Chicago, IL - Hybrid (3 days onsite)Duration : Long Term Job Summary:Primary skills:OpenShift, Rancher Kubernetes(RKE), Python and Shell Scripting, Linux, and Azure Cloud. ResponsibilitiesResponsible for reliability and support of Container Platform on-prem and external clouds (Azure /AWS /Google)Monitor and troubleshoot Container platform (Openshift), Rancher (RKE) and Azure (AKS) environment performance issues, connectivit

Site Reliability Engineer

Avance Consulting

Bellevue, Washington, USA

Full-time

Job Description Required Qualification: At least 4 years of Information Technology experience. SRE Mindset in Production support: Proactive issue identification using observability tools. Skilled in using different monitoring & observability tools to track system performance Incident commander: Ability to diagnose complex issues and actively drive incident calls working with technical, product SMEs, and Tier 2 SREs. Experience in Splunk (including Splunk APM and Splunk O11y), AppDynamics, E

Site Reliability Engineer

Tandym Tech

New York, New York, USA

Contract

A recognized media services organization in California is currently seeking a new Site Reliability Engineer (SRE) to maintain and improve an existing system focused on linear channel delivery whilst working on a new system to modernize this process. Responsibilities: Supporting the Streaming Engineers and providing hands-on Site Reliability Engineering support to manage production observability, incident response and assisting teams with DevOps processes Managing infrastructure as code and leve

Site Reliability Engineer

Compsciprep LLC

Owings Mills, Maryland, USA

Contract

Site Reliability Engineer-Location- Owings Mills, MD (must be onsite 2 days) Duration- 6 months (with possible extension) Site Reliability Engineer- 5+ years of Site Reliability Engineering experience 3+ years of Amazon Web Services (AWS) platform experience Strong experience with Monitoring and Alerting tools such as Prometheus, Grafana, New Relic

Site Reliability Engineer

Aduril Industries

Seattle, Washington, USA

Full-time

Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century's most innovative companies to the defense industry, Anduril is changing how military systems are designed, built and sold. Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data streams into a realtime, 3D command and c

Site Reliability Engineer

Fiserv

Lincoln, Nebraska, USA

Full-time

Calling all innovators - find your future at Fiserv. We're Fiserv, a global leader in Fintech and payments, and we move money and information in a way that moves the world. We connect financial institutions, corporations, merchants, and consumers to one another millions of times a day - quickly, reliably, and securely. Any time you swipe your credit card, pay through a mobile app, or withdraw money from the bank, we're involved. If you want to make an impact on a global scale, come make a diffe

Site Reliability Engineer

Aduril Industries

Costa Mesa, California, USA

Full-time

Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century's most innovative companies to the defense industry, Anduril is changing how military systems are designed, built and sold. Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data streams into a realtime, 3D command and c

SRE

Galent

Omaha, Nebraska, USA

Full-time

SRE Location : Omaha, Nebraska (High Priority) Local preferred or Near by States . Job Summary Seasoned Site Reliability Engineer (SRE) with 5+ years of experience in supporting complex, large-scale distributed systems. Highly skilled in managing production failures, conducting root cause analysis, and driving effective remediation. Strong communicator with expertise in ing, monitoring, and release management, complemented by automation proficiency and a keen ability to learn quickly. This role

Site Reliability Engineer

Epis Data Inc

Florham Park, New Jersey, USA

Full-time

Role: Site Reliability EngineerLocation: Florham Park, NJ - Hybrid 3 days onsite(Onsite Day 1)Experience: 7+ yearsRate: $60/hr. on C2CClient: ADP **Due to client requirements, we need or candidates.** The interview will be virtual on July 11 AM or 11:30 AM EST Experience/Skills: Strong Windows and OpenStack experience Ability to analyze and resolve problems in systems, networks, software, and APIs; understanding where all sources of information can come from. Strong experience with Splunk and Dy

Site Reliability Engineer

Applied Thought Auditors & Consultants Inc.

Charlotte, North Carolina, USA

Full-time

Location: The resource is expected to follow a hybrid model reporting to the Charlotte, NC office Interview Process: Zoom interview Job Description: Job Title: Site Reliability Engineer IV Job Responsibilities: Plan and execute configuration changes to the application and infrastructureRespond to emergencies and other incidentsInvestigate issues in cloud systemsCollaborate with software developers, engineers and operations teamsParticipate in system design consulting and capacity planningPartner

Site Reliability Engineer

Randstad Digital

Jersey City, New Jersey, USA

Contract

job summary: Collaborates with a diverse set of engineers, architects, and teams to design, develop, test, and implement secure, robust, highly available and scalable solutions for Client's External Cloud PlatformCollaborates other software engineers and teams to design and implement deployment approaches using highly scalable, automated, continuous integration and continuous delivery pipelines.Responsible for all aspects of reliability, collaborates with technical experts, key stakeholders, and

Site Reliability Engineer

Jobot

Atlanta, Georgia, USA

Full-time

PE backed SaaS portfolio company streamlining platform to host all apps This Jobot Job is hosted by: Charles Simmons Are you a fit? Easy Apply now by clicking the "Apply Now" button and sending us your resume. Salary: $100,000 - $120,000 per year A bit about us: Are you a tech-savvy, innovative, and passionate Site Reliability Engineer looking for an exciting opportunity to work in a dynamic and fast-paced environment? Do you have a knack for solving complex problems? If so, we have a fantast

Site Reliability Engineer

ITECS

Frisco, Texas, USA

Third Party

Position: Site Reliability Engineer Number of positions: 7 Location: FRISCO, TX (Onsite only) Any Visa will work. Required skills Monitors the T-Cloud Platform using the T-Cloud Observability tooling, based on ServiceNow. Resolves incidents across the SDN (Cisco ACI), the CaaS layer (Red Hat OpenShift) and the Telco applications (first application is Mavenir IMS). Monitors the automation and resolves any issues with these automations (if possible). Oversees incident resolution process Escalat

Only w2 - SRE Engineer ( Local to Texas ) F2F Interview - Only 1 Round of Interview

Dornan Technologies

Irving, Texas, USA

Contract

As a Site Reliability Engineer II, you will identify and deliver automation solutions designed to ensure high availability and resiliency using your expertise in software development, complexity analysis, and scalable system design. Strong collaboration skills will be required to work closely with other engineering teams to ensure services/systems are highly stable and performant, meeting the expectations of our business partners and end users. Partner with the architecture and development team

Lead Observability Engineer Sumo Logic & SRE, Remote

Sibitalent Corp

Remote

Contract

Role : Lead Observability Engineer Sumo Logic & SRE Location : Remote Hire type : Contract JD: Experience: 10+ years (with 3+ years in Sumo Logic & Cloud-native observability) Job Summary: We are seeking a highly skilled Lead Observability Engineer to lead a critical implementation of Sumo Logic for a client migrating from Dynatrace. This role requires deep expertise in Sumo Logic, Site Reliability Engineering (SRE) practices, and Kubernetes (EKS) observability. The ideal candidate will design

SRE

Fynbosys Inc

Texas City, Texas, USA

Full-time

For SRE they need basic system monitoring, Ansible Scripting, Azure, Cloud operating Network, Python, basic understanding of the cloud. Job Description: We are seeking a dedicated Site Reliability Engineer II to join our team. In this role, you will be responsible for ensuring the reliability, availability, and performance of our systems and services. You will work closely with cross-functional teams to implement best practices and drive improvements in our infrastructure. Responsibilities- Moni

Site Reliability Engineer (SRE)

Virtuous Tech Inc

McLean, Virginia, USA

Contract

Role : Site Reliability Engineers Location : McLean, Virginia Tech Stack: Open telemetry, new relic, Splunk, other observability platforms, Python, AWSDevOps engineer who has heavy configuration management experience and pipeline automation experiencePure reliability engineering experience who has background performance engineering experienceKey responsibilities of people like implementing observing and monitoringOpen telemetry and implement full coverage of logs, metrics and traces to define al

Data Center Site Reliability Engineer (SRE)

ARROWCORE GROUP

Memphis, Tennessee, USA

Full-time

Title: Data Center Site Reliability Engineer (SRE) Location: Memphis, TN (Onsite) Duration: FTE About the Role We are seeking a Data Center Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of large-scale data center infrastructure supporting advanced AI workloads. In this role, you will collaborate with cross-functional teams to automate operations, enhance observability, and maintain high availability for distributed systems. This is a hands-on technical p