senior site reliability engineer Jobs

Refine Results
121 - 140 of 185 Jobs

Site Reliability Engineer (SRE) Google Cloud Platform/Kubernetes/Dynatrace

Talent Groups

Chicago, Illinois, USA

Contract

Job Description:We are seeking an experienced Site Reliability Engineer (SRE) to join our team in a production support capacity. The ideal candidate should have hands-on experience with Google Cloud Platform, Kubernetes, Dynatrace, and familiarity with log monitoring tools like Splunk or Sumo Logic. The role demands strong incident response, dashboard monitoring, RCA preparation, and client coordination. Key Responsibilities:Manage day-to-day production support activities including incident reso

SRE DevOps Engineer

VDart, Inc.

Bellevue, Washington, USA

Contract, Third Party

Title: SRE DevOps Engineer Location: Bellevue, WA (Initial remote) Duration: Long Term Job Description: Seeking an experienced SRE DevOps Engineer with a strong focus on Microsoft Azure services.Design, deploy, and manage robust cloud infrastructure primarily utilizing various Azure services.Expertly manage and operate Azure Kubernetes Service (AKS) clusters for containerized workloads.Develop and maintain ARM templates for efficient Infrastructure as Code (IaC) deployments.Mandatory proficiency

DevOps Engineer

Intento Analytics LLC

Frisco, Texas, USA

Contract

Client looking for 10+ Years of experience. Title: AIOps Engineer(Devops with AI/ML) Location: Frisco, TX Locals Required Mandatory Skills: Devops/SRE, AI/ML, Python, Monitoring Tools Job Description: We are looking AIOps Engineer (An experienced SRE with knowledge of how to implement AI/ML). Must Have Skills Machine Learning: 8+ Years AI Frameworks(e.g., TensorFlow, PyTorch, scikit-learn): 5+ Years Monitoring & Observability Tools(e.g., Splunk, Prometheus, Grafana, ELK Stack): 5+ Years Python:

Senior Machine Learning Ops Engineer, Global SRE

TikTok

San Jose, California, USA

Full-time

Location : San Jose Employment Type : Regular Job Code : A04380 Apply to this job Share this listing: Responsibilities MLOps - Global SRE team is responsible for the stability of machine learning systems under the Global Monetization Products and Technology organization, to ensure the stable and efficient operations of machine learning models from data preparation, development, training, deployment, serving and so on. Responsibilities 1) Responsible for setting SLOs of online machine lear

SRE Operations-L2 Lead

Talent Groups

Chicago, Illinois, USA

Contract, Third Party

Must have worked on support projects Must know Google Cloud Platform, Kubernetes and Dynatrace Knowledge on Splunk / Sumologic log monitoring is an advantage Monitoring all the key dashboards and timely alerting ,Find RCA for all production issues, help team to debug potential issues, handling daily standup calls and all other client calls, Run through all the JIRA tickets and have the ticket updated with latest findings/RCAShould represent SRE in all client calls and have the deep knowledge on

IT Operations Senior Consultant

Korn Ferry

Marlborough, Massachusetts, USA

Contract

We have partnered with our client in their search for an IT Operations Senior Consultant. The Director, Head of IT Operations & Service Excellence is the strategic and operational leader responsible for uptime, resiliency, and world?class member experiences across BJ's digital and enterprise technology landscape. The role sets the "north?star" for what "good" looks like-defining and publishing service?level objectives (SLOs/SLIs) and operational key results-while building the organizational musc

Senior Engineering Program Manager, iCloud SRE, Apple Services Engineering

Apple, Inc.

No location provided

Full-time

The Apple Services Engineering team is one of the most exciting examples of Apple's long-held passion for combining art and technology. These are the people who power the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. And they do it on an extensive scale, meeting Apple's high expectations with dedication to deliver a huge variety of entertainment in over 35 languages to more than 150 countries. Our Program Managers partner with engineers who build secure, end-to-end solutions

SRE + DevOps Engineer

Sryven

Dallas, Texas, USA

Full-time

Position Title: SRE + DevOps Engineer Location: Dallas, TX & NYC/Jersey City, NJ - Onsite Type of Hire: Full-Time Employee (FTE) / Direct W2 Visa: Independent only Role Summary We are seeking an experienced SRE + DevOps Engineer with 10+ years of experience in managing large-scale cloud infrastructure, implementing CI/CD pipelines, ensuring service reliability, and championing DevOps culture. This role requires deep expertise in cloud platforms, automation, observability, and secure infrastruct

Senior System Engineer

Robert Half

Glendale, California, USA

Contract

Description Senior Site Reliability Engineer/Senior System Engineer (Contract) Job Type: Temporary (Contract) Location: Hybrid - Orlando, FL or Los Angeles, CA (3-4 days onsite per week, with flexible remote options) Work Schedule: Monday-Friday Position Overview: We are seeking a highly skilled Senior Site Reliability Engineer to join our Enterprise Technology team. This role is responsible for designing, implementing, and supporting cloud infrastructure, tools, and services that power bot

SRE Engineer / L3 Support

NasTech Global, Inc.

San Jose, California, USA

Full-time

Job Title: SRE Engineer / L3 Support Location: San Jose, CA(Onsite) Job Type: Full Time Must Have Technical/Functional Skills SRE, NetApp Storage, Linux Certified, Kubernetes Certified, DevOps, Docker, etc. Roles & Responsibilities Experienced Senior SRE working on Kubernetes, with On-Premises experience Candidate should work independently with little guidance from the leads. Experience in working with AWS. Experience in DB technologies in PostGres and MongoDB. Experience in working with the

Senior Reliability Engineer - DaaS

London Stock Exchange Group

St. Louis, Missouri, USA

Full-time

We are looking for a Senior Site Reliability Engineer to join our team delivering and supporting critical applications running on Azure. The ideal candidate will be an expert in Azure services, have a combination of SRE and DevOps skills including automation, monitoring, observability, CI/CD, incident management, and have a deep understanding of end to end application workflow. As a Senior Site Reliability Engineer, you will play a pivotal role in ensuring the reliability and performance of our

Lead Reliability Engineer

London Stock Exchange Group

St. Louis, Missouri, USA

Full-time

We are looking for a Senior Site Reliability Engineer to join our team delivering and supporting critical applications running on Azure. The ideal candidate will be an expert in Azure services, have a combination of SRE and DevOps skills including automation, monitoring, observability, CI/CD, incident management, and have a deep understanding of end to end application workflow. As a Senior Site Reliability Engineer, you will play a pivotal role in ensuring the reliability and performance of our

Senior Site Reliability Developer | Oracle Health

Oracle Corporation

No location provided

Full-time

Job Description Senior Site Reliability Engineer | Oracle Health Join the Oracle Health Team! At Oracle, we're pioneering a new chapter in healthcare technology with Oracle Health Applications & Infrastructure. Our team is creating a cutting-edge platform to modernize healthcare through automation and reliability. This is your chance to be part of an entrepreneurial, fast-paced environment where your contributions will directly impact our success. As a Senior Site Reliability Engineer (SRE)

SeniorSite Reliability Engineer

Pyramid Consulting, Inc.

Irving, Texas, USA

Contract

Immediate need for a talented Senior Site Reliability Engineer. This is a 12+months contract opportunity with long-term potential and is located in Irving, TX (Onsite). Please review the job description below and contact me ASAP if you are interested. Job ID: 25-77084 Pay Range: $68 - $70/hour. Employee benefits include, but are not limited to, health insurance (medical, dental, vision), 401(k) plan, and paid sick leave (depending on work location). Key Responsibilities: Run the production envi

Lead Site Reliability Engineer-North Carolina, Concord Location- Contract

Pegasus Knowledge Solutions

Concord, North Carolina, USA

Contract, Third Party

Title : Lead Site Reliability Engineer Job Type : Contract Location : North Carolina, Concord Location. We are seeking a Lead Site Reliability Engineer (SRE) with deep expertise in AWS networking, infrastructure automation, and production system reliability. This role demands a strong grasp of observability, operational excellence, and the ability to drive the adoption of DevOps/SRE best practices across engineering teams. You will be instrumental in shaping SLIs/SLOs, defining our DevOps matur

AIOps Engineer

Spiceorb

Frisco, Texas, USA

Contract, Third Party

Hello, SpiceOrb is looking for AIOos Engineer in Texas, Below is the JD: Role: AIOps Engineer Location: Frisco, TX(Onsite) Duration: 12+ Months Contract We are looking for an AIOps Engineer ( An experienced SRE with knowledge of how to implement AI/ML). Must Have Skills: Machine Learning: 8+ Years AI Frameworks(e.g., TensorFlow, PyTorch, scikit-learn): 5+ Years Monitoring & Observability Tools(e.g., Splunk, Prometheus, Grafana, ELK Stack): 5+ Years Python: 5+ Years Cloud: 5+ Years DevOps / SRE:

Senior DevOps Engineer

Relativity Space

Long Beach, California, USA

Full-time

At Relativity Space, we're building rockets to serve today's needs and tomorrow's breakthroughs. Our Terran R vehicle will deliver customer payloads to orbit, meeting the growing demand for launch capacity. But that's just the start. Achieving commercial success with Terran R will unlock new opportunities to advance science, exploration, and innovation, pioneering progress that reaches beyond the known. Joining Relativity means becoming part of something where autonomy, ownership, and impact ex

Lead Site Reliability Engineer

JPMorgan Chase & Co.

Jersey City, New Jersey, USA

Full-time

Job Description Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability. As a Lead Site Reliability Engineer at JPMorgan Chase within the Commercial and Investment Bank Technology, you hold a leadership role in your team, demonstrate strong knowledge across multiple technical domains, and advise others on the technical and business issues facing them. Take lead and conduct res

Mid Level SRE

Motion Recruitment Partners, LLC

Portsmouth, New Hampshire, USA

Full-time

A tech-driven organization based in Portsmouth, NH is hiring a Mid-Level Site Reliability Engineer (SRE) to join their NetOps Team. This is a full-time, on-site role focused on building and supporting infrastructure for a high-scale network monitoring and observability platform. You will be working at a scale that is difficult to find at most companies outside of the FAANG organizations. You'll be working with Terraform modules, Helm chart deployments, and Kubernetes environments, not simply co

Lead Site Reliability Engineer

JPMorgan Chase & Co.

Wilmington, Delaware, USA

Full-time

Job Description Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability. As a Lead Site Reliability Engineer at JPMorgan Chase within the Enterprise technology, Corporate technology team, you hold a leadership role in your team, demonstrate strong knowledge across multiple technical domains, and advise others on the technical and business issues facing them. Take lead and cond