site reliability engineer Jobs in tennessee

Refine Results
1 - 20 of 143 Jobs

Site Reliability Engineer

iPeople Infosystems LLC

Remote

Contract

Role: Site Reliability Engineer Location: 100% Remote Type: Contract Position Job description: Production support expertise with SRE Observability experience : Proactive issue identification using observability tools.Skills in using different monitoring & observability tools to track system performanceProduction support activities including proactive identification of issues leveraging observability tools, Corelating inputs from various dashboards & tools to drive resolutionExperience in swiftl

SRE / Python Developer

Artech, LLC

Remote

Contract

Summary Our organization builds and provides systems and infrastructure that fuel our core services. We are the foundation on which our software developers build the products that our customers love. We are looking for passionate and dedicated Site Reliability Engineers to continue our focus on providing our customers the highest quality Services experience. Our services have to scale globally, stay highly available, and "just work. If you love designing, engineering and running systems and inf

Data Center Site Reliability Engineer (SRE)

ARROWCORE GROUP

Memphis, Tennessee, USA

Full-time

Title: Data Center Site Reliability Engineer (SRE) Location: Memphis, TN (Onsite) Duration: FTE About the Role We are seeking a Data Center Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of large-scale data center infrastructure supporting advanced AI workloads. In this role, you will collaborate with cross-functional teams to automate operations, enhance observability, and maintain high availability for distributed systems. This is a hands-on technical p

Lead Observability Engineer Sumo Logic & SRE Location :Remote

NeoTech Solutions

US

Third Party, Contract

Role : Lead Observability Engineer Sumo Logic & SRE Location : Remote Hire type : Contract JD: Experience: 10+ years (with 3+ years in Sumo Logic & Cloud-native observability) Job Summary: We are seeking a highly skilled Lead Observability Engineer to lead a critical implementation of Sumo Logic for a client migrating from Dynatrace. This role requires deep expertise in Sumo Logic, Site Reliability Engineering (SRE) practices, and Kubernetes (EKS) observability. The ideal candidate will de

Site Reliability Engineer

Zachary Piper Solutions, LLC

Remote

Full-time

Piper Companies is seeking a Remote Site Reliability Engineer to join a leading cybersecurity and cloud consulting firm. The Site Reliability Engineer will play a key role in building and maintaining secure, scalable infrastructure while supporting automation, compliance, and operational excellence across client environments. Responsibilities of the Site Reliability Engineer include: Develop and deploy automation scripts, tooling, and infrastructure to meet client needsManage patching processes

Site Reliability Engineer

Madison-Davis, LLC

Remote

Contract

Role: Drive the technical implementation of monitoring and alerting strategies across enterprise-scale applications and infrastructure.Collaborate directly with development teams to ensure each new initiative includes the correct telemetry, log tagging, and alert payloads.Act as a liaison to Level 2 and Level 3 support teams to maintain and enhance monitoring dashboards used by the enterprise command center (EMC).Standardize alert formats to ensure consistency with SRE policies and support downs

Senior Dev Operations Engineer SRE

Buxton Consulting

Remote

Contract

Senior Dev Operations Engineer SRE Remote (Pleasanton, CA) 12+ Months Top 3 Must Haves Experience setting up alerts / alarms / notifications in AWS cloud. CloudWatch / Dynatrace Experience with AWS solutions using AWS services including Kafka, ECS, EKS. Experience with IaC (Infrastructure as code) CDK or Terraform. Thanks and Regards, Ajeet Singh Buxton Consulting 2010 Crow Canyon Place STE 100 San Ramon, CA 94583 Direct: Email:

Site Reliability Engineer

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you enjoy collaborating with teams to solve complex challenges? Do you have a passion for cutting edge technologies and tackling system problems? Join our highly skilled Site Reliability team Our team monitors and measures the reliability of our suite of Compute products and platform. In collaboration with Engineering and Product teams, we improve the performance and reliability of the products we support. Partner with the best You will apply statistical data analysis and an understandin

Site Reliability Engineer

General Dynamics

Remote or Aurora, Colorado, USA

Full-time

Basic Qualifications Bachelor's degree in Computer Science, a related field or equivalent experience is required plus a minimum of 5 years of relevant experience; or Master's degree plus 3 years of relevant experience. CLEARANCE REQUIREMENTS: Department of Defense TS/SCI security clearance is required at time of hire. Applicants selected will be subject to a U.S. Government security investigation and must meet eligibility requirements for access to classified information. Due to the nature of

Site Reliability Engineer

McKesson Corporation

Remote or Columbus, Ohio, USA

Full-time

McKesson is an impact-driven, Fortune 10 company that touches virtually every aspect of healthcare. We are known for delivering insights, products, and services that make quality care more accessible and affordable. Here, we focus on the health, happiness, and well-being of you and those we serve - we care. What you do at McKesson matters. We foster a culture where you can grow, make an impact, and are empowered to bring new ideas. Together, we thrive as we shape the future of health for patien

Site Reliability Engineer

UBS AG - Investment Banking

Nashville, Tennessee, USA

Full-time

Your role Are you proficient in Windows Infrastructure and platforms? Do you have sound technical skills and hands on experience in maintaining and improving all aspects of the Windows server environment? We are looking for an engineer to: work with the SRE and Infrastructure engineering teams to improve our firm's hybrid cloud infrastructure be involved in engineering project work and operational support to increase overall supportability and reliability of our firm's enterprise technology env

Sr. Site Reliability Engineer - U.S. Citizen - This role sits within Optum Serves Technology Product organization

Widescope Consulting and Contracting Services

Remote

Full-time

Job Title: Sr. Site Reliability Engineer Location: Headquarters / Telecommute Classification (HR only): Exempt Non-Exempt Reports To (Title): COO Widescope Consulting and Contracting JOB SUMMARY The statements below are not intended to be all-inclusive of the duties and responsibilities of the position. Based on leadership decisions and business needs, all other duties as assigned will be expected for each position.Grafana Widescope Consulting and Contracting is proud to serve our nation's mi

Apigee SRE Automation Lead

Nityo Infotech Corporation

Remote

Contract

Role: Apigee SRE / Automation Lead Remote Contract Job Job Summary: We are seeking a highly skilled and experienced Apigee SRE / Automation Lead to oversee the reliability, scalability, and automation of our API infrastructure. This role demands deep expertise in Apigee platform operations, infrastructure automation, and system reliability engineering. The ideal candidate will have hands-on experience with tools like Terraform, Ansible, DoJo, and scripting, along with a strong background in Lin

SRE (Linux / Golang Automation)

Bayside Solutions

Remote

Contract

Site Reliability Engineer (Linux / Golang Automation) W2 Contract Salary Range: $124,800 - $145,600 per year Location: Remote Role - PST Job Summary: We require a Site Reliability Engineer with a strong background and experience supporting extensive virtualization and Linux compute platforms. Requirements and Qualifications: Experience automating with Golang Experience with Infrastructure as a Service orchestration tools (OpenStack, CloudStack, etc.) Strong experience supporting Linux and

Lead Observability Engineer Sumo Logic

VLink Inc

Remote

Third Party, Contract

VLink is a leading global provider of software engineering services with next-gen technologies and best-in-class talent. With offices in 7+ countries from North America-Europe to APAC & expansion plans in Middle East, VLink has helped SMBs, and large enterprises achieve their business goals, and gained the trust of Fortune-250 companies. VLink is a 'Great Place to Work Certified ' and has been a consistent winner as- Best Places to Work in CT. Trust, collaboration, and accountability are the th

Lead Observability Engineer

Fixity Technologies

Remote

Full-time, Part-time, Contract, Third Party

JD: Experience: 10+ years (with 3+ years in Sumo Logic & Cloud-native observability) Job Summary: We are seeking a highly skilled Lead Observability Engineer to lead a critical implementation of Sumo Logic for a client migrating from Dynatrace. This role requires deep expertise in Sumo Logic, Site Reliability Engineering (SRE) practices, and Kubernetes (EKS) observability. The ideal candidate will design and implement scalable dashboards, alerts, and tracing strategies, drive service-level reli

Senior Site Reliability Engineer

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Would you love to deliver huge value to our customers? Would you enjoy contributing to core technology which is serving billions of people? Join our world-class security team Our team is a part of the Cloud Security Intelligence group. We own one of the largest Big Data environments. The group develops Security infrastructures and Security products for our customers. Partner with the best You will be responsible to ensure optimal performance and up-time of Akamai's critical security product

Mainframe Site Reliability Engineer

Fynbosys Inc

Remote

Full-time

A Mainframe Site Reliability Engineer (SRE) applies software engineering principles to mainframe operations to enhance system reliability, scalability, and efficiency. Acting as a bridge between development and operations, the mainframe SRE focuses on automation, proactive monitoring, incident response, and performance optimization of mission-critical mainframe systems. Key responsibilities typically include:Automating repetitive operational tasks to reduce manual intervention and human errorEnh

Senior Site Reliability Engineer

McKesson Corporation

Remote or Columbus, Ohio, USA

Full-time

McKesson is an impact-driven, Fortune 10 company that touches virtually every aspect of healthcare. We are known for delivering insights, products, and services that make quality care more accessible and affordable. Here, we focus on the health, happiness, and well-being of you and those we serve - we care. What you do at McKesson matters. We foster a culture where you can grow, make an impact, and are empowered to bring new ideas. Together, we thrive as we shape the future of health for patien

SRE Practice Architect

TEKsystems c/o Allegis Group

Remote

Full-time

Description We are looking for a SRE leader /Architect who has "leadership experience" in software development, system architecture and SRE practices. S/he will set the strategy for the SRE practice in the responsible business technology domain, be accountable for its performance outcome. Qualified candidate must demonstrate experience collaborating with and influencing many stakeholders across organization and deep technical background across technology stacks, including applications, data and