Site Reliability Engineer Jobs in Virginia

Refine Results
21 - 40 of 204 Jobs

Mainframe Site Reliability Engineer

Fynbosys Inc

Remote

Full-time

A Mainframe Site Reliability Engineer (SRE) applies software engineering principles to mainframe operations to enhance system reliability, scalability, and efficiency. Acting as a bridge between development and operations, the mainframe SRE focuses on automation, proactive monitoring, incident response, and performance optimization of mission-critical mainframe systems. Key responsibilities typically include:Automating repetitive operational tasks to reduce manual intervention and human errorEnh

Site Reliability Engineer - CTJ - Top Secret

Microsoft Corporation

Reston, Virginia, USA

Full-time

Do you have a passion for high scale services and working with some of Microsoft's most critical customers? We're looking for a Site Reliability Engineer with the right mix of software development, on-line services experience and passion for quality to envision, design, and deliver Office 365 government cloud service offerings. Office 365 is at the center of Microsoft's cloud first, devices first strategy as it brings together cloud versions of our most trusted communication and collaboration p

Technical Lead, Site Reliability Engineer, Fleetnet

Tesla Motors

Remote or Palo Alto, California, USA

Full-time

We are a small team of experts focused on creating the next-generation server-side infrastructure for Tesla. We're the invisible link connecting every Tesla product, whether it's vehicles, robots, robotaxis, chargers or even mobile apps to bring customers the best user experience possible. We're looking for strong, hands on, technical leader with domain expertise in one or more of: containers, public clouds, or private clouds. Today, over 10 million Tesla users rely on our services to safely and

Senior Site Reliability Engineer

McKesson Corporation

Remote or Columbus, Ohio, USA

Full-time

McKesson is an impact-driven, Fortune 10 company that touches virtually every aspect of healthcare. We are known for delivering insights, products, and services that make quality care more accessible and affordable. Here, we focus on the health, happiness, and well-being of you and those we serve - we care. What you do at McKesson matters. We foster a culture where you can grow, make an impact, and are empowered to bring new ideas. Together, we thrive as we shape the future of health for patien

SRE Practice Architect

TEKsystems c/o Allegis Group

Remote

Full-time

Description We are looking for a SRE leader /Architect who has "leadership experience" in software development, system architecture and SRE practices. S/he will set the strategy for the SRE practice in the responsible business technology domain, be accountable for its performance outcome. Qualified candidate must demonstrate experience collaborating with and influencing many stakeholders across organization and deep technical background across technology stacks, including applications, data and

SRE Practice Architect

TEKsystems c/o Allegis Group

Remote

Full-time

Description We are looking for a SRE leader /Architect who has "leadership experience" in software development, system architecture and SRE practices. S/he will set the strategy for the SRE practice in the responsible business technology domain, be accountable for its performance outcome. Qualified candidate must demonstrate experience collaborating with and influencing many stakeholders across organization and deep technical background across technology stacks, including applications, data and

Site Reliability Engineer - Data Analytics - Hybrid

Swift

Culpeper, Virginia, USA

Full-time

ABOUT US We're the world's leading provider of secure financial messaging services, headquartered in Belgium. We are the way the world moves value - across borders, through cities and overseas. No other organisation can address the scale, precision, pace and trust that this demands, and we're proud to support the global economy. We're unique too. We were established to find a better way for the global financial community to move value - a reliable, safe and secure approach that the community can

Site Reliability Engineer - Data Analytics - Hybrid

Swift

Culpeper, Virginia, USA

Full-time

ABOUT US We're the world's leading provider of secure financial messaging services, headquartered in Belgium. We are the way the world moves value - across borders, through cities and overseas. No other organisation can address the scale, precision, pace and trust that this demands, and we're proud to support the global economy. We're unique too. We were established to find a better way for the global financial community to move value - a reliable, safe and secure approach that the community can

Director, Site Reliability Engineering

Walmart Inc.

Remote or Bentonville, Arkansas, USA

Full-time

Position Summary What you'll do Are you passionate about pioneering cutting-edge technology leveraging GenAI and big data to revolutionize Walmart's customer service experiences? Do you dream of working on innovative systems that make a significant impact on hundreds of millions of customers across the globe? We are seeking a visionary and hands-on Director of Site Reliability Engineering (SRE) to lead and scale a world-class SRE organization. This leader will be responsible for building a hig

Senior Site Reliability Engineer - Remote

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you have a passion for cutting edge technologies and tackling system problems? Are you a self-starting professional who thrives in a fast-paced environment? Join our critical CPS SRE team! We ensure that infrastructure services have world-class reliability and uptime. Site Reliability Engineer(SRE)s are the driving force that keeps the system running smoothly and helps identify any bottlenecks before they become issues. We focus on optimizing services, building infrastructure, and eliminat

Senior Site Reliability Engineer, ML Platforms

NVIDIA Corporation

Remote or Santa Clara, California, USA

Full-time

Are you passionate about building and maintaining large-scale production systems that support advanced data science and machine learning applications? Do you want to join a team at the heart of NVIDIA's data-driven decision-making culture? If so, we have a great opportunity for you! NVIDIA is seeking a Senior Site Reliability Engineer (SRE) for the Data Science & ML Platform(s) team. The role involves designing, building, and maintaining services that enable real-time data analytics, streaming,

Senior Site Reliability Engineer

General Motors

Remote

Full-time

Job Description Develop and design software applications for driverless technology company. Duties may include: Build out and improve observability systems, tools and the related codebase. Contribute code, perform code reviews, and create technical designs that improve performance and reliability of observability systems using software and systems engineering skills. Partner with other Software Engineering teams to better understand use-cases and guide the engineers to use the existing tools eff

Principal Site Reliability Engineer (Prisma Access)

PaloAlto Networks

Reston, Virginia, USA

Full-time

Company Description Our Mission At Palo Alto Networks everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we're looking for innovators who are as committed to shaping the future of cybersecurity as we are. Who We Are We take our mission of

Director Site Reliability Engineering

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you enjoy collaborating with teams to solve complex challenges? Do you have a passion for cutting edge technologies and tackling system problems? Join our highly skilled Network SRE team We build and operate the Network infrastructure powering Akamai's global cloud platform. Our mission is to deliver reliable, scalable, and performant systems that enable customers to run critical workloads with confidence. As part of this team, you'll help ensure reliability at scale, maintaining the avail

Senior Site Reliability Engineer - (Institutional)

Coinbase

Remote

Full-time

Ready to be pushed beyond what you think you're capable of? At Coinbase, our mission is to increase economic freedom in the world. It's a massive, ambitious opportunity that demands the best of us, every day, as we build the emerging onchain platform - and with it, the future global financial system. To achieve our mission, we're seeking a very specific candidate. We want someone who is passionate about our mission and who believes in the power of crypto and blockchain technology to update the

Site Reliability Engineer (need strong python coding skills)

Artech, LLC

Remote

Contract

Do something big and innovative! Stretch your creative muscles and work on big issues. Since 1989, we have developed technology environments, applications, and tools by providing experienced teams to implement, enhance, and maintain our clients essential systems and applications. Come join the Scalence team! Job Title: Site Reliability Engineer Location: 100% REMOTE (PST work hours) Duration: 6-12+ months Pay Rate : $60 - $65 /hr. Job Description You will play a pivotal role in ensuring the

Lead Observability Engineer

Fixity Technologies

Remote

Full-time, Part-time, Contract, Third Party

JD: Experience: 10+ years (with 3+ years in Sumo Logic & Cloud-native observability) Job Summary: We are seeking a highly skilled Lead Observability Engineer to lead a critical implementation of Sumo Logic for a client migrating from Dynatrace. This role requires deep expertise in Sumo Logic, Site Reliability Engineering (SRE) practices, and Kubernetes (EKS) observability. The ideal candidate will design and implement scalable dashboards, alerts, and tracing strategies, drive service-level reli

Sr Implementation Lead, SRE (CoP)

Northern Trust

Remote or Chicago, Illinois, USA

Full-time

About Northern Trust: Northern Trust, a Fortune 500 company, is a globally recognized, award-winning financial institution that has been in continuous operation since 1889. Northern Trust is proud to provide innovative financial services and guidance to the world's most successful individuals, families, and institutions by remaining true to our enduring principles of service, expertise, and integrity. With more than 130 years of financial experience and over 22,000 partners, we serve the world'

Sr. Site Reliability Engineer - ServiceNow

Visa Inc.

Ashburn, Virginia, USA

Full-time

Company Description Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure payments network, enabling individuals, businesses, and economies to thrive while driven by a common purpose - to uplift everyone, everywhe

Site Reliability Engineer II - Real-Time

Esri

Vienna, Virginia, USA

Full-time

Overview Join us to work collaboratively with our talented team of dynamic and passionate engineers to deliver capabilities that enable our customers to make a difference. You'll deploy and operate ArcGIS Velocity and ArcGIS Workflow Manager SaaS solutions. You will also have the opportunity to design, deploy, and operate next-generation real-time and big data GIS software-as-a-service (SaaS) capabilities for thousands of cloud users worldwide. Our teams have a broad mix of experience levels a