Site Reliability Engineer Jobs in District of Columbia

1 - 20 of 234 Jobs

Site Reliability Engineer

iTvorks Inc

Reston, Virginia, USA

Contract

Job Title: Site Reliability Engineer Location: Reston, VA Duration: 24 Months Overall years of experience: 8+ years of related experience in their specific area with experience leading teams on projects with similar scope and complexity. Certifications: AWS Solutions Architect, Agile Certified Practitioner (ACP), or relevant cloud certifications. Job Description: We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a stro

Site Reliability Engineer

Accenture LLP

Reston, Virginia, USA

Full-time

At Accenture Federal Services, nothing matters more than helping the US federal government make the nation stronger and safer and life better for people. Our 13,000+ people are united in a shared purpose to pursue the limitless potential of technology and ingenuity for clients across defense, national security, public safety, civilian, and military health organizations. Join Accenture Federal Services, a technology company and part of global Accenture, to do work that matters in a collaborative

Site Reliability Engineer (SRE)

Virtuous Tech Inc

McLean, Virginia, USA

Contract

Role: Site Reliability Engineers Location: McLean, Virginia Tech Stack: OpenTelemetry, New Relic, Splunk, other observability platforms, Python, AWS. DevOps engineer who has heavy configuration management experience and pipeline automation experience. Pure reliability engineering experience with a background in performance engineering. Key responsibilities include implementing observability and monitoring. OpenTelemetry and implement full coverage of logs, metrics and traces to define al

Site Reliability Engineer

Accenture LLP

Elkridge, Maryland, USA

Full-time

At Accenture Federal Services, nothing matters more than helping the US federal government make the nation stronger and safer and life better for people. Our 13,000+ people are united in a shared purpose to pursue the limitless potential of technology and ingenuity for clients across defense, national security, public safety, civilian, and military health organizations. Join Accenture Federal Services, a technology company and part of global Accenture, to do work that matters in a collaborative

Site Reliability Engineer

Motion Recruitment Partners, LLC

Arlington, Virginia, USA

Full-time

Site Reliability Engineer As the Senior or Staff SRE on the Platform Engineering team, you'll be joining at a foundational stage and play a key role in building and shaping a secure, resilient, and high-performance platform that powers engineering capabilities. The company is located in New York and will remain 100% remote. What You Will Be Doing: Drive Platform Excellence: Continuously improve the platform's reliability, scalability, and deployment efficiency through innovative solutions and r

SRE

Ajace Inc

Reston, Virginia, USA

Full-time

Key Responsibilities: 1. Cloud Infrastructure & Automation: Design, implement, and manage cloud-based infrastructure using platforms like AWS, Azure, or Google Cloud Platform. Utilize Infrastructure-as-Code (IaC) tools such as Terraform, CloudFormation, and Ansible to automate deployments and configurations. Create robust automation targeted at anomaly detection, toil reduction, recovery processes, and self-healing mechanisms, and optimize cloud costs. 2. DevSecOps & CI/CD: Deep understanding of

Site Reliability Engineer

Nightwing

Sterling, Virginia, USA

Full-time

Nightwing provides technically advanced full-spectrum cyber, data operations, systems integration and intelligence mission support services to meet our customers' most demanding challenges. Our capabilities include cyber space operations, cyber defense and resiliency, vulnerability research, ubiquitous technical surveillance, data intelligence, lifecycle mission enablement, and software modernization. Nightwing brings disruptive technologies, agility, and competitive offerings to customers in th

Site Reliability Engineer

Motion Recruitment Partners, LLC

Fort Meade, Maryland, USA

Full-time

A mission-focused technology start-up based out of Arlington is seeking a Site Reliability Engineer (SRE) to support the deployment and stability of AI-powered cyber applications running in secure AWS enclave environments at Fort Meade. This role is ideal for candidates with deep DevOps, AWS, and containerization expertise who are passionate about maintaining high-performance systems that support national security objectives. You'll serve as the operational bridge between the deployment site and

Lead Systems Engineer (Datadog, AWS & ServiceNow Integration)

Lumen Solutions Group Inc.

Washington, District of Columbia, USA

Contract

Job Description: Lead Systems Engineer (Datadog, AWS & ServiceNow Integration). Job Summary: We are seeking a seasoned Lead Systems Engineer with deep expertise in Datadog, AWS, and ServiceNow integration. In this leadership role, you will oversee the design, implementation, and maintenance of comprehensive monitoring, observability, and incident management solutions for cloud-based infrastructure and applications. You will play a key role in guiding the team to ensure operational excellence, system

Lead Observability Engineer Sumo Logic & SRE, Location: Remote

NeoTech Solutions

US

Third Party, Contract

Role : Lead Observability Engineer Sumo Logic & SRE Location : Remote Hire type : Contract JD: Experience: 10+ years (with 3+ years in Sumo Logic & Cloud-native observability) Job Summary: We are seeking a highly skilled Lead Observability Engineer to lead a critical implementation of Sumo Logic for a client migrating from Dynatrace. This role requires deep expertise in Sumo Logic, Site Reliability Engineering (SRE) practices, and Kubernetes (EKS) observability. The ideal candidate will de

Lead Observability Engineer Sumo Logic & SRE, Remote

Sibitalent Corp

Remote

Contract

Role : Lead Observability Engineer Sumo Logic & SRE Location : Remote Hire type : Contract JD: Experience: 10+ years (with 3+ years in Sumo Logic & Cloud-native observability) Job Summary: We are seeking a highly skilled Lead Observability Engineer to lead a critical implementation of Sumo Logic for a client migrating from Dynatrace. This role requires deep expertise in Sumo Logic, Site Reliability Engineering (SRE) practices, and Kubernetes (EKS) observability. The ideal candidate will design

Site Reliability Engineer

Madison-Davis, LLC

Remote

Contract

Role: Drive the technical implementation of monitoring and alerting strategies across enterprise-scale applications and infrastructure. Collaborate directly with development teams to ensure each new initiative includes the correct telemetry, log tagging, and alert payloads. Act as a liaison to Level 2 and Level 3 support teams to maintain and enhance monitoring dashboards used by the enterprise command center (EMC). Standardize alert formats to ensure consistency with SRE policies and support downs

Site Reliability Engineer

Zachary Piper Solutions, LLC

Remote

Full-time

Piper Companies is seeking a Remote Site Reliability Engineer to join a leading cybersecurity and cloud consulting firm. The Site Reliability Engineer will play a key role in building and maintaining secure, scalable infrastructure while supporting automation, compliance, and operational excellence across client environments. Responsibilities of the Site Reliability Engineer include: Develop and deploy automation scripts, tooling, and infrastructure to meet client needs. Manage patching processes

Principal Site Reliability Engineer

Clarity Innovations

Arlington, Virginia, USA

Full-time

Clarity Innovations is a trusted national security partner, dedicated to safeguarding our nation's interests and delivering innovative solutions that empower the Intelligence Community (IC) and Department of Defense (DoD) to transform data into actionable intelligence, ensuring mission success in an evolving world. Our mission-first software and data engineering platform modernizes data operations, utilizing advanced workflows, CI/CD, and secure DevSecOps practices. We focus on challenges in Inf

Site Reliability Engineer, Kubernetes Platform (Starshield)

SpaceX

Washington, District of Columbia, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER, KUBERNETES PLATFORM (STARSHIELD) At SpaceX we're leveraging our experience in building rockets and spacecraft to deploy the Starshield constellation. Starshield is the world's largest US gov

Site Reliability Engineer

General Dynamics

Remote or Aurora, Colorado, USA

Full-time

Basic Qualifications Bachelor's degree in Computer Science, a related field or equivalent experience is required plus a minimum of 5 years of relevant experience; or Master's degree plus 3 years of relevant experience. CLEARANCE REQUIREMENTS: Department of Defense TS/SCI security clearance is required at time of hire. Applicants selected will be subject to a U.S. Government security investigation and must meet eligibility requirements for access to classified information. Due to the nature of

Site Reliability Engineer

Akamai Technologies

Cambridge, England, United Kingdom

Full-time

Do you enjoy collaborating with teams to solve complex challenges? Do you have a passion for cutting edge technologies and tackling system problems? Join our highly skilled Site Reliability team Our team monitors and measures the reliability of our suite of Compute products and platform. In collaboration with Engineering and Product teams, we improve the performance and reliability of the products we support. Partner with the best You will apply statistical data analysis and an understandin

FLEX Senior Systems Engineer - SRE

Marriott International

Bethesda, Maryland, USA

Full-time

Job Description The Senior Systems Engineer - Site Reliability Engineering (SRE) is responsible for the reliability, scalability, and performance of mission-critical cloud and on-prem services that support millions of Marriott customers globally. This role involves overseeing incident management, driving automation efforts, and working closely with cross-functional teams to ensure alignment between SRE strategy and business objectives. Partners closely with Product Teams, Applications teams, Inf

Site Reliability Engineer (SRE) - Senior

Electronic Consulting Services, Inc (ECS Federal)

Arlington, Virginia, USA

Full-time

Job Description ECS is seeking a Site Reliability Engineer (SRE) - Senior to work in our Arlington, VA office. Please Note: This position is contingent upon contract award. Program Description ECS is seeking talented professionals to join our successful and growing team in building the next-generation Threat Intelligence Enterprise Service (TIES) solution. The TIES Program is the Cybersecurity and Infrastructure Security Agency's (CISA) dynamic approach to fulfilling its federally mandated cy

Site Reliability Engineer

McKesson Corporation

Remote or Columbus, Ohio, USA

Full-time

McKesson is an impact-driven, Fortune 10 company that touches virtually every aspect of healthcare. We are known for delivering insights, products, and services that make quality care more accessible and affordable. Here, we focus on the health, happiness, and well-being of you and those we serve - we care. What you do at McKesson matters. We foster a culture where you can grow, make an impact, and are empowered to bring new ideas. Together, we thrive as we shape the future of health for patien