Site Reliability Engineer Jobs in Washington

1 - 20 of 233 Jobs

Site Reliability Engineer

Apolis

Columbia, Maryland, USA

Full-time, Contract

Job Title: Site Reliability Engineer (SRE) Location: Columbia, MD or Chicago, IL (Hybrid Preferred - 4 days onsite, with flexibility) Type: Contract-to-Hire (6-12 months with potential for conversion) About the Role Join our dynamic Platform Engineering team as a Site Reliability Engineer (SRE). You'll be responsible for ensuring the reliability, scalability, and performance of our systems, working in a fast-paced and collaborative environment. The role is open to both senior engineers (5+ years

Site Reliability Engineer

Motion Recruitment Partners, LLC

Arlington, Virginia, USA

Full-time

Site Reliability Engineer As the Senior or Staff SRE on the Platform Engineering team, you'll be joining at a foundational stage and play a key role in building and shaping a secure, resilient, and high-performance platform that powers engineering capabilities. The company is located in New York and will remain 100% remote. What You Will Be Doing: Drive Platform Excellence: Continuously improve the platform's reliability, scalability, and deployment efficiency through innovative solutions and r

SRE Splunk Consultant

Compsciprep LLC

Reston, Virginia, USA

Contract, Third Party

Key Responsibilities* Design and implement observability strategies using OpenTelemetry for distributed tracing, metrics, and logging.* Instrument microservices written in Java and Python using Otel SDKs and auto-instrumentation tools.* Develop and maintain Splunk dashboards, alerts, and reports to provide actionable insights into system performance and reliability.* Collaborate with development and operations teams to ensure consistent and effective telemetry across services.* Automate monitori

Site Reliability Engineer

Thoughtwave Software and Solutions

Columbia, Maryland, USA

Full-time, Part-time, Contract

ROLE: Site Reliability Engineer LOCATION: Columbia, MD or Chicago, IL (3 DAYS ONSITE, 2 DAYS REMOTE) DURATION: 12+ MONTHS No. of positions: 5 (3 senior and 2 junior to mid-level) Must work on W2 Must haves: 5+ years of Site Reliability Engineering experience. Extensive experience with SonarQube. Experience with Harness is nice to have, but they would like this if you find candidates with it. Extensive experience with CI/CD pipelines (Docker or Kubernetes). Hands-on experience with monitoring tools (e.g., P

SRE

Ajace Inc

Reston, Virginia, USA

Full-time

Key Responsibilities: 1. Cloud Infrastructure & Automation: Design, implement, and manage cloud-based infrastructure using platforms like AWS, Azure, or Google Cloud Platform. Utilize Infrastructure-as-Code (IaC) tools such as Terraform, CloudFormation, and Ansible to automate deployments and configurations. Create robust automation targeted at anomaly detection, toil reduction, recovery processes, and self-healing mechanisms, and optimize cloud costs. 2. DevSecOps & CI/CD: Deep understanding of

Site Reliability Engineer

Nightwing

Sterling, Virginia, USA

Full-time

Nightwing provides technically advanced full-spectrum cyber, data operations, systems integration and intelligence mission support services to meet our customers' most demanding challenges. Our capabilities include cyber space operations, cyber defense and resiliency, vulnerability research, ubiquitous technical surveillance, data intelligence, lifecycle mission enablement, and software modernization. Nightwing brings disruptive technologies, agility, and competitive offerings to customers in th

Site Reliability Engineer

Motion Recruitment Partners, LLC

Fort Meade, Maryland, USA

Full-time

A mission-focused technology start-up based out of Arlington is seeking a Site Reliability Engineer (SRE) to support the deployment and stability of AI-powered cyber applications running in secure AWS enclave environments at Fort Meade. This role is ideal for candidates with deep DevOps, AWS, and containerization expertise who are passionate about maintaining high-performance systems that support national security objectives. You'll serve as the operational bridge between the deployment site and

Site Reliability Engineer only w2

Symphony Corporation

Remote

Contract

Site Reliability Engineer 6 Months Remote only W-2 The client is looking for a site reliability engineer.

SRE Architect

Alpha Silicon

US

Full-time

Roles & Responsibilities: 18+ years of Development and Operations experience in building and running applications in production that has uptime over 99%. Related experience and/or training; or equivalent combination of education and experience 8+ years of experience as a SRE Architect in running large Reliability & Observability Programs for large, complex infrastructure deployments / distributed systems for major Banking customers. Has a keen eye for industry trends, tries out newer tools/infra

Site Reliability Engineer

Kforce Technology Staffing

Remote or Redwood City, California, USA

Contract

RESPONSIBILITIES: Kforce has a client that is seeking a Reliability Engineer in Redwood City, CA. The client is looking for a consultant who can help our client with the following deliverables: * High-Level Deliverables for Contract Duration with Estimated Sizing * (Small) Remediate all known policy violations within existing Docker build images * Build image: 2 high, 4 medium, 32 low severity issues related to dependencies * Requires knowledge of Docker, tangential comfort with Rust software

FLEX Senior Site Reliability Engineer

Marriott International

Bethesda, Maryland, USA

Full-time

Job Description This is a temporary position. Overview: We're seeking a Senior Site Reliability Engineer to shape and drive our global infrastructure and reliability strategy across 9,500+ properties worldwide. This role is ideal for a strategic thinker with deep experience in Site Reliability Engineering (SRE), Cloud computing, and large-scale enterprise transformation. As part of a highly matrixed organization, you will lead cross-functional efforts that touch franchised and managed operatio

Mobile Application Site Reliability Engineer

BOOZ, ALLEN & HAMILTON, INC.

Washington, District of Columbia, USA

Full-time

Mobile Application Site Reliability Engineer The Opportunity: Do you love finding ways to make applications more efficient? Do you find it impossible to simply maintain when you could improve? Engineering to make applications more resilient and efficient frees up time and money to build more capabilities. Whether you come from a background in network engineering, systems administration, or software development - if you have a passion for making systems better, we need you! You'll work with a

Site Reliability Engineer (Amdocs)

Highbrow

Remote

Full-time

Key Responsibilities Design, build, and maintain scalable, reliable, and secure infrastructure across production and staging environments. Automate operational tasks and processes using code (Python, Go, Bash, etc.). Drive infrastructure as code (IaC) practices using tools like Terraform, Ansible, or similar. Monitor, troubleshoot, and improve system availability, latency, and performance. Collaborate closely with development, QA, and product teams to design scalable system architecture. Conduct

Site Reliability Engineer

Madison-Davis, LLC

Remote

Contract

Role: Drive the technical implementation of monitoring and alerting strategies across enterprise-scale applications and infrastructure. Collaborate directly with development teams to ensure each new initiative includes the correct telemetry, log tagging, and alert payloads. Act as a liaison to Level 2 and Level 3 support teams to maintain and enhance monitoring dashboards used by the enterprise command center (EMC). Standardize alert formats to ensure consistency with SRE policies and support downs

Site Reliability Engineer, Kubernetes Platform (Starshield)

SpaceX

Washington, District of Columbia, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER, KUBERNETES PLATFORM (STARSHIELD) At SpaceX we're leveraging our experience in building rockets and spacecraft to deploy the Starshield constellation. Starshield is the world's largest US gov

Site Reliability Engineer

Zachary Piper Solutions, LLC

Remote

Full-time

Piper Companies is seeking a Remote Site Reliability Engineer to join a leading cybersecurity and cloud consulting firm. The Site Reliability Engineer will play a key role in building and maintaining secure, scalable infrastructure while supporting automation, compliance, and operational excellence across client environments. Responsibilities of the Site Reliability Engineer include: Develop and deploy automation scripts, tooling, and infrastructure to meet client needs; manage patching processes

Sr. Site Reliability Engineer - W2 only

Nasscomm, Inc.

Remote

Contract

Site Reliability Engineer - At least 6+ years of experience defining and implementing Monitoring solutions - alerts, Telemetry, and instrumentation for on-premises and cloud platforms for large enterprises - Site Reliability Engineer will be playing a key role in building Observability and Resilience capabilities on cloud platform (Azure). Responsibilities of the SRE will be: - Build and configure alerts, tracing, telemetry, and instrumentation required for Infrastructure Monitoring and Applicati

FLEX Senior Systems Engineer - SRE

Marriott International

Bethesda, Maryland, USA

Full-time

Job Description The Senior Systems Engineer - Site Reliability Engineering (SRE) is responsible for the reliability, scalability, and performance of mission-critical cloud and on-prem services that support millions of Marriott customers globally. This role involves overseeing incident management, driving automation efforts, and working closely with cross-functional teams to ensure alignment between SRE strategy and business objectives. Partners closely with Product Teams, Applications teams, Inf

Site Reliability Engineer

Splunk Inc.

Colorado, USA

Full-time

Description Site Reliability Engineer Join us on the Splunk TechOps team, empowering our customers to execute our vision making machine data accessible, usable, and valuable to everyone! The Splunk TechOps organization runs Splunk cloud, blending SRE, Systems Engineering and Service Engineering disciplines, across functional global teams. Come join a team that is striving for operational awesomeness and trying to automate the world. We have a large presence with large cloud vendors. You should

Splunk Developer - Dashboards

SES

Reston, Virginia, USA

Contract

Top 4 Technical Skills: Splunk for Dashboards SRE Site Reliability Engineering Python Java Top 3 Soft Skills: Independent worker will be the SME of a small group Good communications working with Business and possibly Teaching Mentoring Happy to be in Reston 3 days per week About SES: Systems Engineering Services Corporation (SESC), founded in 1989, is a leading provider of technology solutions to Fortune 1000 companies and government organizations. Specializing in Accelerated Development Servic