Site Reliability Engineer Jobs in Washington

1 - 20 of 228 Jobs

SRE Splunk Consultant

Compsciprep LLC

Reston, Virginia, USA

Contract, Third Party

Key Responsibilities: * Design and implement observability strategies using OpenTelemetry for distributed tracing, metrics, and logging. * Instrument microservices written in Java and Python using Otel SDKs and auto-instrumentation tools. * Develop and maintain Splunk dashboards, alerts, and reports to provide actionable insights into system performance and reliability. * Collaborate with development and operations teams to ensure consistent and effective telemetry across services. * Automate monitori

SRE

Ajace Inc

Reston, Virginia, USA

Full-time

Key Responsibilities: 1. Cloud Infrastructure & Automation: Design, implement, and manage cloud-based infrastructure using platforms like AWS, Azure, or Google Cloud Platform. Utilize Infrastructure-as-Code (IaC) tools such as Terraform, CloudFormation, and Ansible to automate deployments and configurations. Create robust automation targeted at anomaly detection, toil reduction, recovery processes, and self-healing mechanisms, and optimize cloud costs. 2. DevSecOps & CI/CD: Deep understanding of

Site Reliability Engineer

Motion Recruitment Partners, LLC

Arlington, Virginia, USA

Full-time

Site Reliability Engineer As the Senior or Staff SRE on the Platform Engineering team, you'll be joining at a foundational stage and play a key role in building and shaping a secure, resilient, and high-performance platform that powers engineering capabilities. The company is located in New York and will remain 100% remote. What You Will Be Doing: Drive Platform Excellence: Continuously improve the platform's reliability, scalability, and deployment efficiency through innovative solutions and r

FLEX Senior Systems Engineer - SRE

Marriott International

Bethesda, Maryland, USA

Full-time

Job Description The Senior Systems Engineer - Site Reliability Engineering (SRE) is responsible for the reliability, scalability, and performance of mission-critical cloud and on-prem services that support millions of Marriott customers globally. This role involves overseeing incident management, driving automation efforts, and working closely with cross-functional teams to ensure alignment between SRE strategy and business objectives. Partners closely with Product Teams, Applications teams, Inf

Site Reliability Engineer, Kubernetes Platform (Starshield)

SpaceX

Washington, District of Columbia, USA

Full-time

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SITE RELIABILITY ENGINEER, KUBERNETES PLATFORM (STARSHIELD) At SpaceX we're leveraging our experience in building rockets and spacecraft to deploy the Starshield constellation. Starshield is the world's largest US gov

Splunk Developer - Dashboards

SES

Reston, Virginia, USA

Contract

Top 4 Technical Skills: Splunk for dashboards; SRE (Site Reliability Engineering); Python; Java. Top 3 Soft Skills: independent worker who will be the SME of a small group; good communication skills for working with the business, and possibly teaching and mentoring; happy to be in Reston 3 days per week. About SES: Systems Engineering Services Corporation (SESC), founded in 1989, is a leading provider of technology solutions to Fortune 1000 companies and government organizations. Specializing in Accelerated Development Servic

Site Reliability Engineer

Motion Recruitment Partners, LLC

Fort Meade, Maryland, USA

Full-time

A mission-focused technology start-up based out of Arlington is seeking a Site Reliability Engineer (SRE) to support the deployment and stability of AI-powered cyber applications running in secure AWS enclave environments at Fort Meade. This role is ideal for candidates with deep DevOps, AWS, and containerization expertise who are passionate about maintaining high-performance systems that support national security objectives. You'll serve as the operational bridge between the deployment site and

Site Reliability Engineer (Amdocs)

Highbrow

Remote

Full-time

Key Responsibilities Design, build, and maintain scalable, reliable, and secure infrastructure across production and staging environments. Automate operational tasks and processes using code (Python, Go, Bash, etc.). Drive infrastructure as code (IaC) practices using tools like Terraform, Ansible, or similar. Monitor, troubleshoot, and improve system availability, latency, and performance. Collaborate closely with development, QA, and product teams to design scalable system architecture. Conduct

Site Reliability Engineer only w2

Symphony Corporation

Remote

Contract

Site Reliability Engineer, 6 months, remote, W-2 only. The client is looking for a site reliability engineer.

Site Reliability Engineer

Kforce Technology Staffing

Remote or Redwood City, California, USA

Contract

RESPONSIBILITIES: Kforce has a client that is seeking a Reliability Engineer in Redwood City, CA. The client is looking for a consultant who can help our client with the following deliverables: * High-Level Deliverables for Contract Duration with Estimated Sizing * (Small) Remediate all known policy violations within existing Docker build images * Build image: 2 high, 4 medium, 32 low severity issues related to dependencies * Requires knowledge of Docker, tangential comfort with Rust software

Sr. Site Reliability Engineer - W2 only

Nasscomm, Inc.

Remote

Contract

Site Reliability Engineer - At least 6+ years of experience defining and implementing Monitoring solutions - alerts, Telemetry, and instrumentation for on-premises and cloud platforms for large enterprises - Site Reliability Engineer will be playing a key role in building Observability and Resilience capabilities on cloud platform (Azure). Responsibilities of the SRE will be: - Build and configure alerts, tracing, telemetry, and instrumentation required for Infrastructure Monitoring and Applicati

Site Reliability Engineer

Zachary Piper Solutions, LLC

Remote

Full-time

Piper Companies is seeking a Remote Site Reliability Engineer to join a leading cybersecurity and cloud consulting firm. The Site Reliability Engineer will play a key role in building and maintaining secure, scalable infrastructure while supporting automation, compliance, and operational excellence across client environments. Responsibilities of the Site Reliability Engineer include: Develop and deploy automation scripts, tooling, and infrastructure to meet client needs. Manage patching processes

Site Reliability Engineer

Madison-Davis, LLC

Remote

Contract

Role: Drive the technical implementation of monitoring and alerting strategies across enterprise-scale applications and infrastructure. Collaborate directly with development teams to ensure each new initiative includes the correct telemetry, log tagging, and alert payloads. Act as a liaison to Level 2 and Level 3 support teams to maintain and enhance monitoring dashboards used by the enterprise command center (EMC). Standardize alert formats to ensure consistency with SRE policies and support downs

Site Reliability Engineer

Nightwing

Sterling, Virginia, USA

Full-time

Nightwing provides technically advanced full-spectrum cyber, data operations, systems integration and intelligence mission support services to meet our customers' most demanding challenges. Our capabilities include cyber space operations, cyber defense and resiliency, vulnerability research, ubiquitous technical surveillance, data intelligence, lifecycle mission enablement, and software modernization. Nightwing brings disruptive technologies, agility, and competitive offerings to customers in th

Site Reliability Engineer

General Dynamics

Remote or Aurora, Colorado, USA

Full-time

Basic Qualifications Bachelor's degree in Computer Science, a related field or equivalent experience is required plus a minimum of 5 years of relevant experience; or Master's degree plus 3 years of relevant experience. CLEARANCE REQUIREMENTS: Department of Defense TS/SCI security clearance is required at time of hire. Applicants selected will be subject to a U.S. Government security investigation and must meet eligibility requirements for access to classified information. Due to the nature of

Site Reliability Engineer

McKesson Corporation

Remote or Columbus, Ohio, USA

Full-time

McKesson is an impact-driven, Fortune 10 company that touches virtually every aspect of healthcare. We are known for delivering insights, products, and services that make quality care more accessible and affordable. Here, we focus on the health, happiness, and well-being of you and those we serve - we care. What you do at McKesson matters. We foster a culture where you can grow, make an impact, and are empowered to bring new ideas. Together, we thrive as we shape the future of health for patien

IT Engineer IV - Site Reliability Engineer

Kforce Technology Staffing

Reston, Virginia, USA

Contract

RESPONSIBILITIES: Kforce has a client that is seeking an IT Engineer IV - Site Reliability Engineer in Reston, VA. Duties Include: * Design and implement observability strategies using OpenTelemetry for distributed tracing, metrics, and logging * Instrument microservices written in Java and Python using Otel SDKs and auto-instrumentation tools * Develop and maintain Splunk dashboards, alerts, and reports to provide actionable insights into system performance and reliability * Collaborate with d

Site Reliability Engineer II - Real-Time

Esri

Vienna, Virginia, USA

Full-time

Overview Join us to work collaboratively with our talented team of dynamic and passionate engineers to deliver capabilities that enable our customers to make a difference. You'll deploy and operate ArcGIS Velocity and ArcGIS Workflow Manager SaaS solutions. You will also have the opportunity to design, deploy, and operate next-generation real-time and big data GIS software-as-a-service (SaaS) capabilities for thousands of cloud users worldwide. Our teams have a broad mix of experience levels a

Principal Site Reliability Engineer (Prisma Access)

Palo Alto Networks

Reston, Virginia, USA

Full-time

Company Description Our Mission At Palo Alto Networks everything starts and ends with our mission: Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we're looking for innovators who are as committed to shaping the future of cybersecurity as we are. Who We Are We take our mission of

Site Reliability Engineer (US Federal)

Workday, Inc.

McLean, Virginia, USA

Full-time

Your work days are brighter here. At Workday, it all began with a conversation over breakfast. When our founders met at a sunny California diner, they came up with an idea to revolutionize the enterprise software market. And when we began to rise, one thing that really set us apart was our culture. A culture which was driven by our value of putting our people first. And ever since, the happiness, development, and contribution of every Workmate is central to who we are. Our Workmates believe a h