Apply Now

Site Reliability Engineer (SRE) Vulnerability Management, Observability & Server Patching

Hybrid in Seattle, WA, US • Posted 2 days ago • Updated 2 days ago

Contract W2

Hybrid

$20 - $80/hr

Fitment

Dice Job Match Score™

🛠️ Calibrating flux capacitors...

Job Details

Skills

Dragon NaturallySpeaking
Continuous Improvement
Continuous Integration
DNS
Dashboard
Microsoft Operating Systems
Microsoft Windows Server
Operating Systems
Operational Excellence
Linux
Load Balancing
MEAN Stack
Management
Microsoft Azure
Docker
Firewall
ITIL
Incident Management
Scripting
Service Level
Regulatory Compliance
Reliability Engineering
Root Cause Analysis
TCP/IP
Issue Resolution
Kubernetes
Operational Risk
Python
Qualys
Reporting
Bash
Collaboration
Communication
Computer Networking
Continuous Delivery
Vulnerability Management
Windows PowerShell
Workflow

Summary

Position: - Site Reliability Engineer (SRE) Vulnerability Management, Observability & Server Patching

Location: - Hybrid - 4 days office in Seattle WA

Duration: Contract

Rate: DOE

Role Overview

This role is responsible for ensuring the security, reliability, and operational excellence of server infrastructure through proactive vulnerability management, effective server patching, and robust observability practices. The SRE will leverage platforms such as Brinqa for vulnerability aggregation and prioritization, and Datadog for monitoring, alerting, and service observability.

The ideal candidate will work closely with engineering, security, and application teams to identify and remediate risks, execute patching strategies, and continuously improve system visibility, reliability, and compliance.

Key Responsibilities

Vulnerability Management

Manage and continuously improve the enterprise vulnerability management program using Brinqa for aggregation, prioritization, and reporting.
Identify, analyze, and assess vulnerabilities across server infrastructure, including operating systems, applications, and supporting components.
Partner with security, infrastructure, and application teams to prioritize remediation efforts based on risk and business impact.
Ensure adherence to corporate security policies, regulatory requirements, and industry best practices.

Server Patching & Remediation

Plan, schedule, and execute server patching activities for operating systems and third-party software.
Track patch compliance and remediation metrics, including mean time to patch (MTTP).
Develop and maintain automation scripts and tooling to streamline patching workflows and improve efficiency.
Reduce operational risk by standardizing patching processes and minimizing service disruption.

Observability & Reliability

Maintain and enhance observability of supported services using Datadog.
Design and implement effective monitoring, alerting, and dashboards to improve service reliability and operational awareness.
Define and measure service-level indicators (SLIs), service-level objectives (SLOs), and success metrics.
Analyze incidents and trends to drive continuous improvement in system reliability and performance.

Collaboration & Operations

Collaborate with application owners, platform teams, and other stakeholders to support core SRE and operational objectives.
Provide guidance and best practices related to reliability, security, and operational resilience.
Support incident response, root cause analysis, and post-incident reviews where applicable.

Skills & Qualifications

Strong hands-on experience with server operating systems (Windows Server, Linux) and patching methodologies.
Solid understanding of vulnerability management frameworks, risk-based prioritization, and remediation practices.
Experience with vulnerability management tools such as Brinqa, Qualys, or similar platforms.
Proven experience implementing observability solutions using Datadog.
Experience working in on-premise and Microsoft Azure environments.
Hands-on experience with containerized applicationsusing Docker and Kubernetes (K8s).
Experience with CI/CD pipelines, including GitOps-based deployments using ArgoCD.
Proficiency in automation and scripting (e.g., Python, PowerShell, Bash).
Experience supporting on-call rotations, incident response, and production issue resolution.
Good knowledge of networking concepts, including TCP/IP, DNS, load balancing, firewall rules, and troubleshooting connectivity issues.
Familiarity with ITIL conceptsand operational best practices.
Strong communication and cross-team collaboration skills.
Ability to work independently, manage multiple priorities, and operate effectively in a fast-paced environment.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 10398358
Position Id: 8945715
Posted 2 days ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Site Reliability Engineer (SRE)

Seattle, Washington

•

2d ago

Site Reliability Engineer (SRE) Vulnerability Management, Observability & Server Patching Seattle WA Role Overview This role is responsible for ensuring the security, reliability, and operational excellence of server infrastructure through proactive vulnerability management, effective server patching, and robust observability practices. The SRE will leverage platforms such as Brinqa for vulnerability aggregation and prioritization, and Datadog for monitoring, alerting, and service observab

Easy Apply

Full-time, Contract

Site Reliability Engineer (SRE)

Hybrid in Seattle, Washington

•

2d ago

Site Reliability Engineer (SRE) Vulnerability Management, Observability & Server Patching Role Overview This role is responsible for ensuring the security, reliability, and operational excellence of server infrastructure through proactive vulnerability management, effective server patching, and robust observability practices. The SRE will leverage platforms such asBrinqafor vulnerability aggregation and prioritization, andDatadogfor monitoring, alerting, and service observability. The ideal can

Easy Apply

Contract

Depends on Experience

Site Reliability Engineer - W2 Only

Hybrid in Seattle, Washington

•

2d ago

Title: Site Reliability Engineer Duration: 12 Month Location: Seattle, WA (Onsite 4 Days office) Site Reliability Engineer (SRE) Vulnerability Management, Observability & Server Patching Role Overview: This role is responsible for ensuring the security, reliability, and operational excellence of server infrastructure through proactive vulnerability management, effective server patching, and robust observability practices. The SRE will leverage platforms such as Brinqa for vulnerability aggreg

Easy Apply

Contract

Depends on Experience

Site reliability Engineer

Hybrid in Bellevue, Washington

•

16d ago

Job Summary As a Site reliability Engineer, you will be responsible for designing, implementing, and maintaining cloud infrastructure across AWS, Google Cloud Platform, and/or Azure. You will work closely with engineering and data teams to support scalable applications and data platforms while ensuring reliability, security, and cost optimization. Key ResponsibilitiesDesign, provision, and manage cloud infrastructure across AWS, Google Cloud Platform, and/or AzureBuild and maintain Infrastruct

Easy Apply

Contract

Depends on Experience

Search all similar jobs