Site Reliability Engineer

Remote • Posted 1 day ago • Updated 11 hours ago
Full Time
Remote
USD $87,100.00 - 157,450.00 per year
Fitment

Dice Job Match Score™

👤 Reviewing your profile...

Job Details

Skills

  • IT Operations
  • Software Engineering
  • Continuous Integration
  • Continuous Delivery
  • Release Management
  • Scalability
  • Real-time
  • KPI
  • ROOT
  • ProVision
  • Amazon S3
  • Computer Hardware
  • Computer Networking
  • Management
  • Incident Management
  • Cyber Security
  • Computer Science
  • Reliability Engineering
  • DevOps
  • Kubernetes
  • Terraform
  • Security Clearance
  • Root Cause Analysis
  • Disaster Recovery
  • Regulatory Compliance
  • Cloud Computing
  • Optimization
  • Recruiting
  • Market Analysis
  • Law

Summary

Leidos is seeking a Site Reliability Engineer as part of our DevOps team in support of a large-scale, complex Software program within the Department of Justice. This role will ensure the applications are reliable, scalable, and efficient. This role will act as the bridge between development and IT operations, applying software engineering principles to automate infrastructure tasks, improve system reliability, and optimize performance.

Responsibilities:

Automate operations, CI/CD, and release management to ensure system reliability and scalability.

Monitor system health, performance, and capacity in real-time, proactively addressing issues.

Implement monitoring and alerting systems for rapid incident response, in accordance with ATF SLAs or KPIs.

Conduct post-incident reviews to identify root causes and drive remediation efforts.

Manage OpenShift/Kubernetes clusters and define application-level infrastructure using Terraform.

Analyze historical data to predict and provision future infrastructure needs.

Support application-level infrastructure (DBs, S3, IAM) while interfacing with the hardware/networking team, project capacity and utilization.

Improve system stability by managing, monitoring, and scaling services, focusing on reducing manual work and responding to incidents.

Provide rotational on-call coverage, support incident response and conduct RCA for outages

Qualifications:
  • Bachelor's degree in Cybersecurity, Computer Science, or related field with 5 years of experience. Additional years of experience may be substituted in lieu of degree.
  • 3+ years of experience in site reliability engineering or DevOps.
  • Proficient in Kubernetes, OpenShift, Terraform, and observability tools.
  • Must be able to obtain and maintain a public trust clearance.

Preferred Skills:

Experience with root cause analysis (RCA) and disaster recovery execution.

Familiarity with compliance evidence collection and risk-based release gating.

Knowledge of cloud cost optimization and cross-team escalation processes.

If you're looking for comfort, keep scrolling. At Leidos, we outthink, outbuild, and outpace the status quo - because the mission demands it. We're not hiring followers. We're recruiting the ones who disrupt, provoke, and refuse to fail. Step 10 is ancient history. We're already at step 30 - and moving faster than anyone else dares.

Original Posting:
March 11, 2026

For U.S. Positions: While subject to change based on business needs, Leidos reasonably anticipates that this job requisition will remain open for at least 3 days with an anticipated close date of no earlier than 3 days after the original posting date as listed above.

Pay Range:
Pay Range $87,100.00 - $157,450.00

The Leidos pay range for this job level is a general guideline only and not a guarantee of compensation or salary. Additional factors considered in extending an offer include (but are not limited to) responsibilities of the job, education, experience, knowledge, skills, and abilities, as well as internal equity, alignment with market data, applicable bargaining agreement (if any), or other law.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: SCNCAPI2
  • Position Id: 5e4ca3de375cab091e76db32dedc376f
  • Posted 1 day ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Remote

Today

Full-time

USD 73,450.00 - 132,775.00 per year

Remote

Today

Easy Apply

Full-time

$120000 - $145000

Remote

2d ago

Easy Apply

Full-time

Depends on Experience

Remote

Today

Easy Apply

Full-time

$110000 - $150000

Search all similar jobs