Site Reliability Engineer

  • Washington D.C., DC
  • Posted 4 hours ago | Updated 4 hours ago

Overview

Hybrid
Up to $50
Contract - Independent
Contract - W2
Contract - 12 Month(s)
No Travel Required

Skills

Site Reliability Engineer
devops
AWS
docker
NoSQL
ITSM

Job Details

Job Title: IT Infrastructure Analyst Mid / Junior SRE Engineer
Location: Washington DC / Hybrid
Duration: 12+ Months
Education and Experience
  • Bachelor s degree in computer science, Engineering, or a related field
  • 1 3 years of combined experience in SRE, DevOps, and infrastructure engineering roles.
  • Proficiency in Python or other scripting languages.
  • Hands-on experience with cloud platforms (AWS preferred).
  • Familiarity with container technologies such as Docker, ECS, or Kubernetes.
  • Solid understanding of Linux systems and networking fundamentals.
  • Experience with relational, cloud-native, and NoSQL databases.
  • Support production systems outside standard business hours as needed.
Deployment & Automation
  • Design and implement CI/CD pipelines using GitHub Actions, AWS CodePipeline, and Jenkins.
  • Automate infrastructure provisioning using Infrastructure-as-Code (IaC) tools such as Terraform, AWS CloudFormation, or AWS CDK.
  • Develop automation scripts and self-service tools to streamline operations and reduce manual effort.
Capacity Planning & Performance Optimization
  • Lead cost optimization initiatives across cloud environments.
  • Configure and manage auto-scaling policies and performance thresholds.
  • Design and execute resiliency test plans, and support performance testing and benchmarking.
Incident Management & Operational Response
  • Apply ITIL principles and utilize ITSM platforms like ServiceNow for incident tracking and resolution.
  • Participate in production on-call rotations with strong diagnostic and troubleshooting skills.
  • Author Root Cause Analysis (RCA) reports and knowledge base articles
  • Implement SRE best practices including SLOs, and error budgets
Observability & Monitoring.
  • Leverage observability platforms such as Dynatrace, AppDynamics, ELK Stack, and similar tools.
  • Build and optimize alerts, dashboards, and anomaly detection using Dynatrace and Kibana.
Security & Compliance
  • Manage service accounts, access controls, and permission policies.
  • Create, deploy, and maintain digital certificates and encryption assets.
  • Respond to security incidents; support continuous security compliance.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.