Site Reliability Engineer (SRE

Remote • Posted 3 hours ago • Updated 3 hours ago
Contract W2
12 Months
No Travel Required
Remote
$55 - $60/hr
Fitment

Dice Job Match Score™

🧠 Analyzing your skills...

Job Details

Skills

  • Amazon Web Services
  • Analytical Skill
  • Auditing
  • Budget
  • Build Automation
  • Cloud Computing
  • Collaboration
  • Communication
  • Computer Networking
  • Computer Science
  • Conflict Resolution
  • Continuous Delivery
  • Continuous Integration
  • Continuous Monitoring
  • DevOps
  • DoD
  • FedRAMP
  • Grafana
  • IaaS
  • Incident Management
  • Information Technology
  • Kubernetes
  • Linux
  • Linux Administration
  • Management
  • Microsoft Azure
  • Problem Solving
  • Production Support
  • Python
  • Regulatory Compliance
  • Reliability Engineering
  • Root Cause Analysis
  • Scalability
  • Scripting
  • Service Level
  • Terraform
  • Workflow

Summary

Job Summary
The Site Reliability Engineer (SRE) – Kubernetes Platform will be responsible for supporting the development, deployment, and operations of Kubernetes-based platforms within highly regulated environments such as FedRAMP High and DoD IL5. This role focuses on ensuring platform reliability, scalability, security, and compliance while enabling product teams to deliver services efficiently.
The ideal candidate will have strong hands-on experience in Kubernetes, cloud infrastructure, automation, observability, and platform engineering, along with a solid understanding of compliance-driven environments.
Key Responsibilities
  • Design, implement, and operate Kubernetes platforms in FedRAMP High / IL5 environments.
  • Maintain day-to-day reliability, availability, and performance of platform services.
  • Build automation tools and operational workflows to improve efficiency and reduce manual intervention.
  • Define and monitor Service Level Indicators (SLIs), Service Level Objectives (SLOs), and error budgets.
  • Support compliance and security requirements, including audits and continuous monitoring.
  • Develop and maintain Infrastructure as Code (IaC) using tools such as Terraform.
  • Improve and support CI/CD pipelines and deployment automation.
  • Collaborate with Security, Platform, and Application teams to troubleshoot issues and deliver infrastructure enhancements.
  • Participate in on-call rotations, incident response, and production support.
  • Perform root cause analysis (RCA) and implement preventive solutions.
Required Qualifications
  • Bachelor’s degree in Computer Science, Information Technology, Engineering, or related field.
  • 4–6 years of experience in Site Reliability Engineering (SRE), DevOps, or Platform Engineering.
  • Strong production experience with Kubernetes.
  • Experience with cloud platforms such as Amazon Web Services (AWS)Microsoft Azure, or similar.
  • Solid understanding of Linux systems administration, networking, and container technologies.
  • Experience with Infrastructure as Code (IaC) tools like Terraform.
  • Proficiency in scripting/programming using PythonGo, or similar.
  • Experience with observability and monitoring tools such as Prometheus and Grafana.
  • Strong troubleshooting and incident management skills.
Preferred Qualifications
  • Experience working in FedRAMP HighDoD IL5, or other regulated environments.
  • Hands-on experience with Argo CD or similar deployment automation tools.
  • Knowledge of container security practices and security scanning tools.
  • Experience supporting audited, secure, or government-compliant systems.
  • Familiarity with GovCloud environments.
Required Technical Skills
  • Kubernetes Administration
  • Cloud Infrastructure (AWS / Azure)
  • Linux Administration
  • Networking & Security
  • Infrastructure as Code (Terraform)
  • CI/CD Pipelines
  • Monitoring & Observability
  • Automation & Scripting
  • Incident Management / Production Support
Soft Skills
  • Strong ownership and accountability
  • Excellent problem-solving and analytical skills
  • Effective communication and collaboration
  • Ability to work in structured, compliance-driven environments
  • Continuous learning mindset and proactive attitude
 
 
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 90769335A
  • Position Id: 9014724
  • Posted 3 hours ago
Contact the job poster
HC

Hema Chandiran

Recruiter @ Info Way Solutions
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Remote

Today

Easy Apply

Contract

Depends on Experience

Remote

Today

Easy Apply

Contract

40 - 60

Remote

Today

Easy Apply

Full-time, Part-time, Contract, Third Party

Remote or San Francisco, California

Today

Full-time

USD 114,297.00 - 235,319.00 per year

Search all similar jobs