Sr. Site Reliability Engineer - U.S. Citizen - This role sits within Optum Serves Technology Product organization

Overview

Remote
$140,000 - $180,000
Full Time

Skills

Site Reliability
SRE
Cloud Platform
Cloud Infrastructure
Observability
Azure
Kubernetes
Terraform
Pulumi
Helm
ArgoCD
Flux
IaC
CI/CD
Splunk
Dynatrace
Grafana
Prometheus

Job Details

Job Title: Sr. Site Reliability Engineer
Location: Headquarters / Telecommute
Classification (HR only): Exempt Non-Exempt
Reports To (Title): COO Widescope Consulting and Contracting

JOB SUMMARY

The statements below are not intended to be all-inclusive of the duties and responsibilities of the position. Based on leadership decisions and business needs, all other duties as assigned will be expected for each position.Grafana

Widescope Consulting and Contracting is proud to serve our nation's military and Veterans. We support federal agencies in advancing the United States health care system and improving the overall health and well-being of those who serve or have served our country. Our health services are designed to help people live healthier lives.

The Cloud Platform Engineer will architect, develop, and maintain Widescope’s cloud environments across both commercial and government cloud platforms. This role collaborates closely with software engineers, solution architects, and DevOps teams to design and sustain a secure, resilient, and high-performance cloud infrastructure that meets the needs of our public and private sector clients.

Role Capabilities:

  • Build, maintain, and operate IaaS and PaaS infrastructure in Azure commercial and government clouds 
  • Work closely with dev teams to identify and measure SLOs, SLAs and SLIs 
  • Act a strong contributor to development of platform services including architecture, provisioning, configuration, deployment, and support  
  • Perform integrations with central logging, metrics dashboards, instrumentation, incident monitoring and management  
  • Build/integrate/administer systems and tools that enable engineering teams to observe their applications in production with autonomy (Dashboards, APMs). 
  • Support software and/or cloud-infrastructure in an on-call rotation basis 
  • Assist with identification and remediation of technical problems at the root cause by continuously implementing automation, self-healing, and real-time monitoring to production systems 
  • Maintain and improve operational tooling, frameworks,  
  • Build frameworks that test the performance and resiliency of our platform services/tools 
  • Automate alerts for metrics on performance, cost, vulnerabilities, risk, compliance violations 
  • Improve processes and champion automation of any manual items around support.  

JOB QUALIFICATIONS

Required:

 

  • 4 + years of experience working within a cloud engineer/SRE role 

 

  • Expert knowledge of a cloud service provider 
  • Expert knowledge and hands on production experience in Kubernetes (bare metal or managed) cluster setup and management required. 
  • Experience with infrastructure as code (IaC) tools like Terraform, Pulumi. 
  • Experience with Kubernetes deployment tools like Helm, ArgoCD, Flux 
  • Strong awareness of networking and internet protocols. 
  • Understanding of identity and access management (IAM) 
  • Experience supporting infrastructure in production cloud environments.  
  • Knowledge of Encryption, Public Key Infrastructure (PKI), understanding of OWASP 
  • Experience working with RESTful services 
  • Some experience with monitoring tools (Azure Monitor, Splunk, Dynatrace, Graphana, Prometheus). 
  • Familiarity with IDEs and Source Control tools like Visual Studio Code and Git. 
  • Must be a U.S. Citizen.

Preferred:

  • Bachelor’s Degree in Computer Science, Information Technology, Software Engineering, Math, Physics 
  • Master’s Degree with coursework focused on advanced algorithms, mathematics in computing, data structures or related field 
  • Expert knowledge of Azure  
  • Demonstrate passion about infrastructure automation 
  • Ability to prioritize work in a fast-paced environment

 

 

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About Widescope Consulting and Contracting Services