SRE/ Site Reliablity Engineer - Observability (Need 15+ Years)

Overview

Remote
Depends on Experience
Contract - W2
Contract - 18 Month(s)

Skills

SRE
Argo CD
Kubernetes
Prometheus
Grafana
Python
AWS
Site Reliability

Job Details

Job Title: SRE Engineer - Observability

Location: San Jose, CA (Remote)

Mode: Contract

Job Description:

  • Knowledge of observability - monitoring and alerting
  • Experience with Kubermetheus Stack (Kubernetes, Prometheus, Loki, Grafana, Alert Manager)
  • Specifically, must be able to plan, build, test, and launch an observability platform from end-to-end
  • Experience with AWS
  • Proficiency in Python & GO
  • Recent portfolio projects will be required for review and interviews will include coding challenges
  • PagerDuty experience
  • Zoom Developer Platform experience a plus
  • Experience with CI/CD pipelines
  • Infrastructure as Code experience
  • Strong troubleshooting and problem-solving skills"
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.