Senior Observability Engineer (SRE/DevOps)

  • Posted 8 days ago | Updated 8 days ago

Overview

Remote
Hybrid
$60-70
Accepts corp to corp applications
Contract - W2
Contract - 28 day((s))

Skills

Python
Terraform
Kubernetes
Prometheus
Ansible
Grafana
Splunk.
cloud infrastructure
networking (TCP/IP)

Job Details

Job Summary (List Format): Senior Observability Engineer (SRE/DevOps)

- Design, implement, and enhance monitoring solutions using Prometheus for robust system reliability and alerting.
- Develop and maintain comprehensive observability strategies for cloud-native and distributed systems.
- Collaborate with DevOps, SRE, and application teams to integrate monitoring into CI/CD pipelines and the overall software development lifecycle.
- Respond to incidents, perform root cause analysis, and implement preventive measures to ensure system stability.
- Partner with cross-functional teams to continuously improve system performance, scalability, and reliability.
- Stay updated on industry best practices in observability, site reliability engineering, and cloud infrastructure.
- Work extensively with tools such as Prometheus, Grafana, Splunk, and Kubernetes (preferably Gardener).
- (Preferred) Utilize experience with Dynatrace, APM tools, and Linux administration for enhanced monitoring capabilities.
- Apply strong scripting and automation skills (Python, Terraform, Ansible, or similar technologies).
- Demonstrate excellent analytical, troubleshooting, and communication skills in a collaborative environment.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About NuLeap