Overview
On Site
Depends on Experience
Contract - W2
Contract - Independent
Skills
Reliability Engineering
Microsoft Azure
DevOps
Job Details
Title: Site Reliability Engineer (SRE) Modernization Site Engineer (SE)
Location: Hybrid On-site in Plano, TX | Portsmouth, NH | Seattle, WA | Indianapolis, IN
Responsibilities
- Lead the design and implementation of observability frameworks across applications and infrastructure.
- Define Service Level Indicators (SLIs) and Service Level Objectives (SLOs) in alignment with business and operational requirements.
- Develop and maintain dashboards, alerts, and visualization tools to support proactive monitoring and troubleshooting.
- Partner with development teams to embed reliability engineering best practices into the software delivery lifecycle.
- Drive root cause analysis (RCA) and post-incident reviews to enhance system stability.
- Implement automation to improve system health monitoring and reduce manual operational tasks.
- Collaborate with cross-functional teams to support modernization and cloud adoption initiatives.
Required Qualifications
- 7+ years of professional experience in Site Reliability Engineering, DevOps, or related fields.
- Strong knowledge of observability platforms (e.g., Prometheus, Grafana, Splunk, Datadog, New Relic, AppDynamics).
- Experience defining and implementing SLI/SLO frameworks in production environments.
- Proficiency in cloud environments (AWS, Azure, or Google Cloud Platform).
- Solid understanding of container orchestration (Kubernetes, OpenShift, Docker).
- Hands-on experience with infrastructure-as-code tools (Terraform, Ansible, etc.).
- Strong scripting skills in Python, Bash, or similar languages.
- Proven ability to work in a hybrid on-site environment and collaborate with distributed teams.
Preferred Skills
- Background in application modernization projects.
- Familiarity with CI/CD pipelines and release automation tools (Jenkins, GitLab CI, etc.).
- Experience in high-availability and large-scale distributed systems.
- Strong analytical skills and ability to troubleshoot complex system issues.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.