Overview
Skills
Job Details
Role : Site Reliability Engineer
Location : Onsite (5 days/week) Raleigh, NC
Job Description:Work Authorization: or
Experience : 10+ years required
Required Skills:
-
Site Reliability Engineering (SRE) experience
-
Automation & Scripting: Python, Go, Bash
-
Cloud: Azure
-
Infrastructure-as-Code: Terraform, Ansible
-
Linux (RHEL7+) / Windows Server (2019+)
-
Networking & Storage: NFS, SAN, NAS
-
Authentication: DNS, LDAP, Kerberos, Centrify
Requirements & Responsibilities:
-
Proven expertise in Site Reliability Engineering, with a background in software engineering, infrastructure, or operations.
-
Hands-on experience with cloud platforms (e.g., Azure), operating systems (Linux RHEL7+, Windows 2019+), and networking fundamentals.
-
Strong understanding of networking and storage technologies (NFS, SAN, NAS).
-
Working knowledge of authentication and naming services (DNS, LDAP, Kerberos, Centrify).
-
Proficiency in scripting and automation (Python, Go, Bash).
-
Practical experience with infrastructure-as-code tools (Terraform, Ansible).
-
Ability to define and manage SLIs, SLOs, SLAs and reduce operational toil.
-
Experience integrating with observability platforms for system visibility.
-
Metrics-driven and automation-focused mindset toward improving system reliability.
-
Calm under pressure during incidents/outages with strong incident-response skills.
-
Excellent collaboration and communication skills across engineering and business teams.
-
Proactive and ownership-driven approach to improving systems and processes.