Senior System Administrator / Site Reliability Engineer (Unix / Linux)

Overview

On Site
$50 - $60
Full Time

Skills

Linux
Unix
AWS
Azure
Ansible
Terraform
Kubernetes
CI/CD
Grafana
Prometheus
DevOps

Job Details

Job Title: Senior System Administrator / Site Reliability Engineer (Unix / Linux)
Location: Austin, TX (Onsite)
Duration: Long Term

Mandatory Skills: Linux/Unix Administration, AWS, Azure, Ansible, Terraform, Kubernetes, CI/CD (GitHub Actions/Jenkins), Prometheus/Grafana, SRE/DevOps Practices
Overall Experience: 10+ Years

Job Description:
The Senior System Administrator / Site Reliability Engineer (SRE) is responsible for ensuring high availability, reliability, and performance of mission-critical systems within the VA's Enterprise Cloud. This role combines system administration, automation, and DevOps practices to support hybrid cloud environments (AWS, Azure, OCI).

Responsibilities:
Monitor, maintain, and optimize system performance, availability, and scalability.
Automate deployments, configuration, and monitoring using Ansible, Terraform, or Puppet.
Manage Infrastructure as Code (IaC) for consistent multi-environment setups.
Lead incident response, perform root cause analysis, and implement postmortem actions.
Build and maintain observability frameworks using Prometheus, Grafana, Datadog, or Splunk.
Support CI/CD pipelines and improve delivery automation for reliability.
Design and operate highly available, fault-tolerant, and self-healing systems.
Enforce security and compliance standards (VA 6500, NIST 800-53, FedRAMP, Zero Trust).
Collaborate with cross-functional teams to integrate SRE best practices into the SDLC.
Participate in on-call rotations and drive proactive performance optimization.

Qualifications:
10+ years in SRE, DevOps, or Systems Engineering roles.
Strong Linux/Unix system administration expertise.
Proficient in AWS and Azure (production workload deployment).
Hands-on with containers/orchestration (Docker, Kubernetes, EKS/AKS).
Experience with automation, IaC, and observability tools.
Excellent troubleshooting and communication skills.

Mandatory Certifications:
AWS Certified SysOps Administrator or DevOps Engineer.
Azure Administrator or DevOps Engineer Expert.
Certified Kubernetes Administrator (CKA).

Education:
Bachelor's degree in Computer Science, Electronics Engineering, or related field (or 13+ years equivalent experience).

Thanks & Regards,
Shanmukha

About Taproot Solutions