Site Reliability Engineer

Hybrid in Austin, TX, US • Posted 6 hours ago • Updated 6 hours ago
Contract W2
No Travel Required
On-site
Depends on Experience
Fitment

Dice Job Match Score™

🛠️ Calibrating flux capacitors...

Job Details

Skills

  • Site Reliability Engineer
  • DevOps
  • systems engineering
  • site reliability engineering
  • Linux
  • Unix
  • Python
  • Go
  • Java
  • Bash
  • AWS
  • GCP
  • Docker
  • Kubernetes
  • SLIs
  • SLOs
  • error budgets
  • root cause analysis (RCA)
  • Prometheus
  • Grafana
  • Application Insights
  • Datadog
  • Splunk

Summary

Site Reliability Engineer will be responsible for ensuring the reliability, availability, performance, and scalability of production systems by applying software engineering practices to infrastructure and operations. Partners with development teams to build resilient, observable, and automated platforms that meet defined service level objectives (SLOs).

CANDIDATE SKILLS AND QUALIFICATIONS
Minimum Requirements:
Candidates that do not meet or exceed the minimum stated requirements (skills/experience) will be displayed to customers but may not be chosen for this opportunity.
YearsRequired/PreferredExperience
8Requiredexperience in systems engineering, DevOps, or site reliability engineering roles
8RequiredStrong experience with Linux/Unix systems and system internals
8RequiredProficiency in one or more programming/scripting languages (Python, Go, Java, Bash)
8RequiredExperience designing and operating highly available, distributed systems
8RequiredStrong knowledge of cloud platforms (AWS, or Google Cloud Platform) and cloud-native services
8RequiredExperience with containerization and orchestration (Docker, Kubernetes)
8RequiredStrong understanding of monitoring, alerting, and logging concepts
8RequiredExperience defining and managing SLIs, SLOs, and error budgets
8RequiredFamiliarity with incident management, root cause analysis (RCA), and postmortems
8RequiredExperience integrating security and compliance into operational workflows
4PreferredFamiliarity with observability tools (Prometheus, Grafana, Application Insights, Datadog, Splunk)
4PreferredExperience operating 24x7 production environments with on-call rotations
4PreferredExperience with chaos engineering and resiliency testing
4PreferredExperience with feature flags, canary deployments, and progressive delivery
4PreferredStrong documentation skills for runbooks, dashboards, and operational standards
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: atitx
  • Position Id: 8929039
  • Posted 6 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Austin, Texas

Today

Easy Apply

Third Party, Contract

$93.76/-

Austin, Texas

Today

Full-time

Depends on Experience

Austin, Texas

Today

Full-time

USD 130,000.00 - 170,000.00 per year

Austin, Texas

4d ago

Easy Apply

Third Party, Contract

45 - 50

Search all similar jobs