Site Reliability Engineer (SRE) / DevOps Engineer

• Posted 15 hours ago • Updated 5 hours ago
Full Time
Part Time
Fitment

Job Details

Skills

  • IaaS
  • FOCUS
  • Scalability
  • Continuous Integration
  • Continuous Delivery
  • Scripting
  • Programming Languages
  • Orchestration
  • Systems Architecture
  • Documentation
  • Dashboard
  • Systems Engineering
  • Reliability Engineering
  • Linux
  • Unix
  • Scripting Language
  • Python
  • Java
  • Bash
  • Amazon Web Services
  • Google Cloud Platform
  • Google Cloud
  • Cloud Computing
  • Docker
  • Kubernetes
  • Management
  • Budget
  • Incident Management
  • Root Cause Analysis
  • Regulatory Compliance
  • DevOps
  • Workflow

Summary

Location: Austin, Texas

Hybrid

Job Description:

We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) / DevOps Engineer to join our team. The ideal candidate will have a strong background in systems engineering, cloud infrastructure, and distributed systems, with a focus on the reliability, scalability, and performance of production environments.



Key Responsibilities:

  • Design, build, and maintain highly available, scalable, and distributed systems.
  • Manage and optimize cloud-based infrastructure (AWS or Google Cloud Platform).
  • Implement and maintain CI/CD pipelines and DevOps best practices.
  • Monitor system performance, availability, and reliability using modern observability tools.
  • Define and manage SLIs, SLOs, and error budgets to ensure service reliability.
  • Handle incident management, perform root cause analysis (RCA), and drive postmortems.
  • Automate infrastructure and operational processes using scripting and programming languages.
  • Work with containerization and orchestration tools like Docker and Kubernetes.
  • Integrate security and compliance into system architecture and workflows.
  • Develop and maintain documentation including runbooks, dashboards, and operational standards.



Required Qualifications:

  • 8+ years of experience in Systems Engineering, DevOps, or Site Reliability Engineering (SRE) roles.
  • Strong expertise in Linux/Unix systems and system internals.
  • Proficiency in at least one programming/scripting language (Python, Go, Java, or Bash).
  • Experience designing and operating highly available distributed systems.
  • Hands-on experience with cloud platforms (AWS or Google Cloud Platform) and cloud-native services.
  • Experience with Docker and Kubernetes.
  • Strong understanding of monitoring, alerting, and logging frameworks.
  • Experience managing SLIs, SLOs, and error budgets.
  • Knowledge of incident management, RCA, and postmortem practices.
  • Experience incorporating security and compliance into DevOps workflows.

  • Dice Id: 90970922
  • Position Id: SYS - 4545-3762-1775485776