Senior Site Reliability Engineer ( Only Locals to Texas F2F interview at Austin, Texas)

• Posted 17 hours ago • Updated 4 hours ago
Full Time
On-site
Fitment

Dice Job Match Score™

✨ Finding the perfect fit...

Job Details

Skills

  • Cost-benefit Analysis
  • Forms
  • Scheduling
  • Scalability
  • Software Engineering
  • Service Level
  • Systems Engineering
  • DevOps
  • Reliability Engineering
  • Linux
  • Unix
  • Scripting
  • Python
  • Java
  • Bash
  • Amazon Web Services
  • Google Cloud Platform
  • Google Cloud
  • Cloud Computing
  • Orchestration
  • Docker
  • Kubernetes
  • Management
  • Budget
  • Incident Management
  • Root Cause Analysis
  • Regulatory Compliance
  • Workflow
  • Grafana
  • Splunk
  • CHAOS
  • Testing
  • Documentation
  • Dashboard

Summary

Job Title: Site Reliability Engineer

Duration: 4 Months

Location: Austin, TX

In Person Interview at Austin, Texas

Job Summary:

8 or more years of experience, relies on experience and judgment to plan and accomplish goals, independently performs a variety of complicated tasks, a wide degree of creativity and latitude is expected.

Understands business objectives and problems, identifies alternative solutions, performs studies and cost/benefit analysis of alternatives. Analyzes user requirements, procedures, and problems to automate processing or to improve existing computer system: Confers with personnel of organizational units involved to analyze current operational procedures, identify problems, and learn specific input and output requirements, such as forms of data input, how data is to be; summarized, and formats for reports. Writes detailed description of user needs, program functions, and steps required to develop or modify computer program. Reviews computer system capabilities, specifications, and scheduling limitations to determine if requested program or program change is possible within existing system.

Site Reliability Engineer will be responsible for ensuring the reliability, availability, performance, and scalability of production systems by applying software engineering practices to infrastructure and operations. Partners with development teams to build resilient, observable, and automated platforms that meet defined service level objectives (SLOs).

Required Skills and Experience:
  • 8 years of experience in systems engineering, DevOps, or site reliability engineering roles
  • 8 years of strong experience with Linux/Unix systems and system internals
  • 8 years of proficiency in one or more programming/scripting languages (Python, Go, Java, Bash)
  • 8 years of experience designing and operating highly available, distributed systems
  • 8 years of strong knowledge of cloud platforms (AWS or Google Cloud Platform) and cloud-native services
  • 8 years of experience with containerization and orchestration (Docker, Kubernetes)
  • 8 years of strong understanding of monitoring, alerting, and logging concepts
  • 8 years of experience defining and managing SLIs, SLOs, and error budgets
  • 8 years of familiarity with incident management, root cause analysis (RCA), and postmortems
  • 8 years of experience integrating security and compliance into operational workflows

Preferred Experience:
  • 4 years of familiarity with observability tools (Prometheus, Grafana, Application Insights, Datadog, Splunk)
  • 4 years of experience operating 24x7 production environments with on-call rotations
  • 4 years of experience with chaos engineering and resiliency testing
  • 4 years of experience with feature flags, canary deployments, and progressive delivery
  • 4 years of strong documentation skills for runbooks, dashboards, and operational standards
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: RTL939169
  • Position Id: a381861114c6e2c7aedcba4dbe3124e8
  • Posted 17 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Austin, Texas

Today

Full-time

Depends on Experience

Hybrid in Austin, Texas

2d ago

Easy Apply

Full-time

80 - 90

Hybrid in Austin, Texas

4d ago

Easy Apply

Contract, Third Party

Depends on Experience

Austin, Texas

Today

Full-time

Compensation information provided in the description

Search all similar jobs