Site Reliability Engineer

  • Phoenix, AZ
  • Posted 21 days ago | Updated 4 days ago

Overview

Hybrid
Depends on Experience
Contract - Independent
Contract - W2
Contract - 12 Month(s)

Skills

SRE
Site Reliability
Grafana

Job Details

Job Title: Site Reliability Engineer

Location: Phoenix, AZ

Duration: 6 Months+

Responsibilities:

  • Expert in observability and SRE principles, including SLI, SLO, and SLA definition and management
  • Experienced with the Grafana stack and other application performance management (APM) tools and frameworks such as the Elastic Stack and AppDynamics (AppD)
  • Expert in implementing SRE observability, including instrumentation for metrics, logs, and traces
  • Expertise in Docker & Kubernetes is required.
  • Excellent understanding of microservices architecture, design patterns, and standard methodologies, with an eye towards scale, automation, resiliency, and high availability
  • Prior experience with high-volume distributed architectures that have a high cost of failure, i.e. a focus on reliability and availability
  • Experienced with telemetry tooling and observability systems such as: Jaeger, Prometheus, OpenTracing, OpenTelemetry, AppDynamics, Splunk, Datadog, New Relic, Lightstep, Grafana.
  • Some experience with Big Data technologies such as: Elasticsearch, NoSQL stores, Kafka, columnar databases, dataflow or pipeline systems, graph data stores.
  • Expert in performance analysis, resilience, chaos engineering, FMEA, scalability, and high availability, using tools such as JProfiler and thread dump analyzers
  • Experience leveraging common infrastructure services such as an enterprise message bus, configuration services, toggles, logging systems, and telemetry for observability (e.g. OpenTelemetry)
  • Strong experience with ServiceNow ITOM and ITSM modules, with a focus on Discovery, Event Management, and Incident, Problem, and Change Management
  • Strong sense of architecture and design for fault tolerance, scale-out approaches, and stability, with hands-on practice of the Azure Well-Architected Framework or the equivalent on another cloud such as Google Cloud Platform
  • Experience with emerging technologies such as machine learning and AIOps is a plus.

Warm Regards,

Dev Raj | Delivery Head - Talent Acquisition

Tecktiva, LLC