Senior Site Reliability Engineer

Overview

Remote
Depends on Experience
Full Time
No Travel Required

Skills

Apache Kafka
Ansible
Elasticsearch
Kibana
Linux
Terraform
Ubuntu
Scala
Ruby
Python

Job Details

Job Description:

  • Design, deploy, and scale our Prometheus architecture to handle 100+ million active series and beyond.
  • Deploy and operate large, high-performance Elasticsearch clusters holding 2000+TB of data.
  • Deploy and grow high-throughput data pipelines built on Kafka, handling hundreds of thousands of events per second.
  • Design and build an alerting system that allows engineering teams to construct alerts from multiple data sources and alerting workflows.
  • Write libraries and APIs that give engineers self-service access to our monitoring, logging, and other observability systems.
  • Use Terraform to deploy public and private cloud infrastructure.

Job Responsibilities:

  • Experience designing, deploying and operating mid to large-sized distributed systems on VMs or bare metal machines running Linux (we run Debian and Ubuntu).
  • Experience developing with languages like Ruby, Python, Go, Scala, or Bash.
  • Excited by the challenge of solving difficult problems in large distributed systems that deal with huge amounts of data.
  • Want to work on a highly autonomous team that cares deeply about quality and customer experience.
  • Understand the value of observability and can work with other teams to help them better monitor their services.
  • Are willing to be part of a production on-call rotation.
  • Have direct experience with the following technologies (or similar): Elasticsearch, Logstash, Kibana (ELK) stack, Kafka, PrometheThanos/Cortex, Graphite, Ansible, Terraform, Consul.
  • Have strong experience in building out solutions based on Software engineering best practices.

Education: Bachelor's or Master s degree in Computer Science, Computer or Electrical Engineering, Mathematics, or a related field.

GlobalLogic estimates the starting pay range for this role to be performed remotely and the salary range will be $130,000/yr to $135,000/yr and reflects base salary only. This pay range is provided as a good-faith estimate, and the amount offered may be higher or lower. GlobalLogic takes many factors into consideration in making an offer, including candidate qualifications, work experience, operational needs, travel and onsite requirements, internal peer equity, prevailing wage, responsibilities, and other market and business considerations.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About GlobalLogic Inc.