Site Reliability Engineer (LOCALS Preferred)

Overview

On Site
Depends on Experience
Contract - W2
Contract - Independent
Contract - 12 Month(s)

Skills

Amazon Web Services
Kubernetes
Python
C++
Site Reliability

Job Details

About the role

We're making big foundational cloud infrastructure changes to make the experience faster, more reliable, and more scalable for our customers' workloads. This role will be responsible for helping to build, maintain, and operate our new dynamic cloud infrastructure that powers all services.

About the day to day
  • Design and implement systematic improvements to cloud infrastructure and Engine provisioning services to make them fast, reliable, scalable, and cost-efficient.
  • Collaborate with development teams across the company to improve service reliability, scalability, and developer productivity.
  • Share an on-call rotation with an engineering team and serve as an escalation contact for service and cloud infrastructure incidents.
REQUIREMENTS
  • BS degree in Computer Science, Engineering, or a related field, or equivalent experience
  • 3+ years of hands-on experience as a Site Reliability Engineer
  • 3+ years of production experience with Kubernetes, including using open source solutions from the ecosystem
  • 3+ years of proven experience as a professional developer of production software
  • Development experience in an object-oriented programming language. We develop in Go and C++, with some Python; experience with these languages is a plus. You should be willing to understand and make cross-cutting changes in the Firebolt codebase regardless of the language.
  • Hands-on experience building and operating cloud-native applications on AWS, Google Cloud Platform, or Azure
  • Strong Linux fundamentals and an understanding of networking, including a variety of network protocols
  • Experience building and operating highly concurrent, highly available, and fault-tolerant distributed systems
A bonus if you have
  • Understanding of application security in a cloud environment
  • Experience working with service mesh and multi-cluster mesh infrastructure (Cilium)
  • Experience monitoring a variety of application types with a modern Prometheus-compatible observability stack
  • Experience working with CI/CD pipelines such as GitHub Actions
  • Experience working with ArgoCD, Terraform, FoundationDB, Kafka, and Kubernetes operators

About Cloud Destinations LLC