Site Reliability Engineer (LOCALS Preferred)

Overview

On Site

Depends on Experience

Contract - W2

Contract - Independent

Contract - 12 Month(s)

Skills

Amazon Web Services

Kubernetes

Python

C++

Site Reliability

Job Details

About the role

We're making big foundational cloud infrastructure changes to make the experience faster, more reliable, and more scalable for our customers' workloads. This role will be responsible for helping to build, maintain, and operate our new dynamic cloud infrastructure that powers all services.

About the day to day

Design and implement systematic improvements to cloud infrastructure and Engine provisioning services to make them fast, reliable, scalable, and cost-efficient.
Collaborate with development teams across the company to improve service reliability, scalability, and developer productivity.
Together with an engineering team, you will share an on-call rotation and be an escalation contact for service and cloud infrastructure incidents.

REQUIREMENTS

BS degree in Computer Science, Engineering, or a related field, or equivalent experience
3+ years of hands-on experience as a Site Reliability Engineer
3+ years of production experience with Kubernetes, including using open-source solutions from the ecosystem
3+ years of proven experience as a professional developer of production software
Development experience in an object-oriented programming language. We develop in Go, C++, and some Python here and there. Experience with these languages is a plus. You are willing to understand and make cross-cutting changes in the Firebolt codebase regardless of the language.
Hands-on experience in building and operating cloud-native applications on AWS, Google Cloud Platform, or Azure.
Strong Linux fundamentals and an understanding of networking, including a variety of network protocols
Experience building and operating highly concurrent, highly available, and fault-tolerant distributed systems

A bonus if you have

Understanding of application security in a cloud environment
Experience working with service mesh and multi-cluster mesh infrastructure (Cilium)
Experience in monitoring a variety of application types with a modern Prometheus-compatible observability stack
Experience working with CI/CD pipelines such as GitHub Actions
Experience working with ArgoCD, Terraform, FoundationDB, Kafka, and Kubernetes operators

About Cloud Destinations LLC

Share