Site Reliability Engineer - SRE - Primesoft Consulting Services Inc

Overview

On Site

Depends on Experience

Full Time

No Travel Required

Skills

Amazon Web Services

Apache Velocity

Cost Management

DevSecOps

Good Clinical Practice

Google Cloud Platform

Grafana

Kubernetes

Machine Learning (ML)

Microsoft Azure

Storage

Job Details

Looking for a Site Reliability Engineer (SRE) with 8+ years of experience.

No C2C; W2 only.

Job Duties:

Implementing advanced monitoring (Prometheus, Grafana, Datadog, ELK), tracing, logging, and automated alerting solutions.
Scaling distributed systems, optimising compute/storage efficiency, and cost management.
Designing, implementing, deploying, and running highly available, fault-tolerant, auto-scaling, and auto-healing systems.
Deep expertise in AWS, Azure, and Google Cloud Platform, including Kubernetes and managed container services (EKS, ECS, Fargate, GKE) and serverless architectures.
Driving reliability best practices across engineering teams, embedding SRE principles into the DevSecOps lifecycle. Partnering with engineering, security, and product teams to balance reliability and feature velocity.
Expertise in CIAM and the ForgeRock stack.
Optimising deployment pipelines for high-frequency, zero-downtime releases.
Leveraging machine learning for anomaly detection, predictive scaling, and automated remediation.
