SRE | Remote | Contract | Only on W2

Remote • Posted 3 hours ago • Updated 3 hours ago

Contract W2

Remote

Depends on Experience

Fitment

Dice Job Match Score™

⏳ Almost there, hang tight...

Job Details

Skills

.NET
API
Analytical Skill
Analytics
Apache Kafka
Bridging
Budget
Cloud Computing
Collaboration
Communication
Computer Networking
Conflict Resolution
Continuous Delivery
Continuous Improvement
Continuous Integration
Cyber Security
Dashboard
Database
DevOps
Dynatrace
Enterprise Networks
Enterprise Services
FOCUS
Firewall
GitHub
Incident Management
Java
KPI
Kubernetes
Load Testing
Management
Mentorship
Microservices
Microsoft Azure
Microsoft SQL Server
Microsoft SQL Server DBA
Performance Analysis
Performance Tuning
Problem Solving
Productivity
React.js
Regulatory Compliance
Reliability Engineering
Root Cause Analysis
Scalability
Software Performance Management
Software Security
Splunk
Terraform
Testing
Virtual Machines
Workflow
Writing

Summary

Hi ,
Greeting from Healthcare Triangle !! We do have opening for our client, Role : SRE
Location : Remote
Duration : Long-term Contract
Job Description : Role Overview

We are seeking a highly skilled Site Reliability Engineer (SRE) to own the overall health, availability, performance, and resilience of our enterprise platform. The platform spans SQL Server, .NET, Java, React.js, Microservices, Kafka, and operates in a hybrid cloud environment on Azure and OnPremises.
The SRE will lead reliability engineering practices across the stack, manage infrastructure deployment pipelines using Terraform, drive application deployments through GitHub and Azure DevOps, ensure timely remediation of security vulnerabilities, and implement worldclass observability using Dynatrace and Splunk.

Key Responsibilities

Platform Reliability & Operations

Own the endtoend health, uptime, performance, and reliability of the platform across cloud (Azure) and onprem environments.

Ensure resilience across application layers: .NET, Java, React.js, Microservices, and backend systems such as SQL Server and Kafka.
Lead incident management, root cause analysis, and postincident reviews with a focus on continuous improvement.

Infrastructure Engineering & Automation

Design, implement, and maintain cloud and onprem infrastructure using Terraform (IaC).

Own and optimize CI/CD pipelines for infrastructure and applications in:

o GitHub Actions

o Azure DevOps

Improve deployment automation, reliability, and release processes across all teams.

Observability, Monitoring & Proactive Operations

Implement and enhance monitoring, alerting, dashboards, and analytics using:

o Dynatrace (APM, RUM, synthetic monitoring, logs, metrics)

o Splunk (log search, correlation, alerting)

Build proactive monitoring workflows to detect issues before they impact customers.
Own SRE metrics such as SLOs, SLIs, Error Budgets, MTTR, MTBF, availability KPIs, and system productivity metrics.
Performance tuning of the database / application services.

Security & Compliance

Ensure all platform and application security vulnerabilities are identified and remediated on time.

Partner with cybersecurity to ensure compliance with enterprise standards and policies.
Automate security scans and integrate them into CI/CD pipelines.

Performance & Scalability

Conduct performance analysis, load testing, and tuning across:

o Microservices

SQL Server databases
Kafka clusters
Frontend React.js applications

Partner with engineering teams to design scalable, reliable system architectures.

Collaboration & Leadership

Collaborate with development, architecture, infrastructure, and security teams.

Advocate for SRE and DevOps culture automation, reliability engineering, blameless postmortems.
Mentor developers and engineers on reliability best practices and tools.

Required Qualifications

5+ years of experience in SRE, DevOps, or Platform Engineering roles.

Strong expertise in:

o SQL Server administration and performance tuning

o .NET, Java, Microservices architectures

o React.js fundamentals

Handson experience with:

o Azure Cloud services (VMs, AKS, App Services, Networking)

o Onprem servers and hybrid integrations

o Terraform (writing, testing, maintaining modules)

CI/CD with GitHub and Azure DevOps

Proficiency with observability tools:

o Dynatrace (preferred)

o Splunk

Experience with Kafka (producers, consumers, performance, tuning).
Strong understanding of SRE fundamentals:

o SLO/SLI design

Error budgets
Distributed systems concepts
Incident response

Preferred Qualifications

Experience with containerization and Kubernetes (AKS or onprem K8s).

Experience with service mesh, API gateway technologies, or eventdriven architectures.
Knowledge of secure coding practices and integrating security in CI/CD.
Familiarity with enterprise networking, firewalls, and hybrid connectivity.

Soft Skills

Strong communication and collaboration abilities.

Analytical mindset with strong problemsolving skills.
Ability to handle pressure in highseverity incidents.
Passion for automation, simplification, and continuous improvement.

Job Impact

In this role, you will directly influence the reliability and stability of core enterprise services used by millions of customers and internal users. You will serve as a technical leader who bridges development, infrastructure, operations, and security to deliver a worldclass, resilient platform.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 91172365
Position Id: 8860117
Posted 3 hours ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.