Lead Site Reliability Engineer

Philadelphia, PA, US • Posted 30+ days ago • Updated 7 hours ago

Contract Independent

On-site

USD $150,000.00 - 180,000.00 per year

Judge Group, Inc.

Job Details

Skills

Real-time
Reliability Engineering
Incident Management
Root Cause Analysis
Disaster Recovery
Business Continuity Planning
Scalability
Grafana
New Relic
Leadership
Collaboration
Quality Assurance
DevOps
IaaS
Amazon Web Services
Microsoft Azure
Google Cloud Platform
Google Cloud
Kubernetes
Continuous Integration
Continuous Delivery
Release Engineering
Terraform
Scripting
Python
Bash
Workflow
Management
Customer Facing
SaaS
High Availability
Mentorship
Operational Excellence
Privacy
Marketing

Summary

Location: Philadelphia, PA
Salary: $150,000.00 USD Annually - $180,000.00 USD Annually
Description:
We are seeking a Lead Site Reliability Engineer (SRE) who combines deep technical expertise with strong leadership and client-facing capabilities. This is a high-impact role responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure and kiosk platform.

You will lead a team of engineers while remaining hands-on, owning uptime, SLAs, and incident management, and driving long-term improvements in system resilience and operational maturity. This role also requires working closely with Fortune 500 clients, translating complex technical concepts into clear, business-friendly insights.

What Makes This Role Unique

This is a rare opportunity for a hybrid leader who can:

Operate as a hands-on SRE expert
Lead and mentor a team of engineers
Act as a client-facing technical advisor
Drive both real-time operations and long-term reliability strategy

Key Responsibilities:

Reliability & Operations

Own platform uptime, SLAs, and overall system reliability
Lead incident response, root cause analysis, and postmortems
Develop and maintain disaster recovery and business continuity plans

Infrastructure & Automation

Design, build, and optimize cloud infrastructure and Kubernetes environments
Automate deployments and operational tasks using CI/CD and Infrastructure-as-Code (Terraform preferred)
Improve system scalability, performance, and resilience

Observability & Monitoring

Implement and enhance monitoring, alerting, and observability tools (e.g., Prometheus, Grafana, New Relic)
Establish operational standards, runbooks, and best practices

Leadership & Collaboration

Lead, mentor, and develop a team of ~6 engineers
Partner with platform engineering, QA, and development teams to ensure operational readiness
Serve as a technical point of contact for clients, clearly communicating system health, risks, and solutions

Required Qualifications:

8+ years of experience in SRE, DevOps, or Platform Engineering
2+ years in a lead or managerial role
Strong expertise in:

Cloud infrastructure (AWS, Azure, or Google Cloud Platform)
Kubernetes and containerized environments
CI/CD pipelines and release engineering
Infrastructure-as-Code (Terraform preferred)

Proficiency in scripting/automation (Python, Bash, or Go)
Deep understanding of observability, monitoring, and logging systems
Experience with GitOps workflows (e.g., ArgoCD)
Proven experience managing production systems with strict uptime requirements

Preferred Experience:

Client-facing experience in enterprise or SaaS environments (required)
Experience communicating with non-technical stakeholders and Fortune 500 clients
Background in high-availability systems and large-scale distributed environments

What We're Looking For:

A hands-on technical leader who can balance execution and strategy
Strong communicator with executive presence
Someone who thrives in high-ownership, fast-paced environments
A mentor who can elevate team performance and operational excellence

By providing your phone number, you consent to: (1) receive automated text messages and calls from the Judge Group, Inc. and its affiliates (collectively "Judge") to such phone number regarding job opportunities, your job application, and for other related purposes. Message & data rates apply and message frequency may vary. Consistent with Judge's Privacy Policy, information obtained from your consent will not be shared with third parties for marketing/promotional purposes. Reply STOP to opt out of receiving telephone calls and text messages from Judge and HELP for help.
Contact:
This job and many more are available through The Judge Group. Please apply with us today!

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: cxjudgpa
Position Id: 1131735

Company Info

About Judge Group, Inc.

The Judge Group is a leading professional services firm specializing in talent, technology, and learning solutions. We consult, staff, train, and solve. Through our work we make people and organizations better.

Our services are successfully delivered through a network of more than 30 offices across the United States, Canada, and India. The Judge Group is proud to partner with the best and brightest companies in business today, including over 60 of the Fortune 100. We serve organizations in financial services, healthcare, life sciences, insurance, government (including aerospace and defense), manufacturing, and technology and telecommunications.
