Cloud Site Reliability Engineer (SRE)

Berkeley Heights, NJ, US • Posted 30+ days ago • Updated 11 hours ago

Contract Independent

On-site

USD $70.00 - 80.00 per hour

Judge Group, Inc.

Job Details

Skills

Scalability
Software Engineering
Operational Efficiency
Accountability
Teamwork
Reliability Engineering
High Availability
Load Balancing
Failover
IaaS
Real-time
Root Cause Analysis
Capacity Management
Regulatory Compliance
Encryption
Auditing
Vulnerability Management
Mentorship
Scripting
Python
Shell
Bash
Orchestration
Docker
Kubernetes
Terraform
Ansible
Splunk
Dynatrace
Data Migration
Operating Systems
Microsoft Windows
Linux
Unix
System Administration
Computer Networking
Incident Management
Problem Solving
Conflict Resolution
Management
Collaboration
Communication
Articulate
Amazon Web Services
Google Cloud
Google Cloud Platform
DevOps
CHAOS
Testing
Cloud Computing
Service Level
Budget
Microsoft Azure
Privacy
Marketing

Summary

Location: Berkeley Heights, NJ
Salary: $70.00 - $80.00 USD per hour
Description:
Job Title: Cloud Site Reliability Engineer (SRE)

Location: Berkeley Heights, NJ / Alpharetta, GA (Onsite 5 Days)

Duration: Contract To Hire

Job Description:

Position Overview: We are seeking a Cloud Site Reliability Engineer (SRE) to drive the reliability, scalability, and performance of our cloud-based infrastructure.

The ideal candidate combines software engineering expertise with advanced systems operations skills to maintain highly available systems while reducing operational toil. This role involves automation, monitoring, capacity planning, incident response, and cloud platform management across a dynamic, distributed environment.

As a Cloud SRE, you will work closely with Engineering, Architecture, DevOps, and security teams to ensure seamless service experiences for our customers while contributing to platform design and operational efficiency.

Position Requirements: Our Engineers play a critical role in the success of our clients and are expected to effectively communicate our recommended solutions in a consultative role for each client. Therefore, a successful candidate will possess a high degree of self-management, personal accountability, strong communication skills, and teamwork. The ability to interact, engineer, and communicate collaboratively at the highest technical levels with customers, vendors, partners, and all members of staff is required.

Key Responsibilities

System Reliability & Availability: Design and maintain fault-tolerant, high-availability architectures across AWS, Azure, and Google Cloud Platform. Implement redundancy, load balancing, and automated failover strategies.

Cloud Infrastructure Management: Deploy, manage, and optimize cloud resources using IaC tools such as Terraform and Ansible.

Monitoring & Observability: Implement monitoring, alerting, and logging frameworks using Splunk, Azure Monitor, Dynatrace, AWS CloudWatch, or similar tools to detect and resolve issues proactively.

Incident Management: Lead real-time incident response, root-cause analysis, and postmortems to continuously improve uptime and resilience.

Capacity Planning & Scaling: Predict traffic patterns, optimize resource utilization, and enforce autoscaling and performance best practices.

Automation & Tooling: Develop scripts and internal tooling for automating routine tasks to reduce manual intervention. Languages may include Python, PowerShell, or Bash.

Security & Compliance: Collaborate with security teams to implement secure infrastructure practices including encryption, role-based access, auditing, and vulnerability management.

Collaboration & Mentorship: Work across engineering and DevOps teams, providing guidance on reliability best practices and mentoring junior SREs.

Required Skills & Qualifications

Programming & Scripting: Proficiency in Python, PowerShell, Bash, or equivalent for automation and system management.

Cloud Platforms: Hands-on experience with AWS, Azure, or Google Cloud Platform; strong understanding of VPCs, IAM, serverless architectures, and managed Kubernetes services.

Containers & Orchestration: Experience with Docker and Kubernetes.

Infrastructure as Code (IaC): Proficiency in Terraform and Ansible.

Monitoring & Observability: Expertise with Splunk, Azure Monitor, Dynatrace, AWS CloudWatch, or similar tools.

Data Migration: Expert knowledge of, and practical experience with, cloud data migration tools.

Operating Systems: Advanced knowledge of Windows and Linux/Unix environments, with experience in system administration and networking fundamentals.

Incident Response: Strong problem-solving skills under pressure, with experience managing outages and mitigating risk.

Collaboration & Communication: Ability to articulate technical insights, coordinate across teams, and contribute to a blameless culture to resolve issues and drive consistent results.

Preferred Qualifications

Industry certifications such as AWS Certified Solutions Architect, Google Cloud Professional DevOps Engineer, or Azure DevOps Engineer.

Exposure to chaos engineering or resilience testing frameworks.

Prior experience in multicloud deployments or hybrid cloud environments.

Familiarity with service-level objectives (SLOs), indicators (SLIs), and error budgets for service reliability.

Gather feedback from the department on areas of improvement and provide solutions utilizing Azure.
This job and many more are available through The Judge Group. Please apply with us today!

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: cxjudgpa
Position Id: 1134534

Company Info

About Judge Group, Inc.

The Judge Group is a leading professional services firm specializing in talent, technology, and learning solutions. We consult, staff, train, and solve. Through our work we make people and organizations better.

Our services are successfully delivered through a network of more than 30 offices across the United States, Canada, and India. The Judge Group is proud to partner with the best and brightest companies in business today, including over 60 of the Fortune 100. We serve organizations in financial services, healthcare, life sciences, insurance, government (including aerospace and defense), manufacturing, and technology and telecommunications.
