Disaster Recovery & Resiliency Manager (Infrastructure & Cloud)

• Posted 1 day ago • Updated 1 day ago

Contract Independent

Contract Corp To Corp

Contract W2

Travel Required

Fitment

Dice Job Match Score™

🫥 Flibbertigibetting...

Job Details

Skills

infrastructure
backup
DR

Summary

Disaster Recovery & Resiliency Manager (Infrastructure & Cloud)

Fort Mill, SC (Onsite)

1 year Contract

Position Summary
The Recovery Manager is responsible for ensuring the availability, resilience, and rapid recovery of critical infrastructure and production systems. This role bridging infrastructure engineering and production support to drive "always-on" capabilities. The manager will define, test, and maintain disaster recovery (DR) plans, implement observability to proactively detect potential outages, lead major incident resolution, and conduct root cause analysis (RCA) to continuously improve service reliability.

Key Responsibilities
1. Resiliency Planning & Disaster Recovery (DR)
Develop, maintain, and test comprehensive DR plans, runbooks, and Business Impact Analyses (BIA) for hybrid/cloud infrastructure.
Define RTO (Recovery Time Objective) and RPO (Recovery Point Objective) targets, ensuring infrastructure design meets these requirements.
Lead regular disaster recovery tests, simulation exercises, and tabletop drills, documenting outcomes and tracking remediation actions to closure.
Apply infrastructure-as-code (IaC) principles to automate recovery processes.
2. Production Support & Incident Management
Serve as a primary point of contact (POC) for major infrastructure incidents and high-profile disruptions.
Coordinate technical recovery efforts across cross-functional teams (network, server, storage, database, cloud) during incidents.
Lead Root Cause Analysis (RCA) and post-mortem investigations to identify and deploy countermeasures, ensuring incidents do not recur.
Monitor production system performance and availability, optimizing for high availability (HA).
3. Observability & Monitoring
Develop and promote a company-wide observability platform (e.g., Splunk, Datadog, Prometheus, Grafana) for real-time monitoring of infrastructure health.
Establish and track Service Level Objectives (SLOs) and Service Level Indicators (SLIs).
Implement proactive monitoring, alerting, and automated healing, ensuring fast incident detection and recovery.
4. Leadership & Governance
Provide executive-level reporting on resilience posture, test results, and material risks.
Manage relationships with third-party vendors, partners, and service providers regarding service SLAs.
Ensure adherence to industry frameworks and compliance requirements (e.g., NIST, ISO 22301, ITIL).
Required Skills & Qualifications
Experience: 5 10+ years in IT disaster recovery, business continuity, production support, or infrastructure operations.
Infrastructure: In-depth knowledge of on-premises (VMware, SAN/NAS, Linux/Windows) and Cloud (AWS, Azure) environments.
Tools: Proficient in monitoring/observability tools (e.g., Datadog, Splunk, Dynatrace) and backup/replication technologies (e.g., Rubrik, Cohesity, Zerto).
Methodology: Strong understanding of ITIL, DevOps practices, and incident management frameworks.
Soft Skills: Excellent communication skills, crisis management abilities, and capability to work under pressure.

Best Regards,
Yogesh Bisht
Recruitment Lead

VBeyond Corporation |

Note VBeyond is fully committed to Diversity and Equal Employment Opportunity.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 10209652
Position Id: 2026-225509
Posted 1 day ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Disaster Recovery Program Manager

Topeka, Kansas

•

Yesterday

The IT Disaster Recovery Program Manager is a senior-level role responsible for developing, governing, and executing enterprise disaster recovery strategies to ensure the resilience and recoverability of critical IT services. This position oversees the organizations disaster recovery framework, collaborating with various technology and business units to establish recovery objectives, conduct testing, and maintain compliance with regulatory standards. The role also involves managing recovery docu

Easy Apply

Contract, Third Party

Depends on Experience

Business Continuity/Disaster Recovery Manager

Hybrid in Santa Clara, California

•

Today

Job Title: Operational Resilience(Business Continuity and Disaster Recovery) Manager Job location: Santa Clara, CA Duration: 6 Months Contract to Hire About the Role: Client s Data Transformation Group is seeking a hands-on, technically adept Operational Resilience Manager with 5+ years of experience to support the development and execution of Business Continuity and Disaster Recovery (BC/DR) capabilities across hybrid cloud and on-prem environments. The ideal candidate will bring strong technic

Easy Apply

Contract

$70 - $80

Disaster Recovery Engineer - Greenville, SC (Onsite)

Greenville, South Carolina

•

Today

Role:- Disaster Recovery (DR) & Zerto Consultant Hyper V to VMware Environments Job Location: Greenville, SC (Onsite) Duration : 12+ Months Job Summary Seeking an experienced Disaster Recovery (DR) & Zerto Consultant to support the design, validation, and execution of disaster recovery operations for a production Hyper V based environment with replication to a VMware based DR tenant. The consultant will play a critical role in failover testing, recovery orchestration, cross hypervisor recover

Easy Apply

Contract, Third Party

Application Production Support

Plano, Texas

•

Yesterday

Job Title: Application Production Support Location: Plano, Texas (Onsite) Duration: 12+ Month Contract Rate: $60/hour on W2 Role Overview We are seeking an experienced Application Production Support Specialist to manage and support mission-critical Compliance applications focused on Sanctions and AML. This role requires a dependable production support professional who can operate in high-pressure environments, ensure application stability, and lead incident response with clear stakeholder commun

Easy Apply

Contract

$55 - $60

Search all similar jobs