Principal Engineer, IT Disaster Recovery & Resiliency

Overview

On Site
Full Time

Skills

Retail
Supervision
Mentorship
Failover
Data-flow diagrams
RPO
Testing
Policies
Risk analysis
Operations
Business operations
Backup
Service level
Auditing
IT management
Strategy
Documentation
Information Technology
Disaster recovery
Leadership
Presentations
Data
Network
SaaS
PaaS
IaaS
Microsoft Azure
Salesforce.com
Amazon Web Services
PMP
Recovery
High availability
Cloud computing
Computer networking
Communication
Management
Business continuity planning
Risk assessment
Planning
Microsoft Office
Computer science
Information systems
Information security

Job Details

Overview

Here at Discount Tire, we celebrate the spirit of our people with extraordinary pride and enthusiasm. Our business has been growing for more than 60 years and now is the best time in our history to join us. We are opening more locations every year and we are always looking for qualified individuals to join us in our growth. We are a company that promotes from within, both in our retail and corporate operations.

Under general supervision, the Principal Engineer, IT Disaster Recovery & Resiliency manages Disaster Recovery (DR) plans and DR testing, and ensures that DR solutions are in place for mission-critical applications and systems. Provides direction to determine the technical and operational feasibility of solutions and their alignment with DR requirements and standards. As a subject matter expert, continually provides mentorship and servant leadership to all.

Essential Duties and Responsibilities:
  • Manages and maintains application, system, and cloud DR Plans
  • Owns DR standards and policies
  • Plans and coordinates DR Testing
  • Performs failover and failback exercises
  • Identifies application single points of failure and presents recommendations to correct
  • Recommends and presents resiliency options for cloud-based applications and infrastructure
  • Collaborates with Enterprise Data Architect on maintenance of all data flow diagrams and topologies
  • Collaborates with Risk Segment on recovery time objectives (RTO) and recovery point objectives (RPO) for mission-critical applications and systems
  • Schedules and hosts meetings to coordinate DR Testing and Plan updates
  • Engages vendors as necessary to discuss HA and DR solutions
  • Leads semi-annual DR policy and standards review
  • Leads recurring risk analysis to identify critical operations and systems core to continued business operations in the event of disruption
  • Monitors and evaluates plans and backup systems
  • Develops and deploys awareness, documentation, and communication of disaster procedures to the organization
  • Develops service level recovery standards and agreements with vendors
  • Functions as liaison for auditing and examination of disaster recovery processes
  • Works with IT management to ensure that the disaster recovery plans drive and support DR strategy for the enterprise
  • Establishes and maintains DR procedures and documents interdependencies
  • Other duties as assigned


Qualifications:
  • This position requires a minimum of 10 years' experience in the information technology field
  • 5+ years managing and executing Disaster Recovery Plans for a multi-billion-dollar organization
  • 5+ years leading medium-to-large projects, including planning, execution, and leadership presentations
  • Must have a solid working knowledge of data center computing, SaaS, IaaS, and network environments, and system resiliency in applications (business and technical)
  • Must have experience interacting with auditors
  • Proficient in SaaS, PaaS, and IaaS environments
  • Proficient in Microsoft Azure, AWS and Salesforce services
  • Proficient in working with multiple Amazon Web Services (AWS) availability zones and regions


Preferred Qualifications
  • DR Certified Planner, Specialist, or Expert certification, or equivalent experience
  • CBRM (Certified Business Resilience Manager) or CBRITP (Certified Business Resilience IT Professional) certification, plus PMP certification
  • Understanding of current recovery solutions, High Availability and Cloud Architectures
  • Deep understanding of fundamental IT network and infrastructure principles
  • Excellent oral and written communication skills to convey plans, exercises, and activities
  • Must be able to interact with technical, non-technical, and management staff
  • Familiarity with Business Continuity program life cycle plans and source deliverables (e.g., risk assessments, continuity planning)
  • Proficient in Microsoft Office suite including Project


Educational Requirements
  • Bachelor's degree in computer science, information systems, information security, or related IT field is required. A master's degree is preferred.
  • Professional certifications are a plus.


Discount Tire provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local law.
