Overview
On Site
USD 80.00 - 87.00 per hour
Contract - Independent
Skills
Quality Assurance
Communication
Product Demonstration
Demonstrations
Recovery
Kubernetes
Stacks Blockchain
Continuous Improvement
Network Design
Middleware
Database Architecture
Disaster Recovery
Scripting
Bash
Python
PHP
Amazon Web Services
Performance Tuning
Load Balancing
Failover
CHAOS
Team Leadership
Splunk
SSL
IT Architecture
User Stories
Cloud Computing
High Availability
Workflow
Mentorship
Training
Collaboration
DevOps
Network Security
Privacy
Marketing
Job Details
Location: Coppell, TX
Salary: $80.00 USD Hourly - $87.00 USD Hourly
Description:
They need a seasoned "resiliency" architect who can design applications that keep running even when parts fail. This person will work with business leaders to understand goals, then translate those into technical blueprints for fault-tolerant systems. They'll build proof-of-concept apps, test failure scenarios, and show how services recover in the cloud, containers, and on-premises environments. Strong communication and mentoring skills are key because they'll guide teammates and explain complex ideas in simple terms.
What They'll Be Doing:
Qualifications
What the Best resumes have:
Preferred Skills
By providing your phone number, you consent to: (1) receive automated text messages and calls from the Judge Group, Inc. and its affiliates (collectively "Judge") to such phone number regarding job opportunities, your job application, and for other related purposes. Message & data rates apply and message frequency may vary. Consistent with Judge's Privacy Policy, information obtained from your consent will not be shared with third parties for marketing/promotional purposes. Reply STOP to opt out of receiving telephone calls and text messages from Judge and HELP for help.
Contact:
This job and many more are available through The Judge Group. Please apply with us today!
Salary: $80.00 USD Hourly - $87.00 USD Hourly
Description:
They need a seasoned "resiliency" architect who can design applications that keep running even when parts fail. This person will work with business leaders to understand goals, then translate those into technical blueprints for fault-tolerant systems. They'll build proof-of-concept apps, test failure scenarios, and show how services recover in the cloud, containers, and on-premises environments. Strong communication and mentoring skills are key because they'll guide teammates and explain complex ideas in simple terms.
What They'll Be Doing:
- Meeting with stakeholders to gather requirements and map business goals into resilient solution designs
- Building and validating full-stack demo applications that test various failure and recovery patterns
- Running simulated outages in AWS, Kubernetes, or on-prem stacks to prove system recoverability
- Defining and implementing monitoring, alerting, and automation scripts (Bash/Python) to detect and remediate issues
- Leading continuous-improvement efforts, setting high-availability standards, and documenting best practices
Qualifications
- 10+ years designing and implementing distributed applications in production
- 5+ years in network, infrastructure, middleware, and database architecture
- 5+ years hands-on with high-availability or disaster-recovery solutions and methodologies
- Proven scripting ability (Bash, Python, PHP) for automation of resiliency tests
- Experience or certification in cloud platforms (AWS preferred), performance tuning, and chaos engineering
What the Best resumes have:
- Clear examples of systems designed for fault tolerance, load balancing, and automatic failover
- Bullet points around DR drills, outage simulations, or chaos-engineering experiments
- Proven track record of mentoring or leading teams through resiliency projects
- Hands-on experience with monitoring tools (CloudWatch, Splunk, CloudTrail) and IAM/SSL setups
- Demonstrated ability to translate business requirements into technical architecture diagrams or user stories
Preferred Skills
- Translate business drivers into resilient architecture blueprints and proof-of-concept applications
- Validate and test resilience in cloud, containerized, and on-prem environments through scripted simulations
- Define and enforce high-availability standards, monitoring strategies, and automation workflows
- Act as a subject-matter expert and mentor, conducting knowledge transfers and training sessions
- Collaborate with DevOps, networking, security, and development teams to embed resiliency best practices
By providing your phone number, you consent to: (1) receive automated text messages and calls from the Judge Group, Inc. and its affiliates (collectively "Judge") to such phone number regarding job opportunities, your job application, and for other related purposes. Message & data rates apply and message frequency may vary. Consistent with Judge's Privacy Policy, information obtained from your consent will not be shared with third parties for marketing/promotional purposes. Reply STOP to opt out of receiving telephone calls and text messages from Judge and HELP for help.
Contact:
This job and many more are available through The Judge Group. Please apply with us today!
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.