Chaos Test Engineer

Remote • Posted 2 hours ago • Updated 2 hours ago
Full Time
Remote
Fitment

Dice Job Match Score™

🔗 Matching skills to job...

Job Details

Skills

  • Team Building
  • Collaboration
  • IoT
  • Disaster Recovery
  • Artificial Intelligence
  • Orchestration
  • Test Suites
  • Microservices
  • Microsoft Azure
  • Management
  • CHAOS
  • Kubernetes
  • SDK
  • High Availability
  • Testing
  • Cloud Computing

Summary

Join a high-impact engineering team building resilience frameworks across cloud-native platforms. You will design, execute, and evolve chaos experiments that safeguard platform reliability and drive the development of autonomous, AI-powered testing pipelines at scale. Join EPAM to engineer solutions that matter. From AI to cloud transformation, you'll collaborate with top-tier innovators, gain autonomy to explore your ideas, and grow your skills in a culture built for tech excellence. You will be working with an IoT platform, handling millions of devices. Req# Responsibilities Design and manage chaos engineering tests using Azure Chaos Studio, analyze platform architecture to identify failure domains and strengthen system resilience Maintain and enhance existing LitmusChaos test suites across Kubernetes environments, ensure consistent coverage and accuracy across all platforms Build comprehensive testing suites by integration of LitmusSDK, Azure Management SDK, Chaos SDK and Kubernetes SDK to automate and scale chaos experiments Lead HA/DR testing initiatives across all environments, operate independently to validate high availability and disaster recovery readiness Establish and standardize chaos engineering frameworks across AKS and EKS platforms, enable scalable and repeatable resilience practices organization-wide Integrate AI-driven capabilities into the chaos engineering pipeline to enable touchless experiment creation, automated execution and continuous validation Requirements Hands-on experience with Kubernetes orchestration platforms including AKS or EKS, with deep understanding of container-based infrastructure and cloud-native architecture Proficiency in chaos engineering tools including LitmusChaos and Azure Chaos Studio, with demonstrated experience to build and maintain structured test suites Experience with Istio service mesh for traffic management, observability and resilience configuration within microservices environments Practical experience with LitmusSDK, Azure Management SDK, Chaos SDK and Kubernetes SDK Proven ability to conduct HA/DR testing and work autonomously with minimal oversight across complex multi-environment cloud platforms
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10330481
  • Position Id: cc73f32db9696f64e81fe5e80c75cd21
  • Posted 2 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Remote or New York, New York

Today

Full-time

Remote

Today

Full-time

USD 81,045.75 - 155,000.00 per year

Remote or Chicago, Illinois

Today

Full-time

USD 114,500.00 - 194,700.00 per year

Remote or Eden Prairie, Minnesota

Today

Full-time

USD 91,700.00 - 163,700.00 per year

Search all similar jobs