Job Role: SRE Architect
Experience: 15 Year
Location: Atlanta Georgia(Hybrid)
Visa: Any
Job Description: • Design, implement, and manage container orchestration platforms using Kubernetes (EKS) and related AWS services
• Define orchestration strategies for microservices, batch jobs, and event-driven workloads to ensure scalability and fault isolation
• Architect self-healing systems leveraging auto-scaling, health checks, rolling deployments, and graceful degradation patterns
• Implement resilience patterns such as circuit breakers, bulkheads, retries, rate limiting, and backoff mechanisms
• Lead multi-region and multi-AZ orchestration strategies to ensure high availability and disaster recovery readiness
• Design and automate failover, failback, and traffic-shifting mechanisms using Route 53, ALB/NLB, and service mesh technologies
• Establish resilience testing practices including chaos engineering, fault injection, and game days
• Optimize workload orchestration for performance, cost efficiency, and reliability under peak and failure conditions
• Partner with application teams to embed resilience and orchestration best practices into service design and deployment pipelines
• Continuously evaluate and improve orchestration platforms to support evolving scalability, security, and reliability needs