Overview
Skills
Job Details
Job Title: Senior DevOps Consultant Active/Active Infrastructure & High Availability
Location: Remote
Job Type: 12 Weeks Contract
Start Date: ASAP
Job Overview:
We are seeking a highly skilled Senior DevOps Consultant to design and implement an active-active VPC architecture across two availability zones, enabling shared load balancing and failover-ready systems. The ideal candidate will have deep hands-on experience with AWS infrastructure, cluster replication, and Java-based backend services engineered for high availability. Experience with vector databases and modern AI-enhancement methods such as RAG (Retrieval-Augmented Generation) is a strong plus.
Key Responsibilities:
- Design and deploy two VPCs in active-active mode across two AWS availability zones.
- Implement shared load balancing to ensure continuous availability.
- Set up and maintain data replication strategies to manage seamless failover between zones.
- Build and optimize cluster replication for:
- MySQL (managed and self-hosted)
- Redis (managed and self-hosted)
- Vector databases
- Work closely with development teams to refactor Java applications for high availability in active/active environments.
- Support blue/green deployment strategies to minimize downtime and risk.
- Collaborate on architectural decisions, CI/CD practices, and infrastructure-as-code implementations.
- Ensure operational excellence, scalability, and performance of the infrastructure stack.
Must-Have Qualifications:
- Strong expertise in AWS infrastructure, including VPC setup, load balancers, availability zones, and failover configurations.
- Proven experience with cluster replication for MySQL, Redis, and Vector databases.
- Hands-on experience managing both managed services (RDS, ElastiCache) and self-hosted instances.
- Strong programming skills in Java, specifically around implementing high availability and active/active setups.
- Experience implementing blue/green deployments and rollout strategies for Java-based applications.
- Familiarity with observability tools, disaster recovery strategies, and automation.
Nice to Have (Huge Plus):
- Experience with Weaviate or other Vector Databases (e.g., Pinecone, FAISS, Qdrant).
- Understanding of Retrieval-Augmented Generation (RAG) for enhancing LLM performance.
- Exposure to AI/ML infrastructure, MLOps, or large-scale knowledge retrieval systems.
Preferred Certifications (Optional but Valued):
- AWS Certified DevOps Engineer Professional