Job Details
Cassandra Admin (L3) with DataStax experience, covering all admin activities such as backup, restore, repair, data replication, and compaction strategies.
Performance tuning and troubleshooting. Kubernetes knowledge is an advantage.
Administration & Operations: Install, configure, manage, and maintain large-scale Cassandra and Redis database clusters in production environments (on-premises and/or cloud platforms such as AWS and Azure).
Performance Tuning & Optimization: Monitor database performance, identify bottlenecks, and implement optimizations for queries, data models, JVM heap settings (for Cassandra), and memory usage and eviction policies (for Redis).
High Availability & Disaster Recovery: Develop, implement, and test backup, recovery, and disaster recovery strategies (e.g., nodetool snapshots, RDB/AOF persistence) to ensure data durability and minimal downtime.
Troubleshooting & Root Cause Analysis: Serve as an L3/L4 escalation point for complex production incidents, perform root cause analysis (RCA), and implement permanent fixes.
Architecture & Design: Collaborate with architects and developers on optimal data modeling, schema design, capacity planning, and the implementation of appropriate replication strategies and consistency levels.
Security & Compliance: Implement and enforce robust security measures, including authentication (LDAP, Kerberos), authorization (RBAC, ACLs), and encryption (TLS/SSL), ensuring compliance with industry standards (e.g., GDPR, HIPAA).
Monitoring & Alerting: Set up and manage monitoring and alerting tools (e.g., Prometheus, Grafana, OpsCenter) to proactively track database health and performance metrics.
Automation: Automate routine administrative tasks, deployments, and scaling activities using scripting languages (Bash, Python) and Infrastructure as Code (IaC) tools (Terraform, Ansible).
Collaboration & Documentation: Work effectively with DevOps, SRE, application, and infrastructure teams, and maintain detailed documentation (runbooks, SOPs).
Required Skills and Qualifications
Technical Expertise: Deep expertise in Apache Cassandra architecture and internals (compaction strategies, tombstones, read/write paths).
Redis Proficiency: Hands-on experience managing Redis clusters (Standalone, Sentinel, Enterprise), including persistence, high availability, and memory management.
Cloud Experience: Experience deploying and managing databases in cloud environments (AWS, Azure, Google Cloud Platform).
Operating Systems & Scripting: Strong proficiency in Linux/Unix operating systems and scripting languages such as Python or Bash.
Tools: Experience with management and monitoring tools such as DataStax OpsCenter, Prometheus, Grafana, or the ELK stack.
Problem-Solving: Strong analytical and troubleshooting skills to diagnose and resolve complex production issues quickly.
Communication: Excellent communication skills and the ability to work in a collaborative, team-oriented environment.