Overview
Skills
Job Details
Responsibilities:
Design and implement Kubernetes clusters on GKE for stateful workloads.
Configure CSI drivers (PD, GCSFuse, Lustre) and ensure compatibility with BYO Kubernetes environments.
Develop CI/CD pipelines for OSS CSI drivers and maintain test coverage across Kubernetes versions.
Implement autoscaling, lifecycle management, and network policies for session/memory services.
Build and maintain CI/CD pipelines for OSS components.
Automate deployment of session/memory services and artifact storage.
Implement monitoring and alerting for stateful workloads.
Ensure compliance with security and reliability best practices.
Skills:
Strong experience with Kubernetes, GKE, Helm, and CRDs.
Expertise in CSI driver development and troubleshooting.
Familiarity with cloud storage solutions (GCS, Lustre, Filestore).
Proficiency in CI/CD tools (Prow, GitHub Actions).
Proficiency in automation tools (Terraform, Ansible).
Experience with cloud-native observability (Prometheus, Grafana).
Strong scripting skills (Python, Bash).