Key Highlights:
- Proven expertise in Google Cloud Platform (GCP) services, including BigQuery, Cloud Logging, IAM, and Service Accounts.
- Strong background in provisioning, monitoring, and troubleshooting staging and production cloud environments.
- Experienced in architectural design for reliability, scalability, and performance.
- Practical application of SRE principles: SLIs, SLOs, error budgets, automation, incident management, and postmortems.
- Hands-on experience with containerization, orchestration, and serverless technologies (Docker, Kubernetes).
- Proficient in observability solutions such as Dynatrace, Prometheus, Grafana, and the ELK/EFK stacks.
- Strong programming and scripting skills in Python, Go, and Bash for automation and tool development.
- Excellent problem-solving, analytical, and strategic thinking abilities.
- Collaborative leader with strong communication skills and the ability to influence technical direction across teams.
- Experienced in on-call support and operational readiness for mission-critical systems.