Overview
Skills
Job Details
We are seeking a skilled Support Engineer with strong migration experience to join our team. You will lead the migration of the observability stack while ensuring high availability, performance, and reliability across infrastructure. The ideal candidate has hands-on expertise in Kubernetes, Python, and observability platforms (Grafana, Prometheus, etc.), backed by a strong DevOps/SRE background.
Key Responsibilities-
5+ years of experience in DevOps, SRE, or migration-focused roles
-
Provide operational support for telemetry and observability stacks in Kubernetes
-
Develop and maintain automation/monitoring tools using Python
-
Configure, monitor, and troubleshoot Grafana, Prometheus, Loki, and related tools
-
Collaborate with cross-functional teams to optimize system performance
-
Deploy, configure, and troubleshoot Kubernetes clusters
-
Work in a Linux-based environment with Git-based workflows
-
Support CI/CD pipelines and containerization tools like Docker
-
Familiarity with logging and tracing systems
-
Experience with service meshes (e.g., Cilium)
-
Knowledge of Infrastructure-as-Code tools: Pulumi, Terraform + Helm, Spinnaker
-
Exposure to cloud platforms (AWS, Google Cloud Platform, Azure)
-
Strong troubleshooting and performance tuning skills