Required Qualifications
7+ years of experience in Site Reliability Engineering, DevOps, Cloud Infrastructure, or Production Operations roles.
Strong experience operating workloads in cloud environments such as Microsoft Azure, AWS, or Google Cloud.
Hands-on experience with Kubernetes, Docker, CI/CD pipelines, and Infrastructure as Code tools.
Strong scripting and automation skills using Python, Bash, PowerShell, Go, or similar languages.
Experience with observability and monitoring platforms such as Datadog, Grafana, Prometheus, or Splunk.
Strong understanding of networking, Linux/Windows administration, distributed systems, and cloud-native architectures.
Experience with incident response, production troubleshooting, and operational governance.
Strong communication skills and ability to collaborate across engineering and business teams.
Preferred Qualifications
Experience supporting multi-tenant SaaS environments.
Experience with Terraform, Bicep, ARM templates, or Ansible.
Familiarity with GitOps and modern deployment strategies such as canary or blue/green deployments.
Experience working within regulated or compliance-driven environments.
Relevant cloud or Kubernetes certifications.