Key Responsibilities
Design, build, and maintain scalable, reliable, and secure infrastructure across production and staging environments.
Automate operational tasks and processes using code (Python, Go, Bash, etc.).
Drive infrastructure as code (IaC) practices using tools like Terraform, Ansible, or similar.
Monitor, troubleshoot, and improve system availability, latency, and performance.
Collaborate closely with development, QA, and product teams to design scalable system architecture.
Conduct root cause analysis (RCA) and postmortems for critical incidents.
Lead and support CI/CD pipeline development and optimization.
Manage and scale large, distributed systems in hybrid or cloud-native environments (e.g., AWS, Azure, OpenShift, Kubernetes).
Improve observability using tools like Prometheus, Grafana, ELK/EFK, or similar.
Champion SRE best practices: error budgets, SLIs/SLOs/SLAs, chaos engineering, etc.
Required Skills and Qualifications
5+ years of experience in SRE, DevOps, or infrastructure engineering roles.
Strong infrastructure knowledge: networking, Linux internals, containers, load balancers, DNS, storage.
Proficiency in at least one scripting or programming language, such as Python, Go, Java, or Bash.
Experience managing platforms at scale (millions of users, hundreds of microservices).
Hands-on experience with configuration management and IaC tools (Terraform, Ansible, Helm).
Solid understanding of CI/CD workflows, GitOps practices, and deployment strategies (blue-green, canary).
Experience working with Amdocs systems and environments is an advantage.
Familiarity with telecom-grade SLA expectations and high-availability architectures.
Preferred Qualifications
Experience with Amdocs platforms (CES, CRM, BSS/OSS) is a strong plus.
Cloud certifications (AWS, Azure, Google Cloud Platform) or Kubernetes certifications (CKA/CKAD).
Experience with service mesh technologies like Istio or Linkerd.
Background in security practices (secrets management, IAM, auditing, compliance).