Overview
Remote
Depends on Experience
Contract - W2
Skills
Apigee SRE
Job Details
Role: Apigee SRE / Automation Lead
Location: Remote
Type: Contract
Job Summary:
We are seeking a highly skilled and experienced Apigee SRE / Automation Lead to oversee the reliability, scalability, and automation of our API infrastructure. This role demands deep expertise in Apigee platform operations, infrastructure automation, and site reliability engineering. The ideal candidate will have hands-on experience with tools like Terraform, Ansible, and DoJo, strong scripting skills, and a solid background in Linux, AWS, and the data and messaging components that back Apigee: Cassandra, PostgreSQL, Qpid, and ZooKeeper.
Key Responsibilities:
- Lead the Site Reliability Engineering (SRE) efforts for Apigee infrastructure and services.
- Design and implement automated deployment pipelines and infrastructure provisioning using Terraform, Ansible, and DoJo.
- Manage and monitor distributed systems including Cassandra, PostgreSQL, Qpid, and ZooKeeper.
- Ensure high availability, performance, and scalability of Apigee services.
- Develop and maintain monitoring and alerting systems for proactive issue detection and resolution.
- Collaborate with development, operations, and security teams to enforce best practices in API management and infrastructure reliability.
- Perform root cause analysis and implement long-term fixes for production incidents.
- Write and maintain scripts for automation, monitoring, and operational tasks.
Required Skills & Qualifications:
- Proven experience in Apigee platform operations and automation.
- Strong knowledge of Linux system administration and AWS cloud infrastructure.
- Hands-on experience with Cassandra, PostgreSQL, Qpid, and ZooKeeper.
- Proficiency in Terraform, Ansible, and DoJo for infrastructure automation.
- Strong scripting skills (e.g., Bash, Python).
- Experience with monitoring tools and observability frameworks.
- Excellent troubleshooting, analytical, and communication skills.
Preferred Qualifications:
- Experience with CI/CD tools and DevOps practices.
- Familiarity with containerization and orchestration (e.g., Docker, Kubernetes).
- Knowledge of API security, governance, and lifecycle management.