Overview
- Work arrangement: On Site
- Compensation: Depends on Experience
- Employment type: Contract - W2
- Sponsorship: Able to Provide Sponsorship
Skills
Platform Reliability
Job Details
Core Responsibilities
- Implement automated testing frameworks for infrastructure changes
- Build APIs for common platform operations, secured with authentication and authorization
- Create sandbox environments with guardrails for experimentation
- Develop templated project provisioning workflows
- Create runbooks for common failure scenarios
- Implement automated scaling policies based on usage patterns
- Set up cost anomaly detection alerts
- Implement lifecycle policies for cost-effective data storage
- Build example implementations and starter kits
Technical Skills
- Google Cloud Platform Services: Strong experience with BigQuery, Cloud Run, GKE, Cloud Storage, and Pub/Sub
- Infrastructure as Code: Advanced Terraform skills with a focus on testing and validation
- Containers & Orchestration: Expert in Docker, Kubernetes, and GKE administration
- CI/CD: Deep experience with Cloud Build and GitLab pipelines
- Observability: Expert in monitoring, alerting, and SLO implementation
- Cost Management: Experience with Google Cloud Platform billing APIs and cost optimization techniques
- API Development: Strong skills in API design and implementation with authentication/authorization
- Automation: Proficient in Python and shell scripting for platform automation
- GitOps: Implementation experience with GitOps workflows
- Workflow Management: Experience with Astronomer/Airflow deployment and management
- Container Security: Knowledge of container security best practices
- Data Storage: Experience with storage class management and lifecycle policies