Job Details
Responsibilities:
Design and deploy cloud solutions in alignment with organizational policies, standards, and best practices
Identify observability and notification needs for the organization
Create and maintain deployment automation for new or updated dashboards or data sources
Lead efforts in enterprise data strategy, observability, stewardship, and metadata management to ensure data accuracy and accessibility
Build relevant cloud infrastructure to support observability, including:
- IAM roles and policies that enable data collection
- Security groups and rules that allow compliance enforcement traffic
- Deployment automation (e.g. Terraform) to create Prometheus and Grafana infrastructure
- SNS topics for sending notifications when alarms are triggered
Write scripts to extend the observability platform where there are gaps in coverage
Act as a technical leader within multidisciplinary, matrixed teams, promoting innovative solutions and emerging technologies to enhance operational efficiency and security
Required Skills
Minimum of five years of experience in cloud or hybrid environments, with a strong focus on cloud architecture and observability
Deep understanding of enterprise data strategy, observability practices, and stewardship principles, along with knowledge of data standards and metadata management
Experience architecting cloud-based observability solutions and familiarity with container-based architectures, including Kubernetes
Experience with infrastructure-as-code tools such as Terraform
Experience with Python and the AWS SDK
Experience with CI/CD pipeline automation tools such as GitHub Actions
Stand Out Skills
Direct experience with observability platforms such as Prometheus, Grafana, and Elastic Stack
Experience with hybrid cloud configurations
Experience in an organization with mature observability practices
*Please note: this position cannot provide sponsorship, and candidates MUST be able to work on a W2 basis only*