Position Title
Cloud Infrastructure / Kubernetes Architect
Duration
Long-Term Contract
Location
Remote. If requested for a second-round interview, the candidate must be able to attend in person.
Position Overview
The Child Support Bureau Information Technology Department is seeking an experienced Cloud Infrastructure / Kubernetes Architect responsible for ensuring the quality and consistency of software architecture while providing day-to-day technical guidance to development teams.
The ideal candidate will possess strong expertise in AWS cloud infrastructure, Kubernetes, observability platforms, identity management, logging, automation, and enterprise architecture practices. This role requires collaboration with cross-functional teams to build scalable, secure, and highly available solutions.
Key Responsibilities
Cloud Infrastructure & Platform Engineering
· Design, deploy, and maintain AWS cloud infrastructure.
· Manage AWS services including EC2, EFS, RDS, ALB, IAM, and S3.
· Implement scalable, secure, and highly available cloud solutions.
· Support backup, recovery, and disaster recovery procedures.
Kubernetes Administration
· Manage and operate Kubernetes and Amazon EKS clusters.
· Configure deployments, integrations, and scaling, and troubleshoot cluster issues.
· Implement event-driven scaling solutions using KEDA.
· Manage Helm chart deployments and lifecycle.
Observability & Monitoring
· Design and maintain monitoring solutions using Prometheus, Alertmanager, and Grafana.
· Configure dashboards, alerting, and operational visibility.
· Monitor system reliability, capacity, and performance.
· Support Splunk integrations and operational monitoring.
Identity & Security
· Implement and support Keycloak identity and access management solutions.
· Configure security controls and access management.
· Support vulnerability remediation and operational security initiatives.
Logging & Data Services
· Implement Fluent Bit log forwarding solutions.
· Support MongoDB environments.
· Work with vector database technologies, including pgvector.
· Maintain operational logging and monitoring frameworks.
Automation & Document Management
· Develop and support SmartDocument automation solutions.
· Create technical documentation, architecture diagrams, and operational procedures.
· Support automation initiatives across the platform.
Collaboration & Leadership
· Provide technical leadership and architectural guidance.
· Collaborate with development, infrastructure, and business teams.
· Translate business requirements into technical solutions.
· Establish operational best practices and performance standards.
Required Qualifications
· Strong AWS infrastructure experience (EC2, EFS, RDS, ALB, IAM, S3)
· Experience managing Kubernetes and EKS environments
· Experience with Prometheus, Grafana, and Alertmanager
· Experience with NGINX Ingress or NGINX Gateway
· Experience with Fluent Bit log forwarding
· Experience with Keycloak IAM
· Experience using KEDA for event-driven Kubernetes scaling
· Experience with MongoDB
· Experience managing Helm deployments
· Experience with Jira, Confluence, and Bitbucket
· Experience with Splunk
· Strong system performance tuning and reliability engineering experience
· Experience with backup and disaster recovery planning
· Strong communication and documentation skills
Preferred Qualifications
· Knowledge of pgvector and vector database extensions
· Experience with SmartDocument solutions
· Experience supporting enterprise-scale cloud architectures
· Experience with modern observability and platform engineering practices