Job Description:
Must Haves:
7+ years of experience in DevOps or Site Reliability Engineering roles
Strong experience with AWS cloud services including EC2, Lambda, S3, ECS/EKS, and IAM
Proficiency with infrastructure as code tools (Terraform, CloudFormation, or AWS CDK)
Experience with containerization technologies (Docker, Kubernetes)
Strong scripting skills using Python, Bash, or similar
Experience implementing and maintaining CI/CD pipelines (GitHub Actions, Jenkins, or similar)
Knowledge of monitoring and observability tools (DataDog, Prometheus, Grafana)
Understanding of networking concepts including VPCs, subnets, security groups, and load balancers
Experience with configuration management tools (SaltStack, Ansible, Chef, or Puppet)
Ability to work collaboratively with cross-functional teams and communicate effectively with both technical and non-technical stakeholders
Highly Desired:
Experience with microservices architectures
Knowledge of database administration (RDS, Redshift)
Experience with log management solutions (ELK stack, CloudWatch)
Nice to Haves:
Experience with infrastructure security scanning tools
Experience with cloud cost optimization strategies
Background in software development