Sr. Cloud Engineer

Overview

Remote
Depends on Experience
Contract - Independent
Contract - W2
Contract - 60 Month(s)

Skills

Bash
Amazon Web Services
Amazon Lambda
DevOps
GitLab
Continuous Delivery
Continuous Integration
Python
Terraform

Job Details

Title: Sr. Cloud Engineer

Location: REMOTE

Description:
Client is seeking a highly skilled Sr. Cloud Engineer to support the daily operations and long-term reliability of our cloud-based infrastructure. This role is critical for ensuring uptime, performing proactive maintenance, troubleshooting issues and implementing fixes across our cloud environments. You will work closely with development, operations and security teams to ensure the scalability, performance and security of cloud applications. The ideal candidate will be responsible for maintaining cloud-based applications and infrastructure on AWS.

Responsibilities:
Deploy applications across multiple environments (dev, staging, prod) and ensure consistency and stability
Build reusable pipeline templates, jobs and stages for CI/CD consistency across teams
Collaborate with developers to containerize and deploy applications using ECS and Lambda
Configure GitLab Runners and manage environment-specific variables and secrets
Define and deploy readiness and liveness probes for containers running in EKS/ECS
Write custom scripts for CloudWatch custom metrics and alarms based on application specific probes Monitor deployments and system health using CloudWatch and other tools
Implement rollback strategies and manage version control during deployments
Troubleshoot and resolve deployment issues and improve pipeline performance and reliability
Proficient with Python, Bash, YAML/JSON, Node.js, Lambda functions
Perform daily health checks using AWS CLI or scheduled Lambda scripts to check health and log/report results
Set up monitoring thresholds, dashboards, and metrics for application and infrastructure
Perform root cause analysis and incident correlation using monitoring and performance analysis tools
Maintain a central inventory of all licensed software deployed in AWS environments
Maintain accurate documentation on infrastructure and procedures
Patch assessment and maintenance of infrastructure software, to include third party software patches
Develop a patch testing schedule and rollout plan to include rollback and recovery
Create and manage change records. Participate in PI planning/ Agile ceremonies
Keep cloud environments compliant with security standards and best practices
Orchestrate failover and restoration of ECS/ EKS services, Lambda functions, databases and other infrastructure components
Test and document regional failover playbooks and recovery runbooks
Ensure compliance with RTO (Recovery Time Objective) and RPO (Recovery Point Objective) requirements
Participate in on-call rotations to support 24/7 production systems and respond to incidents as they arise

Required Qualifications:
BA/BS in IT, Computer Science or related field (or equivalent work experience may be accepted in lieu of the degree
8+ years of IT experience. 2+ years of experience in cloud support, infrastructure maintenance or IT operations.
Experience with Infrastructure as Code (Terraform, CloudFormation)
Strong proficiency in AWS Lambda (writing, deploying and, optimizing)
Hands-on experience with CI/CD tools (GibHub, GitLab, Kubernettes, DevOps)
Scripting skills for automation and maintenance tasks (Bash, Python)
Cloud certifications (AWS DevOps Engineer, Solutions Architect Associate)
Strong written and verbal communication skills for technical and non-technical stakeholders
Excellent analytical and problem-solving skills

Preferred Qualifications:
Ability to diagnose performance issues in cloud environments
Pre-check and post-check scripts for validating system health

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.