AWS / Cloud Platform Support Engineer

Overview

On Site
Depends on Experience
Full Time

Skills

AWS services including Glue
EKS
Athena

Job Details

Direct-Hire / FTE

Foster City, CA 94404 (Onsite)

Skills / Experience

  • 7+ years of experience with AWS services including Glue, EKS, Athena, S3, ECS, and ASG, with strong capabilities in monitoring, validation, deep-dive troubleshooting, and advanced incident resolution
  • 7+ years of experience working with AWS Serverless services such as Lambda, API Gateway, and DynamoDB, including log analysis, root cause identification, and complex issue resolution
  • 7+ years of experience with Terraform / Infrastructure as Code (IaC), capable of designing, reviewing, and troubleshooting infrastructure deployments and managing environment-level issues
  • 7+ years of experience with containerization and orchestration using Docker, Helm, and Kubernetes, including advanced pod/service troubleshooting and collaboration with platform teams
  • 7+ years of experience with Git Flow and CI/CD pipelines, handling pipeline design, failure analysis, and release coordination across teams
  • Experience with microservices and event-driven architectures, enabling end-to-end system analysis, incident root cause analysis, and ownership of L3 support resolution
  • Secondary skills: scripting (Python/Bash), monitoring and observability tools (CloudWatch, Prometheus, Grafana), and a solid understanding of security, networking, and compliance best practices. The role also benefits from experience in incident management, documentation, and mentoring L2 teams, along with exposure to data platforms and analytics workloads
  • Prior experience working on Agile/Scrum projects, with exposure to tools such as Jira or Azure DevOps
  • Bachelor's degree or higher in Information Systems, Computer Science, or equivalent experience
  • Provide constructive feedback during reviews and be open to receiving feedback
  • Strong interpersonal skills to build and maintain productive relationships with team members
  • Problem-solving and analytical thinking, with the ability to troubleshoot and resolve issues efficiently
  • Analytical mindset; provides regular updates and is proactive and diligent in carrying out responsibilities
  • Communicate effectively with technical and non-technical stakeholders, both verbally and in writing (email and instant messaging)

Job / Role Description

The L3 Cloud & Platform Support Engineer is responsible for providing advanced technical support and ownership of complex incidents across cloud-native platforms. The role requires deep hands-on expertise in AWS, serverless and container technologies, Infrastructure as Code, CI/CD pipelines, and distributed systems to ensure platform stability, scalability, and reliability.

Expected Outcome

The expected outcome of this role is to ensure high availability, stability, and reliability of cloud platforms by owning and resolving complex L3 incidents end to end. The role will drive faster recovery and fewer repeat issues through strong root cause analysis, preventive fixes, and well-governed infrastructure deployments using Terraform. It will enable smooth and predictable releases, optimized performance of microservices and event-driven systems, and improved operational maturity through enhanced monitoring, automation, documentation, and effective knowledge transfer to L2 teams.

About Marici Solutions