AWS Support Engineer / Cloud Support Engineer

Chicago, IL, US • Posted 4 hours ago • Updated 4 hours ago
Contract W2
No Travel Required
On-site
Depends on Experience
Fitment

Dice Job Match Score™

🔗 Matching skills to job...

Job Details

Skills

  • API
  • Accountability
  • Amazon Kinesis
  • Amazon Web Services
  • Cloud Computing
  • Collaboration
  • Communication
  • Continuous Delivery
  • Continuous Improvement
  • Continuous Integration
  • Customer Facing
  • DevOps
  • Documentation
  • IT Service Management
  • Incident Management
  • Knowledge Transfer
  • Management
  • Messaging
  • Microsoft Azure
  • Operational Efficiency
  • Production Support
  • ROOT
  • Root Cause Analysis
  • ServiceNow
  • Soft Skills
  • Streaming
  • User Stories
  • Workflow

Summary

Job Description

The resource will provide platform stability and production support for AWS/cloud-based services, including on-call support, incident management, and operational readiness.  This role is responsible for developing and maintaining runbooks, coordinating technical resolution across teams and driving continuous improvement, while serving as the primary technical liaison between Caterpillar platform teams, product partners and support organizations.

 

Education & Experience Required:

  •  Degree not required but a nice to have. ( Top candidates will have a degree)
  • 2-4 years’ experience is a HARD requirement.

 

Position’s Contributions to Work Group:

  • Incident Management & Cross‑Functional Collaboration - Proven experience owning and managing incident tickets through their full lifecycle, working closely with cross‑functional teams to triage issues, coordinate resolution, communicate status, and drive root cause and follow up actions.
  • Cloud Platforms & API Operations (AWS) - Demonstrated experience owning and supporting AWS hosted APIs and services in production, with strong knowledge of API design, AWS core services, security, monitoring, incident management, and driving reliability and operational readiness.
  • Operational Readiness & Support Enablement - Proven experience understanding end‑to‑end technical and business flows, developing clear and actionable runbooks, and ensuring support teams are fully prepared through documentation, knowledge transfer, and operational readiness activities.

 

Required Technical Skills

(Required)

  • Experience supporting production grade, customer facing platforms in complex, multi‑team environments.
  • A demonstrated ownership mindset, taking accountability for service stability, incident outcomes, and follow through beyond initial investigation.
  • Strong understanding of AWS Kinesis streaming and messaging services, containerized and serverless compute using Fargate and Lambda, and CI/CD pipeline implementation using Azure DevOps.
  • Experience utilizing ServiceNow for incident management and Azure Devops for features, user stories, etc.
  • Proven ability to partner effectively with engineering, product, and platform teams to resolve issues and improve operational efficiency.
  • Experience driving root cause analysis and continuous improvement, turning incidents into long term reliability gains.
  • Strong understanding of operational readiness standards, including monitoring, alerting and runbooks.
  • Comfort operating in on-call or escalation roles, maintaining composure and clear communication during high impact incidents.
  • Ability to identify gaps in processes or tooling and proactively improve support models, documentation, or workflows.
  • Experience working within enterprise ITSM frameworks.

 

Soft Skills

(Required)

  • Strong communication skills, with the ability to translate technical issues into clear status and impact updates for stakeholders

 

Typical task breakdown:

  • Own incident tickets through the full lifecycle, from initial triage to resolution and closure.
  • Collaborate with engineering, platform, product, and operations teams to diagnose issues and coordinate fixes.
  • Communicate incident status, impact, and resolution progress to stakeholders.
  • Lead or contribute to root cause analysis and ensure follow up actions are identified and tracked.
  • Ensure platform reliability through monitoring, alerting, security, and operational best practices.
  • Respond to and manage production incidents impacting AWS services and APIs.
  • Drive reliability, stability, and operational readiness improvements across cloud platforms.
  • Understand end‑to‑end technical and business flows to support production services effectively.
  • Develop, maintain, and improve clear, actionable runbooks for operational support.
  • Lead knowledge transfer sessions to ensure support teams are ready for production support.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10109811
  • Position Id: 8922852
  • Posted 4 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Chicago, Illinois

Today

Easy Apply

Contract

Depends on Experience

Chicago, Illinois

7d ago

Full-time

USD 165,000.00 - 195,000.00 per year

Hybrid in Chicago, Illinois

13d ago

Easy Apply

Contract

Depends on Experience

Hybrid in Northbrook, Illinois

Yesterday

Full-time

USD 92,000.00 - 138,000.00 per year

Search all similar jobs