Experience Level:
Lead: 10+ years
Role Overview
We are seeking a skilled Automation Execution & Triage Engineer to support the reliable operation of automation systems and environments. This role is focused on executing existing automation workflows, identifying and resolving failures, and coordinating with relevant teams to ensure smooth operations. . The candidate must be based in Redmond, WA, and work onsite in a customer environment.
Key Responsibilities
Automation Execution: Run and monitor existing automation tasks across various environments. Ensure timely and successful completion of scheduled runs and maintain execution logs.
Failure Triage: Investigate and triage failures in automation and system environments. Determine whether issues stem from automation scripts, infrastructure, hardware, or external dependencies.
Issue Routing: Escalate hardware-related issues to the appropriate teams with detailed triage documentation. Route software or framework-related issues to the respective engineering teams.
Environment Support: Monitor and maintain the health of test and operational environments. Resolve issues such as configuration mismatches, resource constraints, or system outages that impact automation.
Collaboration: Work closely with infrastructure, support, and engineering teams to ensure timely resolution of issues. Provide clear documentation and updates throughout the triage process.
Required Qualifications
Bachelor s degree in Engineering, Computer Science, or a related field.
6 10 years of experience in automation execution, infrastructure support, or system operations.
Strong troubleshooting and analytical skills.
Familiarity with Linux/Unix environments.
Proficiency in scripting languages such as Python, Bash, and configuration formats like YAML.
Experience in identifying and resolving system-level issues in complex environments.
Must be based in Redmond, WA, and able to work onsite in a customer environment.
Anyone who worked with Amazon in the past 3 years should not have record of PIP/Focus.
Ready to work in shifts and weekend as part of 24/7 coverage
Preferred Qualifications
Experience in large-scale operational environments.
Familiarity with telemetry systems, log analysis, or system diagnostics.
Strong communication and documentation skills.
Prior experience in triage or support engineering roles.
Must have HW knowledge to differentiate SW, HW and Environmental issues while triaging. Experience in dealing with HW/Device testing, triaging.