AWS Data Engineer
Remote
AWS Cloud & Data Engineering Skills
- AWS Glue: Developing ETL pipelines, working with the Glue Data Catalog, and transforming data using PySpark
- AWS Lambda: Writing serverless functions (Python, Node.js) to process, move, or transform data
- AWS S3: Managing storage, organizing Qualtrics input/output files, and optimizing data retrieval
- AWS Step Functions: Orchestrating Glue and Lambda jobs into automated workflows
- AWS Athena: Querying structured data stored in S3 for quick analysis
- AWS CloudWatch & Logging: Monitoring job execution and debugging errors
- AWS MWAA (Managed Workflows for Apache Airflow): Orchestrating jobs with Airflow
Data Processing & ETL Skills
- Data Transformation & Aggregation: Merging dealer master and dealer employee files into a single hierarchy file
- ETL Workflow Automation: Automating ingestion, transformation, and report generation
- Data Format Handling: Processing CSV, JSON, or XML files from Qualtrics
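
To give candidates a feel for the dealer hierarchy task, here is a minimal sketch of joining the two files in plain Python. The column names (`dealer_id`, `dealer_name`, `region`, `employee_id`, `employee_name`) and the choice to drop employees with no matching dealer are assumptions made for illustration; the actual file layouts are defined by the business.

```python
import csv
import io


def build_hierarchy(dealer_csv: str, employee_csv: str) -> list[dict]:
    """Join employee rows to their dealer and return flat hierarchy rows.

    All column names here are hypothetical placeholders.
    """
    # Index the dealer master file by dealer_id for fast lookup.
    dealers = {
        row["dealer_id"]: row
        for row in csv.DictReader(io.StringIO(dealer_csv))
    }

    hierarchy = []
    for emp in csv.DictReader(io.StringIO(employee_csv)):
        dealer = dealers.get(emp["dealer_id"])
        if dealer is None:
            # Assumption: employees without a known dealer are skipped.
            continue
        hierarchy.append(
            {
                "region": dealer["region"],
                "dealer_id": dealer["dealer_id"],
                "dealer_name": dealer["dealer_name"],
                "employee_id": emp["employee_id"],
                "employee_name": emp["employee_name"],
            }
        )
    return hierarchy


if __name__ == "__main__":
    dealer_data = "dealer_id,dealer_name,region\nD1,North Motors,East\n"
    employee_data = (
        "employee_id,dealer_id,employee_name\n"
        "E1,D1,Ana\n"
        "E2,D9,Ben\n"
    )
    for row in build_hierarchy(dealer_data, employee_data):
        print(row)
```

In a Glue job the same join would more likely be written with PySpark DataFrames; the plain-Python version only shows the shape of the logic.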
API & Integration Skills
- Qualtrics API: Extracting raw data, managing survey responses, and automating data pulls
- AWS Event-Driven Processing: Triggering Lambda jobs based on new files arriving in S3
- Integration with External Systems: Connecting AWS with CRM, ERP, or BI tools
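
As an illustration of the event-driven item above, the sketch below shows how a Lambda handler might read the bucket and key from an S3 event notification. The `.csv` filter and the shape of the return value are assumptions for the example, not a description of the team's existing pipeline. Object keys arrive URL-encoded in S3 events, so they are decoded before use.

```python
from urllib.parse import unquote_plus


def extract_s3_objects(event: dict) -> list[tuple[str, str]]:
    """Return (bucket, key) pairs from an S3 event notification payload."""
    objects = []
    for record in event.get("Records", []):
        s3 = record.get("s3")
        if not s3:
            continue  # ignore records that are not S3 notifications
        bucket = s3["bucket"]["name"]
        # Keys are URL-encoded in the event (spaces arrive as '+').
        key = unquote_plus(s3["object"]["key"])
        objects.append((bucket, key))
    return objects


def lambda_handler(event, context):
    """Hypothetical handler: list the newly arrived CSV files."""
    csv_objects = [
        (bucket, key)
        for bucket, key in extract_s3_objects(event)
        if key.lower().endswith(".csv")  # assumption: only CSVs matter
    ]
    # A real job would start a Glue run or Step Functions execution here.
    return {"processed": [f"s3://{b}/{k}" for b, k in csv_objects]}
```

Keeping the event parsing in its own function makes it easy to unit test without deploying to AWS.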
Data Analysis & Reporting
- Data Validation & Quality Checks: Ensuring accuracy in processed files before Qualtrics ingests them
- Report File Generation: Formatting data outputs according to business-defined templates
- Basic SQL & Athena Queries: Extracting insights from processed data
Programming & Scripting
- Python: Writing data processing scripts for Lambda, Glue (PySpark), and automation
- SQL: Querying and managing structured datasets
- Shell Scripting (Bash): Automating file movement and data preparation
DevOps & Security (Nice to Have)
- IAM Permissions & Role Management: Ensuring AWS services have correct access
- Infrastructure as Code (IaC): Using Terraform or CloudFormation for AWS resource setup
- CI/CD for Data Pipelines: Automating deployment of Lambda and Glue scripts
Ideal Background for This Role
- Experience with AWS data engineering and serverless architecture
- Strong Python and SQL skills
- Familiarity with the Qualtrics API and structured survey data
- Ability to troubleshoot data pipeline failures and performance issues