Job Details
Job Title: Data Engineer
Location: Richmond, VA or Plano, TX (onsite or hybrid, depending on client needs)
Type: Long-Term Contract
Job Description:
We are seeking a highly skilled and motivated Data Engineer for a long-term contract opportunity. The ideal candidate will have hands-on experience working with AWS data services and building scalable data pipelines. This position requires a strong understanding of AWS Glue, EMR, SQL, and Python to support data transformation, migration, and analysis.
Responsibilities:
Design, develop, and maintain scalable data pipelines using AWS Glue and EMR.
Write efficient SQL and Python scripts for data processing and transformation.
Perform data ingestion from multiple sources, including structured and unstructured formats.
Work closely with data analysts, data scientists, and business stakeholders to understand data requirements.
Monitor data pipeline performance and troubleshoot issues promptly.
Optimize AWS Glue jobs and EMR clusters for performance and cost efficiency.
Ensure data quality and integrity throughout the ETL lifecycle.
Required Skills:
3+ years of experience as a Data Engineer or in a similar role.
Strong proficiency in Python and SQL for data processing tasks.
Solid hands-on experience with AWS Glue and Amazon EMR.
Good understanding of Apache Spark, as used through AWS Glue or Amazon EMR.
Experience with data warehousing concepts and ETL development.
Familiarity with S3, Lambda, and IAM for managing data workflows on AWS.
Knowledge of version control systems like Git.
Strong problem-solving skills and attention to detail.