Overview
Skills
Job Details
Data Engineer
NYC,NY
C2C/W2
Role Overview
We are seeking an experienced AWS Data Platform Senior Engineer to join our client s team. In this role, you will be responsible for designing, building, and optimizing cloud-native data solutions on AWS Data Platform. You will collaborate directly with client stakeholders, data architects, and analysts to deliver scalable, secure, and high-performance data platforms that support advanced analytics, reporting, and AI/ML initiatives for value streams.
Key Responsibilities
Design, develop, and maintain data pipelines and ETL/ELT processes using AWS-native and complementary tools.
Build and manage data lakes with AWS Lake Formation, including fine-grained access control and governance.
Develop and optimize large-scale data processing solutions using Spark on EMR (EC2).
Implement and manage data warehouse and analytics solutions using Redshift, Athena, and Glue.
Partner with client teams to integrate multiple data sources and downstream applications.
Ensure platform security and compliance using IAM, KMS, encryption, and governance frameworks.
Monitor, troubleshoot, and optimize data pipelines and query performance across Athena, Redshift, and EMR.
Contribute to engineering standards, code reviews, and best practices, while mentoring junior engineers.
Required Skills & Experience
7+ years of professional experience in data engineering, with at least 3+ years on AWS data platforms.
Strong expertise with AWS services:
Data Lake & Governance: S3, Lake Formation
Data Processing & Analytics: EMR on EC2, Spark, Athena, Redshift, Glue, Kinesis
Orchestration & Monitoring: CloudWatch, Step Functions, Airflow (or similar)
Proficiency in SQL and at least one programming language (Python, Scala, or Java).
Solid understanding of data modeling, schema design, and query optimization.
Hands-on experience with infrastructure-as-code (Terraform, CloudFormation, or CDK).
Knowledge of DevOps practices and CI/CD for data engineering (GitHub Actions, CodePipeline, etc.).
Experience with structured, semi-structured (JSON, Parquet, Avro), and unstructured data.
Strong communication skills with the ability to work independently in client-facing teams.
Must have skills Strong hands on programming exp in python, experience in creating and managing data platforms on AWS with IaaC ( terraform ), Spark, EMR, Lake formation, Athena
Good to Have Skills Apache Hudi, Iceberg , Trino, Design Data Products, Data fabric
Share resume at abhinav at apextgidotcom