Title: Data Engineer
Experience: 10+
Location: Phoenix, AZ || Plano, TX || Newjersy || Providence, RI
Job SummaryWe are seeking an experienced Senior Data Engineer with 10+ years of expertise in designing, developing, and maintaining scalable data platforms and data pipelines. The ideal candidate should possess strong hands-on experience in AWS cloud services, big data technologies, ETL/ELT frameworks, data warehousing, and modern data engineering practices.
The candidate will work closely with business stakeholders, data architects, data scientists, and application teams to build robust, high-performance, and secure data solutions that support analytics, reporting, and machine learning initiatives.
Key Responsibilities- Design, develop, and maintain scalable data pipelines for batch and real-time processing.
- Build cloud-native data solutions using AWS services and modern data engineering frameworks.
- Develop ETL/ELT processes to ingest, transform, and load data from multiple structured and unstructured data sources.
- Design and implement data lakes and data warehouses for enterprise-scale analytics.
- Optimize data processing workflows for performance, reliability, and cost efficiency.
- Implement data quality checks, validation frameworks, and monitoring solutions.
- Collaborate with Data Scientists, BI teams, and business stakeholders to support analytics requirements.
- Develop reusable data frameworks and automation solutions.
- Implement security, governance, and compliance controls across data platforms.
- Troubleshoot production issues and perform root cause analysis.
- Mentor junior and mid-level data engineers.
- Participate in architecture reviews, code reviews, and best-practice initiatives.
- Support CI/CD implementations and infrastructure automation.
Required Technical SkillsAWS Cloud ServicesAWS Glue, Amazon S3, AWS Lambda, Amazon Redshift, Amazon EMR, Amazon Athena, AWS Lake Formation, AWS Step Functions, Amazon Kinesis, Amazon RDS, Amazon DynamoDB, Amazon CloudWatch, AWS IAM, AWS Data Pipeline
Data Engineering Technologies- Apache Spark (PySpark/Spark SQL)
- Hadoop Ecosystem
- Apache Kafka
- Apache Airflow
Programming Languages- Python (Advanced)
- SQL (Expert Level)
- Scala (Preferred)
- Java (Preferred)
Database Technologies- SQL Server
- Oracle
- PostgreSQL
- MySQL
- Snowflake
- Redshift
- MongoDB
Data Warehousing- Star Schema
- Snowflake Schema
- Data Modeling
- Dimensional Modeling
- Data Vault