Job Title: Data Engineer
Location: NJ, NY, CT, PA, DE(onsite)
Job Summary:
We are looking for a Data Engineer to design, develop, and maintain modern data pipelines and infrastructure supporting analytics and business intelligence initiatives. The ideal candidate will have strong experience in cloud data platforms, ETL development, and data modeling, with hands-on coding skills in Python and SQL.
Key Responsibilities:
Build and optimize data ingestion and transformation pipelines for large-scale datasets.
Develop ETL/ELT workflows using modern tools and frameworks.
Design and implement data models for analytics and reporting.
Work closely with data analysts, scientists, and architects to ensure data quality and reliability.
Manage data integration between various on-premise and cloud systems.
Monitor and improve data pipeline performance, cost, and scalability.
Ensure data governance, security, and compliance standards are followed.
Required Technical Skills:
Cloud Platforms: AWS (Glue, S3, Lambda, Redshift) or Azure (Data Factory, Synapse) or Google Cloud Platform (BigQuery).
ETL/ELT Tools: Apache Spark, Databricks, Airflow, Kafka, or similar.
Pogramming: Strong experience with Python and SQL.
Data Warehousing: Hands-on with Snowflake, Redshift, or BigQuery.
Database Systems: Experience with RDBMS and NoSQL databases.
Version Control / CI-CD: Git, Jenkins, or similar tools.
Preferred Skills:
Experience with DBT for data transformations.
Exposure to Terraform or CloudFormation for infrastructure as code.
Familiarity with containerization (Docker, Kubernetes)
Understanding of streaming data technologies (Kafka, Kinesis).