Data Engineer with PySpark & DPL

Overview

On Site
Full Time
Part Time
Accepts corp to corp applications
Contract - Independent
Contract - W2

Skills

PySpark
DPL
Python
Spark
Hadoop
Delta Lake
SQL
ETL/ELT
AWS
Azure
GCP
NoSQL
CI/CD
Git
Docker

Job Details

Role: Data Engineer with PySpark & DPL

Location: Jersey City, NJ

Required Skills & Qualifications

  • Strong hands-on expertise with:
    • PySpark (RDD, DataFrames, Spark SQL, performance tuning)
    • DPL (Data Pipeline Language / relevant tool-specific DPL)

  • Proficiency in Python for data engineering workflows.
  • Experience with distributed computing and big data technologies (Spark, Hadoop, Delta Lake).
  • Strong SQL skills and experience with relational and NoSQL databases.
  • Experience building ETL/ELT pipelines on cloud platforms (AWS / Azure / Google Cloud Platform).
  • Familiarity with CI/CD, Git, and containerization (Docker/Kubernetes) is a plus.
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field.

Preferred Skills

  • Experience with orchestration tools (Airflow, ADF, Argo, Prefect).
  • Knowledge of data warehousing concepts (Star schema, SCD, normalization).
  • Experience with streaming platforms (Kafka, Kinesis, Spark Streaming).
  • Exposure to data governance, security, and compliance frameworks.
  • Experience working in Agile environments.

Thanks & Regards,

Shilpa

US IT Recruiter

TekLeaders Inc

5151 Headquarters Dr. Suite 105

Plano TX 75024

tekleaders.com
