Data Engineer-Hybrid- W2(New York,Dallas,Chicago)

Overview

Hybrid

Depends on Experience

Contract - W2

Contract - 12 Month(s)

Skills

PySpark

SQL

Testing

Unstructured Data

Web Scraping

Workflow

Data Quality

Amazon Web Services

Analytics

Apache Spark

Collaboration

Data Engineering

Data Science

Database

Databricks

Flat File

Microsoft Excel

Job Details

We are seeking an experienced Data Engineer to design and maintain scalable data pipelines in Databricks. The role involves end-to-end ownership of data workflows, from ingestion to production, ensuring performance, quality, and reliability.

Responsibilities

Build and optimize pipelines in Databricks (PySpark, SQL, Delta Lake).
Ingest data from databases, APIs, PDFs, Excel, flat files, and web scraping.
Implement data quality checks, validation, and testing frameworks.
Deploy, monitor, and optimize pipelines in Databricks/AWS.
Collaborate with data science/analytics teams and document data processes.
Qualifications

5+ years of Data Engineering experience.
Hands-on expertise in Databricks (PySpark, SQL, Delta Lake).
Strong experience with structured & unstructured data.
Knowledge of data quality/testing tools (Great Expectations, Deequ, dbt tests).
Proven ability to optimize large-scale Spark/Databricks workloads.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Job Details

Share