ETL Data Architect (PySpark / Spark)

• Posted 15 days ago • Updated 6 days ago
Full Time
$DOE
Fitment

Job Details

Skills

  • ETL
  • PySpark

Summary

HMG America LLC is a business-solutions-focused information technology company offering IT consulting and services, software and web development, staff augmentation, and other professional services. One of our direct clients is looking for an ETL Data Architect (PySpark / Spark) for a remote position. The detailed job description follows.

Title: ETL Data Architect (PySpark / Spark)
Work Mode: Remote
Employment Type: Full-time

Role Overview

We are looking for a highly skilled ETL Data Architect with strong expertise in PySpark and Apache Spark to design and lead scalable data integration and transformation solutions. This role is ideal for a senior-level (L5) professional who thrives in a consulting-driven environment and can architect modern, cloud-based ETL frameworks.

Key Responsibilities
  • Design and architect ETL/ELT pipelines using PySpark and Spark
  • Define data architecture standards, frameworks, and best practices
  • Lead data modeling, transformation strategies, and optimization efforts
  • Build scalable solutions for batch and streaming data processing
  • Collaborate with business stakeholders, analysts, and engineering teams
  • Ensure data quality and governance, and tune pipeline performance
  • Drive cloud-native data platform implementations
  • Provide technical leadership and mentoring

Required Skills & Experience
  • Strong experience as a Data Architect / ETL Architect
  • Deep hands-on expertise with Apache Spark
  • Advanced proficiency in PySpark
  • Expertise in data pipeline design and optimization
  • Strong SQL skills and a solid grasp of data warehousing concepts
  • Experience with distributed data processing
  • Knowledge of data lake / lakehouse architectures
  • Familiarity with cloud platforms (AWS / Azure / Google Cloud Platform)

Preferred Qualifications
  • Experience in consulting or client-facing roles
  • Knowledge of streaming frameworks (Spark Streaming / Kafka)
  • Experience with workflow orchestration (Airflow, etc.)
  • CI/CD and DevOps practices
  • Cloud certifications (nice to have)

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions, and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10481867
  • Position Id: 2026-17459