Lead Data Engineer (Iceberg/Trino)

Overview

Remote
Depends on Experience
Contract - W2
Contract - Independent
Contract - 18 Month(s)
10% Travel

Skills

Apache HTTP Server
HIPAA
Data Engineering
Orchestration
PB
Regulatory Compliance
Python
java
jvm
scala
DAG
airflow
lakehouse
iceberg
insurance
telecommunications
healthcare

Job Details

LEAD DATA ENGINEER (ICEBERG/TRINO)
Building out Lakehouse environment managing over 28PB of data and processes 10M+ transactions, requiring engineers who ve operated at true enterprise scale
Deep, production-level expertise in Apache Iceberg and distributed query engines.
Build/manage Iceberg tables with schema evolution, partitioning, and ACID guarantees.
Integrate Trino/Nessie/Ranger into the Lakehouse environment.
Support ingestion/orchestration pipelines (Airflow, AirByte).
Ensure performance and compliance in a HIPAA-regulated environment.
REQUIREMENTS
8+ years in Data Engineering; 2+ with Iceberg in production.
Strong Python + JVM; Airflow DAG authoring experience.
Familiarity with Ranger for security/auditing.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.