Lead Data Lakehouse Engineer (Iceberg)

Overview

Remote
Depends on Experience
Full Time
10% Travel

Skills

Amazon Web Services
Apache Airflow
Apache HTTP Server
Apache Kafka
Apache Parquet
Apache Ranger
Business Intelligence
Cloud Computing
Data Engineering
Data Governance
DevOps
Extract
Transform
Load
Google Cloud Platform
Meta-data Management
Open Source
Orchestration
Query Optimization
Real-time
Storage
Streaming

Job Details

We re seeking a Lead Data Lakehouse Architect to design and lead the implementation of our next-generation, petabyte-scale data lakehouse built on Apache Iceberg. This role will set the foundation for our open-source data ecosystem, integrating Trino as the query layer and driving platform strategy for ingestion, orchestration, and governance.

Key Responsibilities

  • Architect and implement a modern Iceberg-based data lakehouse to support large-scale, real-time analytics and batch workloads.

  • Design high-performance query layers using Trino for federated and interactive querying.

  • Define architecture for metadata catalogs, storage tiers, partitioning strategies, schema evolution, and time travel.

  • Collaborate with data engineering, DevOps, and analytics teams to ensure seamless integration with streaming, ETL, and BI tools.

  • Establish platform standards, security models (e.g., Apache Ranger), data governance policies, and reliability SLAs.

  • Act as the technical authority on scaling Iceberg and Trino in production at petabyte scale.

Required Skills

  • 7+ years in data platform or architecture roles, with 2+ years hands-on with Apache Iceberg at scale.

  • Strong expertise in Trino (or Presto), query optimization, and federated query architectures.

  • Deep knowledge of orchestration (Apache Airflow), streaming (Apache Kafka), and open-source governance tools.

  • Strong foundation in distributed systems, storage formats (Parquet, ORC), and schema management.

  • Experience deploying hybrid or on-prem cloud data platforms (AWS, Azure, Google Cloud Platform).

  • Excellent communication and leadership skills.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.