Sr Data Engineer

Remote • Posted 2 hours ago • Updated 2 hours ago
Contract Independent
Contract W2
Remote
Depends on Experience
Fitment

Dice Job Match Score™

📊 Calculating match score...

Job Details

Skills

  • Apache Iceberg
  • Project Nessie
  • Apache Ranger
  • Trino
  • Ray
  • Apache AirByte
  • Apache Airflow
  • Apache Pinot
  • Apache Superset
  • Amundsen & OpenLineage
  • OpenStack

Summary

CLIENT IS NOT SPONSORING VISAS AT THIS TIME

Sr Data Engineer

Remote role

Contract role, but may go perm down the line

They would like to have at least 2-3 of the technologies together. They are building out the data lakehouse.

We re standing up a modern, open-source Data Lakehouse in our datacenter, engineered for petabyte-scale analytics and real-time workloads.

The core stack includes:

  • Apache Iceberg (table format, ACID, time travel, schema evolution)
  • Project Nessie (as the Iceberg catalog/metastore replacing Hive Metastore)
  • Apache Ranger (security and audit layer for Trino, Iceberg, and the lakehouse)
  • Trino (distributed SQL engine)
  • Ray (scalable Python/AI compute)
  • Apache AirByte (data ingestion)
  • Apache Airflow (workflow orchestration)
  • Apache Pinot (real-time OLAP)
  • Apache Superset (BI/visualization)
  • Amundsen & OpenLineage (data governance, discovery, lineage)
  • Pure Storage FlashBlade/FlashArray (object/block storage)
  • OpenStack (IaaS, K8s orchestration)

What Client needs:

  • Data engineers and architects with hands-on experience in Iceberg, Nessie, Ranger, Trino, and Airflow at our scale. Read about it or saw it one time is not going to cut it for us.
  • Folks who know their way around Kubernetes and IaC.
  • Python and JVM skills are a must; bonus points for Kotlin, C#, and experience with Airflow DAGs and AirByte connectors.
  • Real-world experience with data governance (Amundsen, OpenLineage), security frameworks, and high-availability deployments. This is a highly regulated HIPPA environment.
  • Ability to work as part of a cross-functional team, help design and implement scalable, secure, and automated data pipelines.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10112333
  • Position Id: 8907361
  • Posted 2 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Remote

4d ago

Easy Apply

Contract, Third Party

Depends on Experience

Remote or McLean, Virginia

Today

Full-time

USD 130,000.00 - 216,000.00 per year

Remote or Hybrid in Charlotte, North Carolina

Today

Easy Apply

Contract

$75

Remote or Santa Ana, California

Today

Full-time

USD 129,300.00 - 172,300.00 per year

Search all similar jobs