Data Engineer

Overview

Remote
Depends on Experience
Full Time
No Travel Required

Skills

Python
Java
AWS
Airflow
Prefect
Dagster
DBT
Snowflake
Databricks
Hands-on DBT experience
Strong CS fundamentals
HIPAA familiarity
familiarity with healthcare compliance
Startup experience
big data tools

Job Details


This position is 100% remote
This position is a Full time Direct Hire

Tech Stack

  • Languages: Python (primary), Java

  • Tools: Airflow, Prefect, Dagster, DBT, Snowflake, Databricks

  • Cloud: AWS

  • Focus Areas: Data orchestration, compliance, scalability, lineage

Key Responsibilities

  • Design and implement scalable data orchestration and lineage systems

  • Build validation, transformation, and compliance workflows

  • Optimize large-scale data storage for cost and performance

  • Own projects end-to-end, from architecture to production

  • Contribute as a generalist across the data platform and engineering stack

Must-Have Qualifications

  • 6+ years of experience in data engineering
    (flexible for exceptional candidates with startup backgrounds)

  • Exceptional Python skills

  • Experience with orchestration tools: Airflow, Prefect, or Dagster

  • Proficient in Snowflake or Databricks

  • Strong AWS cloud experience

  • Hands-on with DBT for data transformations, testing, and documentation

  • Java proficiency

  • Strong CS fundamentals and a technical undergraduate degree

  • Experience in early-stage startups (sub-30 person teams; seed or Series A)

  • Genuine excitement about the AI infrastructure movement

  • Must not have too many short stints Sub 2 years

Preferred Qualifications

  • Experience working with unstructured data (PDFs, videos, imaging)

  • Familiarity with healthcare compliance (e.g., HIPAA)

  • Experience at companies with strong engineering cultures

  • Prior experience as a founding data engineer or early technical hire

Pre-Screen Questions (Required for submission)

  1. Tell me about your startup experience.

  2. Tell me about your experience with big data tools, particularly Python and Java, and your understanding of data orchestration and scaling.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About SmartTech Staffing Partners