Data Engineer

• Posted 7 hours ago • Updated 7 hours ago
Full Time
Fitment

Job Details

Skills

  • Startups
  • Google Analytics
  • Recruiting
  • Core Data
  • Workflow
  • SQL
  • Value Engineering
  • Analytical Skill
  • Modeling
  • Python
  • Orchestration
  • Amazon Web Services
  • Cloud Computing
  • Data Warehouse
  • Snowflake Schema
  • FOCUS
  • Data Quality
  • Machine Learning Operations (ML Ops)
  • Augmented Reality
  • Valuation
  • Real-time
  • Analytics
  • Machine Learning (ML)
  • Artificial Intelligence

Summary

Data Engineer - AI Data Platform

The Role

We're partnering with a fast-growing AI startup building the analytics and observability layer for AI search - essentially "Google Analytics for LLMs."

As more users discover products through tools like ChatGPT, Claude, and Perplexity, companies lack visibility into how they appear in those responses. This platform solves that - and the data challenges are significant.

They're hiring a Data Engineer to help scale the core data infrastructure powering this system - handling large-scale, real-time data and enabling analytics, experimentation, and ML-driven insights.

What You'll Do
  • Build and maintain reliable, production-grade data pipelines
  • Design scalable batch and real-time ingestion systems
  • Optimize performance and cost across Snowflake, ClickHouse, AWS, dbt, and Dagster
  • Own data quality, validation, and monitoring end-to-end
  • Develop and maintain transformation logic powering analytics and product features
  • Support ML workflows (MLOps) with clean, structured, high-quality data
  • Work closely with product, data, and engineering teams

What They're Looking For
  • Proven experience building and maintaining production data pipelines
  • Strong SQL skills (must-have): you've written complex queries, optimized performance, and worked deeply with analytical datasets
  • Comfortable modeling data and transforming large datasets efficiently
  • Strong Python experience
  • Hands-on experience with: dbt, orchestration tools (Dagster, Airflow, or Prefect), and AWS (or similar cloud environments)
  • Experience with modern data warehouses such as Snowflake or ClickHouse
  • Strong ownership mindset with a focus on data quality and reliability
  • Interest in or exposure to ML pipelines / MLOps

Why This Role
  • Systems processing 100M+ AI queries per month
  • Work with data powering enterprise-scale AI products
  • The company has grown rapidly to ~$30M ARR and raised a $96M Series C at a ~$1B valuation
  • Solve real problems across real-time data pipelines, AI-driven analytics, and ML infrastructure

Why Join
  • High ownership - real impact on platform design
  • Early-stage but already scaled - strong product-market fit
  • Work at the intersection of AI, data, and user behaviour
  • Competitive compensation + meaningful equity

If you're excited to work on high-scale data systems powering real-world AI applications, we'd encourage you to apply.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 90973602
  • Position Id: d19a42fccbea2d48d9a90a9626588894