Overview
Skills
Job Details
Key Responsibilities
Build and operate scalable batch and streaming data pipelines.
Model, document, and publish curated datasets and ML-ready features.
Enforce data quality with automated tests, SLAs, lineage, and monitoring.
Tune storage and queries to optimize performance and cost.
Protect data with robust security, privacy, and compliance controls.
Collaborate with product, analytics, and ML teams and support production systems.
Required Skills and Experience
4 6+ years of production data engineering experience or equivalent.
Advanced SQL and proficiency in at least one general-purpose programming language, with hands-on experience using distributed data processing frameworks.
Proven delivery on a major cloud with a modern warehouse or lakehouse.
Working knowledge of streaming patterns and platforms.
Strong data modeling and fluency with modern file and table formats.
Proficiency in orchestration, CI/CD, and infrastructure as code.
Demonstrated ownership of data quality, observability, and lineage.
BS in CS/Engineering or equivalent and strong communication skills.
Preferred Qualification
Experience building and maintaining feature pipelines and stores for ML.
Familiarity with data catalog, lineage, and data quality toolchains.
Track record enabling self-serve analytics with semantic layers or data build tool.
Experience with containers and Kubernetes in production.
Relevant cloud certifications and SaaS or product analytics domain exposure.
Evidence of mentorship, code reviews, and cross-team technical leadership.