As a Senior Data Engineer on DAPI's Tetris team, you will own the design and delivery of complex data engineering solutions that power NYBCe's enterprise analytics, AI, and reporting capabilities. Reporting to the Lead Data Engineer, you will drive technical decisions, set engineering standards, and ensure the reliability and scalability of DAPI's data platform across 49+ integrated enterprise source systems.
This role demands deep technical mastery in SQL, Python, and Azure cloud data engineering, combined with a product orientation - understanding how the data assets you build translate into decisions, reports, and AI outputs for the business. You will mentor Data Engineers, contribute to architectural direction, and serve as a technical anchor for delivery across the Tetris team's sprint cycles.
Advanced Pipeline Development & Ownership
- Architect, build, and own complex data pipelines for high-volume, high criticality workstreams across NYBCe's enterprise data platform.
- Lead the design and implementation of ELT/ETL frameworks using SQL, Python, Azure Data Factory, Databricks, and Azure Synapse Analytics.
- Establish pipeline reliability standards - monitoring, alerting, error handling, and recovery protocols - and ensure adherence across the team.
Data Architecture & Platform Evolution
- Drive the design of scalable data models supporting dimensional warehousing, data lake architectures on Azure.
- Contribute to architectural decisions on data storage, partitioning, compute optimization, and consumption layer design.
- Lead migrations from legacy data solutions to modern cloud-native platforms, managing risk and business continuity throughout.
AI & Analytics Enablement
- Design and deliver feature pipelines and data preparation frameworks that support machine learning model development and deployment.
- Partner with Data Scientists to translate model requirements into production-grade data assets and feature stores.
- Collaborate with Analytics Engineers to ensure data models are optimized for analytical consumption and reporting performance.
Data Quality & Governance Leadership
- Define and implement data quality frameworks - validation rules, SLAs, anomaly detection, and automated testing for pipeline outputs.
- Lead data governance initiatives including metadata management, lineage tracking, data cataloging (Microsoft Purview), and access control.
- Ensure platform compliance with HIPAA, NYBCe data policies, and applicable regulatory requirements.
Mentorship & Technical Leadership
- Mentor Data Engineers - providing code reviews, technical guidance, and architectural feedback that elevates team capability.
- Contribute to DAPI's engineering standards, reusable frameworks, and technical documentation.
- Participate in Agile ceremonies and model strong engineering discipline - clear DevOps hygiene, sprint commitment, and delivery accountability.