Role Overview
As a Data Architect, you will lead the design and implementation of a modern Lakehouse + Data Cloud ecosystem. You will be responsible for defining how data flows from raw telemetry in Databricks into highly optimized, governed presentation layers in Snowflake. Your goal is to balance the high-performance engineering capabilities of Spark/Delta Lake with the seamless, SQL-first scalability of Snowflake.
Key Responsibilities
- Unified Architecture Design: Develop end-to-end blueprints integrating Databricks (for ingestion, streaming, and ML) with Snowflake (for BI, reporting, and secure data sharing).
- Data Modeling: Design sophisticated schemas using Data Vault 2.0, Dimensional Modeling, or Data Mesh principles to ensure cross-platform consistency.
- Integration Leadership: Implement efficient data movement patterns using Snowpipe Streaming, Apache Iceberg (for interoperability), or Unity Catalog-to-Snowflake integrations.
- Performance Optimization: Tune Databricks workloads (Photon engine, Liquid Clustering) and Snowflake warehouses (multi-cluster auto-scaling, Query Acceleration Service) to minimize latency and cost.
- Governance & Security: Establish a unified security posture across both platforms using RBAC, Row-Level Security, and data masking, ensuring compliance with GDPR/CCPA.
- AI/ML Readiness: Architect data foundations that support Databricks Mosaic AI and Snowflake Cortex, enabling LLM and predictive analytics use cases.
Technical Requirements
Advanced expertise in Snowpark (Python/Java), Dynamic Tables, Horizon Catalog, and Zero-copy cloning.
Deep knowledge of Delta Lake, Spark Structured Streaming, Delta Live Tables (DLT), and Unity Catalog.
Mastery of SQL and Python; familiarity with Scala or Java is a plus.
Hands-on experience with Apache Iceberg and external tables to bridge the two ecosystems.
Experience with Airflow, dbt (Cloud/Core), or Dagster for multi-platform pipelines.
Proficiency in at least one major cloud provider (AWS, Azure, or Google Cloud Platform).
Preferred Certifications
Snowflake: SnowPro Advanced: Architect (ARA-C01).
Databricks: Databricks Certified Solutions Architect or Data Engineer Professional.
Cloud: AWS Certified Data Engineer - Associate or Microsoft Certified: Azure Solutions Architect Expert.