HMG America LLC is a business-solutions-focused Information Technology company offering IT consulting and services, software and web development, staff augmentation, and other professional services. One of our direct clients is looking for a Data Lake / Lakehouse Data Modeler Architect in Newtown Square, PA. The detailed job description is below.
Job Title: Data Lake / Lakehouse Data Modeler Architect - Azure Databricks (US Healthcare Payer Domain)
Location: Newtown Square, PA
Work mode: Hybrid (3 days onsite required). Candidates in the EST time zone who can visit the office for one week per month are preferred.
Role Summary
Seeking a Data Lake/Lakehouse Data Modeler with deep hands-on experience building governed, secure, and high-performance data models on Azure for Healthcare use cases. The role will design logical and physical schemas across landing, curated, and serving layers to support Payer domain Analytics and governance reporting.
Key Responsibilities:
- Design logical and physical models across raw, curated, and consumption layers, optimized for lakehouse patterns.
- Define canonical models and source-to-target mappings for member, provider, claims, prior authorization, and related domains.
- Define retention, archival, encryption (CMK), and access controls using Azure Key Vault and Azure AD.
- Work with data engineers and platform teams to implement models in Databricks and Synapse with CI/CD and tests.
- Perform data profiling, validation, and iterative tuning to meet performance targets and SLAs for BI and ML consumers.
Required Qualifications:
- Proven experience modeling Lakehouse or data lake solutions for US Healthcare Payers.
- Deep knowledge of the US Payer domain, including modeling of members, providers, and all types of claims.
- Hands-on experience with ADLS Gen2 and Azure Databricks (Delta Lake).
- Practical experience with CDC, streaming, low-latency analytics, and schema evolution strategies.
- Familiarity with Unity Catalog, Azure AD, Key Vault, network isolation, and regulatory compliance controls.
Technical Skills:
- Delta Lake, Parquet/ORC, schema evolution, partitioning and Z-order/clustering strategies.
- Spark SQL, Databricks notebooks, or similar modeling frameworks, and strong SQL proficiency.
- Ingestion and orchestration with ADF, Tidal, and Databricks Workflows.
- BI and analytics integration using Power BI, Synapse SQL, and Databricks SQL for served models.
- Strong stakeholder engagement across compliance, risk, finance, engineering, and data science teams.
- Clear documentation, model governance, and ability to present designs for audits and architecture reviews.
Experience & Education:
- 10+ years of experience in data modeling or data engineering; experience in the US Healthcare Payer domain is a must.
- 5 years of experience with modeling tools such as Erwin, Embarcadero, or ER/Studio.
- 5 years of experience modeling various data domains within the Payer domain.
- Azure or Databricks certifications are a plus.
Deliverables:
- Model artifacts and data dictionaries, source-to-target mappings, lineage for audits, and measurable performance and quality improvements.