Employment type – Full Time
Job Title – Databricks Architect (Healthcare)
Job Location – Remote
Duration-Full Time
Experience level— 10-15+ years
Mandatory Skills – Databricks Architect
Job Description –
We are seeking a highly experienced Databricks Architect to lead the design and implementation of a scalable Lakehouse architecture supporting payer analytics, regulatory reporting, and advanced healthcare use cases. This role will play a critical part in enabling data-driven decision-making across value-based care, risk adjustment, and quality measures such as HEDIS.
The ideal candidate will have deep expertise in Databricks, Apache Spark, AWS cloud services, and healthcare data standards, along with a strong focus on data governance, security, and performance optimization.
Key Responsibilities:
- Lead the design and implementation of Databricks Lakehouse architecture for payer analytics and reporting
- Architect scalable and high-performance data pipelines using Apache Spark and Delta Lake integrated with AWS services
- Enable advanced analytics use cases including value-based care, risk adjustment, HEDIS measures, and population health
- Leverage Gen AI and Agentic AI capabilities to accelerate code development, testing, and deployment processes
- Ensure data security, governance, and compliance (HIPAA, PHI) using Unity Catalog, encryption, and private networking
- Define and enforce data architecture standards, best practices, and governance frameworks
- Collaborate with business and analytics teams to translate requirements into robust data solutions
- Optimize performance, scalability, reliability, and cloud cost efficiency for large-scale healthcare data workloads
- Provide technical leadership, mentorship, and guidance to engineering teams
- Drive CI/CD, automation, and DevOps practices within the data platform
Required Skills & Qualifications:
- 10+ years of experience in data engineering, data architecture, or related roles
- Strong hands-on experience with Databricks Lakehouse platform
- Deep expertise in Apache Spark (PySpark/Scala/SQL) and Delta Lake
- Extensive experience with AWS services (S3, Glue, Lambda, Redshift, etc.)
- Strong understanding of data governance and security frameworks, including HIPAA and PHI handling
- Experience with Unity Catalog and secure data access patterns
- Familiarity with healthcare data standards such as FHIR, HL7, EDI (X12)
- Experience designing and optimizing large-scale distributed data systems
- Strong problem-solving, communication, and stakeholder management skills
Preferred / Good to Have:
- Experience implementing AWS HealthLake, Clinical Data Platforms, or Healthcare Data Lakes
- Exposure to Terminology Servers and Master Patient Index (MPI) solutions
- Experience with real-time/streaming pipelines (Kafka, Kinesis)
- Knowledge of data quality frameworks and observability tools
- Certifications in Databricks or AWS
- Experience working in payer/provider healthcare environments
Key Competencies:
- Strategic thinking and architecture design
- Leadership and mentoring
- Stakeholder collaboration
- Innovation with emerging technologies (AI/ML, GenAI)
- Strong focus on data quality, compliance, and governance
Tekshapers is an equal opportunity employer and will consider all applications without regards to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law.