AI Data Architect

Hybrid in Exton, PA, US • Posted 13 hours ago • Updated 13 hours ago
Full Time
No Travel Required
Hybrid
Depends on Experience
Fitment

Dice Job Match Score™

🧠 Analyzing your skills...

Job Details

Skills

  • ASC X12
  • Data Architecture
  • Electronic Data Interchange
  • Health Care
  • HL7
  • HIPAA
  • Microsoft Azure
  • Python
  • SQL
  • Artificial Intelligence
  • Apache Spark
  • Data Lake
  • Data Governance
  • Cloud Computing
  • Extract, Transform, Load
  • MBA
  • Vector Databases
  • Medicare

Summary

Genzeon, an AI and automation company with deep engineering and data expertise, dedicated to serving the healthcare and retail industries. Our platform solutions – including HIP One, CompliancePro Solutions, and Patient Engagement Solutions – empower organizations to scale innovation and transform outcomes.

Genzeon is a global community of innovators and problem-solvers, with a culture built on inclusion, flexibility, and purpose-driven work. With four global delivery centers, we support providers, payers, Healthtech, and retail organizations worldwide.

Genzeon has an exciting opening for AI Data Architect | Healthcare AI Platform to join our dynamic team.

AI Data Architect | Healthcare AI Platform Genzeon Corporation — Healthcare Division

Exton, PA/ Hybrid

0–4 years

The short version: We run a multi-model AI pipeline that processes 150K Medicare documents/year — faxed PDFs, EDI transactions, FHIR data, clinical notes. You’ll design and build the data architecture that ingests, stores, governs, and serves all of it to AI models and clinical reviewers. On-prem GPUs, hybrid cloud, HIPAA compliance. This is the real thing.

What you’ll do

  • Design the end-to-end data architecture for a healthcare AI platform — ingestion, storage, processing, serving, governance
  • Build pipelines for heterogeneous healthcare data: faxed PDFs, X12 EDI (835/837/278), FHIR R4, HL7v2, CMS files, unstructured clinical notes
  • Architect the data lake/lakehouse layer (Apache Iceberg, MinIO, DuckDB, PostgreSQL/pgvector)
  •  Design the embedding and vector storage layer that powers RAG — chunking, indexing, retrieval optimization
  • Build data lineage tracking from source document to AI decision
  • Implement HIPAA/HITRUST data governance — encryption, access controls, audit logging, PHI handling
  • Monitor data quality across the pipeline — schema drift, completeness, freshness, anomalies
  • Optimize for hybrid infrastructure: on-prem GPUs (RTX 50U0, L40S), NAS, Azure GovCloud, Azure Commercial

What you need:

  • A data pipeline you’ve built that ran in production (we’ll ask about it)
  •  SQL fluency and Python proficiency
  • Experience with at least one of: Spark, dbt, Airflow, Dagster, Prefect
  • 3 to 5 years of hands-on experience working within a fast-paced healthcare environment, preferably within a high-growth startup company.
  • Bachelor’s degree with 4 to 5 years of relevant healthcare industry experience, or an MBA looking to leverage specialized business insights.
  • Hands-on work with unstructured or semi-structured data — PDFs, images, OCR outputs, free text
  • Practical understanding of vector databases, embeddings, and how RAG systems consume data
  • Comfort with on-premises infrastructure, not just managed cloud services
  • Data quality and governance as instincts, not afterthoughts
  • Must be currently located in or willing to work within the Greater Nashville, Bay Area, or Philadelphia regions.
  • High-energy, high-powered professional who thrives under pressure and adapts quickly to shifting priorities.

 

Strong signals:

  • Healthcare data formats (X12 EDI, FHIR, HL7, CCD/C-CDA)
  • Apache Iceberg, Delta Lake, or modern table formats
  • MinIO / S3 / object storage architecture
  • pgvector, Pinecone, Weaviate, or similar vector stores
  • DuckDB or embedded analytical engines
  • HIPAA technical safeguards implementation
  • ML data pipelines — training data, feature stores, evaluation sets, feedback loops
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10379292
  • Position Id: 8981550
  • Posted 13 hours ago
Contact the job poster
Anilkumar Padhy

Anilkumar Padhy

Recruiter @ Genzeon
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Wayne, Pennsylvania

Today

Easy Apply

Full-time

$115000 - $125000 per annum

Wayne, Pennsylvania

Today

Easy Apply

Full-time

$120000 - $130000 per annum

Wayne, Pennsylvania

Today

Full-time

USD 122,000.00 - 152,500.00 per year

King of Prussia, Pennsylvania

Today

Full-time

Search all similar jobs