Job Details
The ETL Developer designs, develops, and maintains robust data pipelines that ingest large-scale clinical, claims, and administrative data from various healthcare sources into a centralized data warehouse or data lake.
Key Responsibilities
Pipeline Development: Build and optimize complex ETL/ELT workflows to handle structured and unstructured healthcare data (EHR extracts, labs, claims, pharmacy).
Standards Implementation: Map and transform legacy data into industry-standard formats using a common data model.
Data Security: Ensure all data movement processes comply with HIPAA and meet HITRUST and SOC 2 requirements, implementing encryption and de-identification where necessary.
Performance Tuning: Monitor and optimize SQL queries and batch jobs to ensure high-speed ingestion and minimal system downtime.
Technical Requirements
Proficiency: SQL (advanced), Python, pandas, AWS (S3, EC2), and ETL tools (Airflow).
Experience: 3+ years in data engineering, specifically working with healthcare datasets.
Knowledge: Deep understanding of healthcare interoperability, including API integration and SFTP-based file exchange.