Data Architect Role: For building data lake handling Radiology, Digital Pathology, Clinical data, Omics data
Remote
Duration: Long term
Data architect with experience in building a data lake bringing different Clinical data sets from CRO’s to Pharma such as Radiology images (DICOM), Digital Pathology images (Non-DICOM), Clinical trials data/CRFs, Omics data. This role requires defining a strategy to store, version, and use of these heterogenous datasets to be used for longitudinal views of trial subjects/patients
Key Responsibilities
Design and architect data strategy to bring different data sets such as Radiology, Digital Pathology, Trials data/CRF’s and Omics data.
Develop robust data models to support multi-modal analytics and machine learning workflows.
Strong understanding of clinical study datasets, including EHR, trial data, and regulatory-compliant formats.
Utilize AWS resources (S3, RDS, Glue, Redshift, Lambda) for secure and scalable data processing.
Design ETL pipelines for data ingestion and transformation.
Collaborate with data scientists, and Technical architect (Imaging) to enable use of ML models and integrated analytics.
Strong communication and leadership skills to drive data architecture and strategy with different stakeholders.
Preferred Skills
Understanding of AI/ML use in Clinical data side (Imaging, Trial data, Omics etc.)
Prior experience in working with Pharma building/maintaining such Clinical data lakes