Data Integration & Modeling Specialist

Overview

Remote
Depends on Experience
Part Time
No Travel Required
Unable to Provide Sponsorship

Skills

Amazon RDS
Amazon Redshift
Amazon S3
Advanced Analytics
Amazon Web Services
Analytical Skill
Artificial Intelligence
Biomedicine
Clinical Data Management
Clinical Research
Cloud Computing
Communication
Computer Science
DICOM
Data Architecture
Data Governance
Data Integration
Data Management
Data Processing
Data Science
Extract, Transform, Load
Google Cloud Platform
Machine Learning (ML)
HIPAA
HL7
Management
Mapping
Modeling
Problem Solving
Python
Soft Skills
Remote Desktop Services
Workflow
SQL
Proteomics
Documentation

Job Details

REMOTE OPPORTUNITY

Role Overview

  • We are seeking an experienced professional to lead data linkage and modeling across diverse biomedical data sources, including imaging, omics, and clinical datasets.
  • This role requires strong expertise in integrating heterogeneous data, understanding clinical study datasets, and leveraging cloud resources for scalable solutions.
  • The ideal candidate will enable advanced analytics and AI/ML applications in precision medicine and clinical research.

Key Responsibilities

  • Design and implement data linkage strategies between imaging, omics (genomics, proteomics), and clinical datasets.
  • Develop robust data models to support multi-modal analytics and machine learning workflows.
  • Ensure data harmonization and interoperability using industry standards (e.g., HL7, FHIR, DICOM).
  • Interpret and structure clinical study datasets, including EHR, trial data, and regulatory-compliant formats.
  • Collaborate with clinical teams to ensure accurate mapping of patient-level data across modalities.
  • Utilize AWS resources (S3, RDS, Glue, Redshift, Lambda) for secure and scalable data processing.
  • Design and optimize SQL queries for large-scale biomedical datasets.
  • Implement ETL pipelines for data ingestion and transformation.
  • Ensure adherence to Good Clinical Practices (Google Cloud Platform), HIPAA, and data privacy regulations.
  • Maintain documentation for data integration workflows and validation processes.
  • Work closely with bioinformaticians, data scientists, and imaging specialists to enable integrated analytics.
  • Provide technical guidance on data architecture and modeling best practices.

Required Qualifications

  • Master’s in Data Science, Computer Science, or related field.
  • Proven experience in multi-modal data integration (imaging + omics + clinical).
  • Strong knowledge of clinical study datasets and regulatory standards.
  • Hands-on experience with AWS services and SQL for large-scale data management.
  • Proficiency in Python/R for data processing and modeling.

Preferred Skills:

  • Familiarity with AI/ML workflows and biomedical data pipelines.
  • Experience with data standards (DICOM, HL7, FHIR, OMOP CDM).
  • Knowledge of ETL tools and data governance frameworks.

Soft Skills:

  • Excellent problem-solving and analytical skills.
  • Strong communication and documentation abilities.
  • Ability to work in cross-functional teams and manage complex projects.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.