Applied AI Engineer, Clinical Informatics

Boston, MA, US • Posted 4 hours ago • Updated 4 hours ago
Full Time
On-site
USD $181,500.00 - 283,800.00 per year
Fitment

Dice Job Match Score™

🧠 Analyzing your skills...

Job Details

Skills

  • FOCUS
  • AIM
  • ROOT
  • Evaluation
  • Clinical Research
  • Natural Language
  • Supervised Learning
  • Clustering
  • Workflow
  • Analytical Skill
  • Data Management
  • Auditing
  • HIPAA
  • Health Informatics
  • Biology
  • Bioinformatics
  • Genetics
  • Epidemiology
  • Data Science
  • Pharmaceutics
  • Clinical Data Analysis
  • Python
  • R
  • SQL
  • Cloud Computing
  • Amazon Web Services
  • Google Cloud Platform
  • Google Cloud
  • Microsoft Azure
  • HPC
  • Generative Artificial Intelligence (AI)
  • Training
  • Scratch
  • High Performance Computing
  • CDISC
  • SDTM
  • Database
  • Survival Analysis
  • Natural Language Processing
  • Deep Learning
  • Clinical Data Management
  • HL7
  • Management
  • Privacy
  • Research
  • Publications
  • Artificial Intelligence
  • Informatics
  • Genomics
  • Modeling
  • Pure Data
  • Clinical Trials
  • Machine Learning (ML)
  • Ontologies
  • Reasoning
  • Biomedicine
  • Data Analysis
  • SAP BASIS
  • Chinese
  • Japanese
  • Leadership
  • Network
  • Health Care
  • Life Insurance
  • Promotions

Summary

At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our employees around the world work to discover and bring life-changing medicines to those who need them, improve the understanding and management of disease, and give back to our communities through philanthropy and volunteerism. We give our best effort to our work, and we put people first. We're looking for people who are determined to make life better for people around the world.

Therapy areas across Eli Lilly focus on new therapeutic approaches for the treatment of different diseases. You will work with partners across Lilly to discover and develop novel biologic, small molecule and nucleic acid-based therapeutics. Our focus is the patient: by understanding the biology and pathophysiology underlying disease states, we aim to address the root cause of disease and develop breakthrough therapies. We have one of the strongest pipelines in the industry and a track record of delivering impactful medicines that improve people's lives. The Lilly research environment is evolving to centralize the access and analysis of human genetic, omic, and clinical data. This new initiative will work to define data, tools and process to provide the therapy area teams key evidence for target evaluation and target discovery.

We are seeking a highly specialized Applied AI Engineer Clinical Informatician to lead research at the intersection of completed clinical trial datasets and biobank-linked population data. This is fundamentally a hands-on research role (not operational trial management), where you will be an individual contributor. Your core mission is to build the systems and tools that extract, define, and contextualize patient phenotypes from locked trial databases, real-world data, and biobank cohorts, that will turn archived data that can generate translational insight that shapes the next generation of clinical research.

You will work with rich, already-collected datasets: locked trial databases, archived omics profiles, longitudinal electronic health records, and population-scale biobank cohorts. Your mandate is to build the AI and ML systems that make these datasets manageable and ready for detailed analysis. This role suits someone who thinks like a scientist, builds like an engineer, and communicates like a clinician.

Apply today!

Key Responsibilities

AI & Machine Learning for Translational Discovery
  • Develop and deploy agentic AI applications that enable natural language interaction with clinical data
  • Ground AI outputs in validated biological knowledge, for example implementing RAG pipelines anchored in biomedical ontologies (HPO, Gene Ontology, MeSH, DrugBank), clinical trial registries, and curated pathway databases
  • Deploy unsupervised and self-supervised learning approaches like clustering, representation learning, contrastive learning to discover latent patient archetypes and molecular disease subtypes across trial and biobank data
  • Deploy survival models and dynamic treatment regime estimators using combined clinical and omics features
  • AI tooling to harmonize heterogeneous trial and biobank datasets to common data representations
  • Evaluate and monitor model performance, safety, and reliability in production environments
  • Manage vendors and contractors as well as partner relationships with relevant teams across Lilly

Post-Trial Data Research & Analysis
  • Building pipelines for locked clinical trial databases (SDTM, ADaM) to conduct secondary and exploratory research beyond primary endpoints
  • Deploy ML workflows to identify trial subgroup effects, treatment heterogeneity, and responder/non-responder signatures from completed trial data
  • Mine adverse event narratives, clinical notes, and investigator comments using NLP to surface latent safety signals not captured in structured endpoints in biobanks and clinical datasets
  • Reconstruct patient-level longitudinal trajectories from trial visit data to model disease progression, drug response kinetics, and time-to-event outcomes
  • Architect workflows for meta-analytic and cross-trial integrative analyses across multiple completed studies to identify generalizable biological and clinical patterns
  • Build connections to large-scale biobank cohorts (UK Biobank, All of Us, etc.) as external validation and enrichment resources for trial-derived findings for clinical phenotypes

Research Rigor, Reproducibility & Governance
  • Establish research data management practices ensuring full reproducibility of analyses including data versioning, containerized compute environments, and audit-ready analysis logs
  • Ensure all research activities follow HIPAA, GDPR, and relevant IRB and ethics committee requirements

Basic Qualifications
  • M.S. in Biomedical Informatics, Computational Biology, Bioinformatics, Statistical Genetics , Epidemiology, or a closely related quantitative field or an MD/PhD with equivalent depth in translational data science with 6+ years of research experience working with clinical trial datasets (SDTM/ADaM), biobank data, or large-scale population health data in an academic, pharmaceutical, or research institute setting
  • Or Ph.D. in Biomedical Informatics, Computational Biology, Bioinformatics, Statistical Genetics, Epidemiology, or a closely related quantitative field or an MD/PhD with equivalent depth in translational data science with 3+ years of research experience working with clinical trial datasets (SDTM/ADaM), biobank data, or large-scale population health data in an academic, pharmaceutical, or research institute setting

Additional Skills & Preferences
  • Demonstrated use of AI tools in production environments for clinical data analysis
  • Expert proficiency in Python and/or R for statistical modelling and ML; strong command of SQL and experience with cloud-based research computing environments (ideally DNAnexus, AWS, Google Cloud Platform, Azure, or HPC clusters)
  • Familiar with advanced generative AI methods like finetuning of LLMs. Building and training foundation models from scratch. High performance computing environments
  • Deep knowledge of CDISC standards (SDTM, ADaM) and experience analyzing clinical trial databases for secondary research purposes
  • Demonstrated experience applying ML methods including survival analysis, causal inference, NLP, and deep learning to clinical or genomic research questions
  • Thorough understanding of OMOP CDM, HL7 FHIR Genomics, and major biomedical ontologies
  • Direct research experience with major public and restricted-access biobank resources (UK Biobank, All of Us, etc.)
  • Experience with federated learning, differential privacy, or secure computation frameworks applied to multi-site biomedical research
  • Track record of peer-reviewed publications in clinical AI, translational informatics, genomics, or a related field
  • Familiarity with the target trial framework and its application in biobanks
  • Knowledge of pharmacogenomics, drug response modeling, or PK/PD data analysis from clinical trials
  • Experience with knowledge graph construction, graph ML, or ontology-driven reasoning for biomedical discovery
  • Hands-on experience with multi-omic data analysis

Lilly is dedicated to helping individuals with disabilities to actively engage in the workforce, ensuring equal opportunities when vying for positions. If you require accommodation to submit a resume for a position at Lilly, please complete the accommodation request form ( for further assistance. Please note this is for individuals to request an accommodation as part of the application process and any other correspondence will not receive a response.

Lilly is proud to be an EEO Employer and does not discriminate on the basis of age, race, color, religion, gender identity, sex, gender expression, sexual orientation, genetic information, ancestry, national origin, protected veteran status, disability, or any other legally protected status.

Our employee resource groups (ERGs) offer strong support networks for their members and are open to all employees. Our current groups include: Africa, Middle East, Central Asia Network, Black Employees at Lilly, Chinese Culture Network, Japanese International Leadership Network (JILN), Lilly India Network, Organization of Latinx at Lilly (OLA), PRIDE (LGBTQ+ Allies), Veterans Leadership Network (VLN), Women's Initiative for Leading at Lilly (WILL), enAble (for people with disabilities). Learn more about all of our groups.

Actual compensation will depend on a candidate's education, experience, skills, and geographic location. The anticipated wage for this position is
$181,500 - $283,800

Full-time equivalent employees also will be eligible for a company bonus (depending, in part, on company and individual performance). In addition, Lilly offers a comprehensive benefit program to eligible employees, including eligibility to participate in a company-sponsored 401(k); pension; vacation benefits; eligibility for medical, dental, vision and prescription drug benefits; flexible benefits (e.g., healthcare and/or dependent day care flexible spending accounts); life insurance and death benefits; certain time off and leave of absence benefits; and well-being benefits (e.g., employee assistance program, fitness benefits, and employee clubs and activities).Lilly reserves the right to amend, modify, or terminate its compensation and benefit programs in its sole discretion and Lilly's compensation practices and guidelines will apply regarding the details of any promotion or transfer of Lilly employees.

#WeAreLilly
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: eliin001
  • Position Id: 44c7728a646c0859ff38ee7c7cf34f2b
  • Posted 4 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Lexington, Massachusetts

Today

Full-time

USD 100,600.00 - 155,400.00 per year

Waltham, Massachusetts

Today

Full-time

USD 176,000.00 - 264,050.00 per year

Boston, Massachusetts

Today

Full-time

Compensation information provided in the description

Boston, Massachusetts

Today

Full-time

USD 150,000.00 - 215,000.00 per year

Search all similar jobs