Apply Now

AI Engineer - Clinical Data Science

Remote • Posted 30+ days ago • Updated 1 hour ago

Full Time

On-site

Fitment

Dice Job Match Score™

🔢 Crunching numbers...

Job Details

Skills

Team Building
Data Processing
Decision-making
Writing
Prototyping
Generative Artificial Intelligence (AI)
Reporting
Research
MSC
Computer Science
Bioinformatics
LangChain
Cloud Computing
Software Engineering
Design Patterns
Version Control
Code Review
Command-line Interface
Orchestration
Workflow
Evaluation
Data Science
Statistics
Data Visualization
Prompt Engineering
Application Development
Clinical Trials
Pharmaceutical Industry
Privacy
Life Sciences
DevOps
Continuous Integration
Continuous Delivery
Neo4j
Network
Analytics
Extraction
Natural Language Processing
PyTorch
TensorFlow
React.js
RESTful
PostgreSQL
SQL
TypeScript
R
Machine Learning (ML)
LlamaIndex
Microsoft Certified Professional
Document Processing
PDF
Optical Character Recognition
Pandas
NumPy
scikit-learn
Plotly
Database
Vector Databases
Graph Databases
Relational Databases
Docker
Google Cloud
Google Cloud Platform
Vertex
GCS
Terraform
GitHub
API
Flask
Python
Testing
Git
Switches
Pharmaceutics
Artificial Intelligence
Management

Summary

Job Description:

We are looking for an AI Engineer to join our Data Science team, building AI-powered solutions for clinical data processing and analysis within a major pharmaceutical organization. You will design, develop and deploy generative AI systems that automate clinical reporting workflows, extract intelligence from documents, and accelerate data-driven decision making.
This is a hands-on engineering role - you'll be writing production code, not just building prototypes.

Responsibilities:

Generative AI & Automation.
Develop LLM-powered automation tools for clinical reporting and document generation workflows.
Build AI-driven code generation pipelines and quality assessment frameworks.
Design and implement human-in-the-loop review workflows with feedback loops to continuously improve output quality.

Research & Evaluation:

Research and evaluate emerging AI methods, frameworks, and techniques for specific tasks - e.g. comparing fine-tuning vs zero-shot approaches, assessing new document extraction tools, or trialling new agentic frameworks.
Prototype and benchmark new approaches before recommending adoption.
Stay current with a rapidly evolving field and bring new ideas to the team.

Agentic AI & Orchestration:

Design and build multi-agent systems for data workflows - agents that retrieve, generate, validate, and iterate autonomously.
Implement agent orchestration using frameworks such as Google ADK, Lang Graph, or Lang Chain.
Deploy and manage agents on Google Vertex AI.

Document Understanding & RAG:

Build document processing pipelines (PDFs, Word/DOCX) - extraction, parsing, table detection, structure recognition.
Design and build RAG pipelines grounded in source documents.
Process, extract and transform data from unstructured and semi-structured sources.
Code Quality & Engineering Practices:
Write clean, well-tested, maintainable Python code following SOLID principles and recognised design patterns.
pply single responsibility, dependency inversion, and interface segregation in real codebases - not just theory.
Write meaningful tests and maintain high standards across the team.
Refactor and improve existing code as part of normal development workflow.

AI-Assisted Development:

Use AI coding tools (e.g. Gemini CLI, GitHub Copilot) as a core part of your development workflow.
Critically review and validate AI-generated code - understanding what it produces, why, and when it's wrong.
Write effective prompts to direct AI tools toward correct, secure, well-structured output.
Know when to use AI and when to write code manually - judgement over speed.

Platform & Infrastructure:

Integrate and orchestrate LLM providers available through Google Vertex AI (Gemini, etc.).
Build internal tools and applications using Stream lit and Fast API.
Containerize and deploy services using Docker.

Required Skills & Experience:

MSc in Data Science, Computer Science, Bioinformatics, or related field (or equivalent practical experience), Strong Python skills.
Hands-on experience building RAG systems or LLM-powered applications (using LangChain, LlamaIndex, or similar frameworks).
Experience integrating LLM APIs (Google Gemini, OpenAI, or similar) - we work primarily through Google Vertex AI.
Working knowledge of vector databases (ChromaDB, Weaviate, Qdrant, Pinecone, or similar).
Cloud platform experience (Google Cloud Platform preferred, especially Vertex AI).
Docker and containerized deployments.
Strong software engineering fundamentals - SOLID principles, clean code practices, design patterns, testing, version control (Git), code review.
Comfortable using AI-assisted development tools (e.g. Gemini CLI, GitHub Copilot) - and critically evaluating what they produce.
Strongly Preferred.
Experience with agentic AI patterns - multi-agent orchestration, tool use, autonomous workflows (LangGraph, Google ADK, or similar).
Document processing experience - extracting and parsing data from PDFs and Word/DOCX files programmatically.
Understanding of LLM evaluation principles and output quality assessment (BLEU, ROUGE etc, code execution metrics, or similar).
Data science fundamentals - Pandas, NumPy, scikit-learn, statistical analysis, data visualization.
Prompt engineering and optimisation techniques.
Streamlit application development.

Domain Knowledge:

Clinical trials or pharmaceutical industry experience.
Familiarity with clinical data standards.
wareness of regulatory and data privacy requirements in life sciences.

Infrastructure & DevOps :

Terraforma or infrastructure-as-code experience.
CI/CD pipeline design (GitHub Actions or similar).

Knowledge Graphs:

Neo4j, Cypher query language.
Network for graph analytics.
Graph-based RAG or knowledge extraction.

AI/ML:

Experience with LLM-driven code generation.
LLM fine-tuning experience (e.g. LoRA, PEFT, RLHF, Vertex AI model tuning, or similar approaches).
NLP and text processing (HuggingFace Transformers, Sentence-Transformers).
PyTorch or TensorFlow (for custom model work if needed).
Google ADK (Agent Development Kit) or Vertex AI Agent Builder.
Model Context Protocol (MCP) for tool integration and interoperability.

Other:

Frontend experience (React, TypeScript).
FastAPI or Flask REST API development.
PostgreSQL or similar relational databases.

What You'll Work With:

Languages: Python (primary), SQL, some TypeScript/R.
AI/ML : Lang Chain, LlamaIndex, Lang Graph, Google ADK, MCP, Hugging Face Transformers, Sentence-Transformers, Google Gemini (via Vertex AI).
Document Processing: PyMuPDF, python-docx, pdf plumber, OCR tools.
Data: Pandas, NumPy, SciPy, scikit-learn, Plotly.
Databases: Vector databases, graph databases, relational databases.
Infrastructure: Docker, Google Cloud Platform (Vertex AI, GCS), Terraform, GitHub Actions.
Applications: stream lit, Fast API, Flask.
Tools: Python packaging, testing frameworks, linting, Git.

About You:

You care about code quality - not just making things work, but making them maintainable.
You're comfortable working across the full stack of an AI application, from data ingestion to user-facing tools.
You can context-switch between multiple projects and work autonomously.
You're curious about the clinical/pharmaceutical domain and motivated to learn it.
You see AI-assisted development as a force multiplier, not a replacement for engineering judgment.
You're a self-directed learner who researches new methods and tools, evaluates them critically, and knows when to adopt vs when to stick with what works.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 91116760
Position Id: 85b55f524a9087539798e50d1dd5fbb
Posted 30+ days ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

AI Engineer - AI/ML

Remote or Minnetonka, Minnesota

•

Today

Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best.Here, you will find a culture guided by inclusion, talented peers, comprehensive benefits and career development opportunities. Come make an impact on the communities we serve as you help us advance health opt

Full-time

USD 98,500.00 - 176,000.00 per year

Senior AI Engineer, Agentic and RAG Systems

Remote

•

Today

We are seeking a hands-on Senior AI Engineer who designs, builds, and operates production GenAI systems - agentic workflows, RAG pipelines, and LLM-backed services with real users and real SLAs. This is an engineering role, not a research role. The bar is reliability, latency, cost, observability, and safe deployment at scale, with end-to-end ownership from architecture through on-call. Typical workloads include enterprise knowledge platforms, conversational analytics, agentic automation, and LL

Full-time

Senior AI Engineer

Remote

•

Today

Senior AI Engineer (Contract-to-Hire) | US RemoteAbout the RoleWe are seeking an experienced Senior AI Engineer to join a leading healthcare retail enterprise driving large-scale AI transformation initiatives across finance, healthcare, and enterprise applications. This role offers the opportunity to design, build, and deploy production-grade Generative AI and Machine Learning solutions that deliver measurable business value. The ideal candidate combines deep expertise in AI/ML technologies with

Easy Apply

Contract

Depends on Experience

AI / Gen AI Engineer

Remote

•

8d ago

Sr AI / Gen AI Engineer Location :: Remote with Travel Duration ::Long Term Job Description: We are looking for a hands-on AI Engineer to build agentic AI solutions on Google Cloud, with strong focus on OCR, document processing, structured data extraction, and vector-based retrieval systems (RAG). Key Responsibilities Build AI solutions using Vertex AI and GeminiDesign OCR and document extraction pipelines using Google Document AIImplement RAG architectures with Vector DatabasesBuild and manage

Easy Apply

Contract

Depends on Experience

Search all similar jobs

Remote jobs at Katalyst Healthcares and Lifesciences