Lead Data Science Engineer

New York, NY, US • Posted 12 hours ago • Updated 12 hours ago
Full Time
On-site
USD $135,000.00 - 180,000.00 per year
Fitment

Dice Job Match Score™

🛠️ Calibrating flux capacitors...

Job Details

Skills

  • Innovation
  • Analytics
  • Advanced Analytics
  • Partnership
  • Biostatistics
  • Analytical Skill
  • Communication
  • Reporting
  • Data Engineering
  • Data Modeling
  • Extract
  • Transform
  • Load
  • Python
  • SQL
  • Authentication
  • Authorization
  • Auditing
  • Unstructured Data
  • Data Quality
  • Meta-data Management
  • Documentation
  • Collaboration
  • Agile
  • Statistics
  • Data Science
  • Computer Science
  • Data Architecture
  • Change Data Capture
  • Batch Processing
  • Job Scheduling
  • Snow Flake Schema
  • Docker
  • Kubernetes
  • Git
  • GitHub
  • Continuous Integration
  • Continuous Delivery
  • Terraform
  • Clinical Trials
  • Research
  • Cloud Computing
  • Vector Databases
  • Machine Learning (ML)
  • Microsoft Certified Professional
  • Access Control
  • Management
  • Regulatory Compliance
  • Lifecycle Management
  • Artificial Intelligence
  • Market Analysis
  • Sales
  • Insurance
  • SAP BASIS
  • SAP MM
  • Military
  • DirectShow
  • DS
  • Law

Summary

Location: New York, Hybrid

Medidata follows a hybrid office policy in which employees who are hired for an in-person position are expected to work on site a certain number of days per week in accordance with Company policy.

About our Company:

Medidata is powering smarter treatments and healthier people through digital solutions to support clinical trials. Celebrating 25 years of ground-breaking technological innovation across more than 36,000 trials and 11 million patients, Medidata offers industry-leading expertise, analytics-powered insights, and one of the largest clinical trial data sets in the industry. More than 1 million users trust Medidata's seamless, end-to-end platform to improve patient experiences, accelerate clinical breakthroughs, and bring therapies to market faster. Discover more at

Our Team:

Medidata is looking for individuals who will help us tackle some of the most complex questions facing the industry today using our proprietary platform and advanced analytics. At Medidata, we never work alone. This role will partner heavily with all of the key stakeholder functions including product, delivery, data science, engineering, partnerships, and biostatistics. Successful Medidata AI candidates will be skilled in analytical/quantitative thinking, structured communication, and excited about building the next horizon of Medidata's mission to power smarter treatments and healthier people. You will be reporting to Director, Data Engineering.

Responsibilities:
  • Apply advanced skills in data architecture, data science engineering, data modeling, and data quality using modern cloud-native technologies.
  • Develop ETL pipelines, working with vector databases, automation, and CI/CD using tools such as Python, SQL, and Git.
  • Established MCP governance standards for agent interactions, including authentication, authorization, audit logging, context management, and compliance controls.
  • Develop LLM applications using Retrieval-Augmented Generation (RAG) and support fine-tuning for domain-specific tasks.
  • Analyze and manipulate both structured and unstructured data sources, ensuring high data quality and readiness for downstream consumers.
  • Built agent-driven metadata management solutions to maintain data catalogs, business glossaries, lineage documentation, and governance policies.
  • Document and communicate technical work clearly to stakeholders at all levels, both technical and non-technical.
  • Collaborate effectively in Agile environments and cross-functional teams, building secure, scalable data pipelines into Snowflake from both on-premise and cloud-based sources.

Qualifications:
  • Bachelor's degree in a technical or scientific field, such as Statistics, Data Science, Computer Science, or similar
  • 7+ years of experience in roles such as Data Scientist or Data Engineer with a strong foundation in Enterprise Data Architecture and Engineering
  • Hands-on experience with tools and concepts such as Airflow, CDC, batch processing, and job scheduling.
  • Experienced in building scalable, cloud-native data pipelines using tools and services like Streamlit, Snowflake and containerization platforms like Docker/Kubernetes.
  • Proficient in Git/GitHub, GitHub Actions for CI/CD, and managing infrastructure as code using Terraform
  • Experience with clinical trial data is not required, but interest to learn and understand how these data improve medical research is paramount
  • Hands-on experience building high-throughput data pipelines across cloud platforms and MCP server environments. Proficient in implementing RAG architectures, vector databases, and low-latency retrieval layers.
  • Skilled in integrating AI/ML pipelines into production-grade data infrastructure while establishing MCP governance frameworks, including model access controls, context management policies, auditability, security standards, compliance monitoring, and lifecycle management to ensure secure, scalable, and responsible AI operations.

The salary range posted below refers only to positions that will be physically based in New York City. As with all roles, Medidata sets ranges based on a number of factors including function, level, candidate expertise and experience, and geographic location. Pay ranges for candidates in locations other than New York City, may differ based on the local market data in that region.

The salary range range for this position physically based in NYC/ NJ Metro Area is $135,000-$180,000.

Base pay is one part of the Total Rewards that Medidata provides to compensate and recognize employees for their work. Most sales positions are eligible for a commission on the terms of applicable plan documents, and many of Medidata's non-sales positions are eligible for annual bonuses. Medidata believes that benefits should connect you to the support you need when it matters most and provides best-in-class benefits, including medical, dental, life and disability insurance; 401(k) matching; flexible paid time off; and 10 paid holidays per year.

Equal Employment Opportunity:

In order to provide equal employment and advancement opportunities to all individuals, employment decisions at Medidata are based on merit, qualifications and abilities. Medidata is committed to a policy of non-discrimination and equal opportunity for all employees and qualified applicants without regard to race, color, religion, gender, sex (including pregnancy, childbirth or medical or common conditions related to pregnancy or childbirth), sexual orientation, gender identity, gender expression, marital status, familial status, national origin, ancestry, age, disability, veteran status, military service, application for military service, genetic information, receipt of free medical care, or any other characteristic protected under applicable law. Medidata will make reasonable accommodations for qualified individuals with known disabilities, in accordance with applicable law.

Applications will be accepted on an ongoing basis until the position is filled.

#LI-Hybrid

#LI-MM1

Inclusion statement

In order to provide equal employment and advancement opportunities to all individuals, employment decisions at 3DS are based on merit, qualifications and abilities. 3DS is committed to a policy of non-discrimination and equal opportunity for all employees and qualified applicants without regard to race, color, religion, gender, sex (including pregnancy, childbirth or medical or common conditions related to pregnancy or childbirth), sexual orientation, gender identity, gender expression, marital status, familial status, national origin, ancestry, age (40 and above), disability, veteran status, military service, application for military service, genetic information, receipt of free medical care, or any other characteristic protected under applicable law. 3DS will make reasonable accommodations for qualified individuals with known disabilities, in accordance with applicable law. Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable state laws and local ordinances. We are committed to fair employment practices and will evaluate all candidates based on their qualifications, regardless of past arrest or conviction history.

Salary Pay Transparency

Compensation for the role will be commensurate with experience. The total expected compensation range will be between $135000 and $180000, representing the base salary (or annualized salary based on estimated hourly compensation) and target bonus.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: RTX172115
  • Position Id: c1967de51f8d9ddf3debb557455ab508
  • Posted 12 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

New York, New York

Today

Full-time

USD 116,500.00 per year

Newark, New Jersey

Today

Full-time

USD 115,000.00 - 155,000.00 per year

Hoboken, New Jersey

Today

Full-time

USD 120,000.00 - 140,000.00 per year

New York, New York

13d ago

Easy Apply

Contract

Depends on Experience

Search all similar jobs