Senior Data Scientist

Overview

On Site
Full Time

Skills

Reporting
Social Sciences
Pivotal
Art
Algorithms
Large Language Models (LLMs)
FOCUS
Science
Product Development
Bridging
Computer Science
Mathematics
Informatics
Data Engineering
Data Architecture
Database Administration
Data Visualization
Data Extraction
Numerical Analysis
Statistics
Deep Learning
Generalized Linear Model
SEM
Scanning Electron Microscope
Linux
Cloud Computing
High Performance Computing
Amazon Web Services
Data Processing
Analytics
Amazon EC2
Amazon Redshift
Machine Learning (ML)
Amazon SageMaker
Computer Cluster Management
Scheduling
Data Warehouse
Extract
Transform
Load
ELT
SQL
NoSQL
Database
Data Modeling
Big Data
Apache Hadoop
Apache Spark
Continuous Integration
Continuous Delivery
API
Infrastructure Architecture
Scripting
Stata
C++
R
Python
Pandas
NumPy
TensorFlow
PyTorch
Object-Oriented Programming
Amazon S3
Version Control
Git
Code Review
Unit Testing
Documentation
Cross-functional Team
Agile
Communication
Attention To Detail
Conflict Resolution
Problem Solving
Research
Screening
Data Science
Employment Authorization
Accessibility
Business Intelligence
Payroll
Jersey
Life Insurance
Management
Professional Development
LinkedIn
Taxes
Legal
Finance

Job Details

Position Description

This is a fully benefited, full-time Harvard University position that has been funded through 7/31/2027. There is the possibility of renewal, contingent on funding, university priorities and satisfactory job performance.

The position will be instrumental in supporting the computational needs of Harvard faculty-led projects supported by the Harvard Data Science Initiative (HDSI). The position resides within the HDSI but will have a close collaborative relationship with - and dotted reporting line to - the Institute for Quantitative Social Sciences (IQSS). The successful applicant must be able to understand complex research problems at a fundamental level. Broadly, this individual will contribute to data science-driven projects by constructing outputs intended for broad open access by different stakeholders. Furthermore, this role is pivotal in supporting research activities at HDSI by developing advanced data models, managing architectures, utilizing cloud services for scalable data processing, employing state-of-the-art statistical techniques, managing and analyzing large datasets, and applying machine learning algorithms and large language models to derive meaningful insights. Interactions with industry partners such as Amazon Web Services (AWS) will provide valuable exposure to advanced high- performance cloud computing capabilities.

The HDSI supports research in data science methodology and applications through multiple programs, including industry-sponsored research collaborations that align with research interests across Harvard schools, with a focus on understanding, mitigating, and finding solutions to global health, environmental, food, and social crises.

IQSS is a university-wide initiative working with staff, affiliated faculty, students, fellows, and visiting scholars. Partnering with faculty and researchers to incubate data-intensive social science research initiatives, IQSS offers dedicated support to these scientific programs through administrative services, data science services, research administration, technical infrastructure and product development. IQSS builds bridges from academia to the rest of the world by collaborating with industry and governments in incentive-compatible ways to produce social good for academia, companies, and the general public.

This position reports to the Executive Director of the HDSI and operates within the HDSI and within a team of senior data scientists at IQSS. The HDSI Scientific Director provides strategic support and guidance. The ideal candidate will demonstrate a passion for pushing the boundaries of data science technologies to solve urgent global challenges, enthusiasm for contributing to a collaborative and innovative environment, and an affinity for joining teams of academic researchers to advance knowledge for society's benefit.

Basic Qualifications

  • Minimum of seven years' post-secondary education or relevant work experience


Additional Qualifications and Skills

  • Bachelor's or Master's Degree in Statistics, Data Science, Computer Science, Mathematics, Informatics, or other health data related field.
  • Prior work within a research environment is essential, including familiarity with the research pipeline and the process of conducting research employing accepted scientific experimental practices.
  • Knowledgeable of;
    • data engineering, data architecture, database management, and data visualization techniques, with high proficiency in data extraction and wrangling.
    • numerical methods, statistical analysis, machine learning, and deep learning. More specifically, experience fitting and interpreting a range of models, including at least some of: GLM, GLMM, SEM, econometric models, machine learning models
  • The ability to create and maintain databases using libraries from Python, R in Linux environment.
  • Expertise in leveraging cloud platforms and high-performance computing environments, especially Amazon Web Services (AWS), for scalable data processing and analytics (EC2, S3, Redshift, Lambda), and machine learning tools (SageMaker, Glue, Athena) and cluster management and scheduling systems (e.g., slurm)
  • Experience with;
    • data warehousing and ETL/ELT processes. Skilled in SQL, NoSQL databases, and data modeling techniques. Experience with big data technologies and ecosystems (e.g. Hadoop, Spark).
    • CI/CD pipelines for data science projects and their reliable deployment. Assisting with release of models/products to the proper platform (e.g., a website, an interactive API, etc.), including infrastructure design.


  • Background in scientific programming/scripting (Python, R, Stata, and C++);
    • 3+ years of experience using either Python or R in a data science and/or research context required.
    • 5+ years of this experience preferred. More specifically, advanced skills in Python libraries for data science (Pandas, NumPy, sci-kit learn, TensorFlow/PyTorch) or experience using object-oriented programming systems in R (e.g., S3, S4, RC, R6).
  • Adherence to best practices in scientific programming, including version control (Git), code review, unit testing, and documentation to ensure reproducibility and maintainability of data science projects.
  • Proven track record of success in working in a cross-functional team in an agile environment.
  • Excellent communication skills; able to simplify complex technical concepts to stakeholders.
  • Detail-oriented expertise, with strong problem-solving skills to support research.
  • Strong team player with a service mindset, able to guide researchers and is customer focused.
  • Awareness of and aptitude to appropriately and effectively understand, respect, and adapt to cultural and identity-based differences within group environments, and experience fostering and reinforcing an environment that values unique experiences, cultures, backgrounds, and goals.


Working Conditions

  • Work is performed in an office setting
  • Occasionally required to work outside of normal business hours and may be contacted during off hours


Additional Information

Candidates who move forward in the process may be asked to complete a coding exercise as part of the interview.

Please note:
  • Harvard University requires pre-employment reference and background screening
  • The Harvard Data Science Initiative is unable to provide work authorization and/or visa sponsorship.
  • This position has a 90-day orientation and review period.
  • This is a fully benefited, full-time Harvard University position that has been funded through 7/31/2027. There is the possibility of renewal, contingent on funding, university priorities and satisfactory job performance.


Accessibility:

Harvard University welcomes individuals with disabilities to apply for positions and participate in its programs and activities. If you would like to request an accommodation or have questions about the physical access provided, please contact our University Disability Resources Department.

Work Format Details

Harvard University actively supports hybrid work where business needs allow. This position has been designated as primarily. While this position is primarily remote, travel to campus may be necessary based on business needs and the nature of work. Examples include bi-annual or quarterly Town Halls, critical business meetings or other work events. Additional details will be discussed during the interview process. All remote work must be performed within one of the Harvard Registered Payroll States, which currently includes Massachusetts, Connecticut, Maine, New Hampshire, Rhode Island, Vermont, Georgia, Illinois, Maryland, New Jersey, New York, Virginia, Washington, and California (CA for exempt positions only). Certain visa types and funding sources may limit work location. Individuals must meet work location sponsorship requirements prior to employment.

Benefits

We invite you to visit Harvard's Total Rewards website ( ) to learn more about our outstanding benefits package, which may include:

  • Paid Time Off: 3-4 weeks of accrued vacation time per year (3 weeks for support staff and 4 weeks for administrative/professional staff), 12 accrued sick days per year, 12.5 holidays plus a Winter Recess in December/January, 3 personal days per year (prorated based on date of hire), and up to 12 weeks of paid leave for new parents who are primary care givers.
  • Health and Welfare: Comprehensive medical, dental, and vision benefits, disability and life insurance programs, along with voluntary benefits. Most coverage begins as of your start date.
  • Work/Life and Wellness: Child and elder/adult care resources including on campus childcare centers, Employee Assistance Program, and wellness programs related to stress management, nutrition, meditation, and more.
  • Retirement: University-funded retirement plan with contributions from 5% to 15% of eligible compensation, based on age and earnings with full vesting after 3 years of service.
  • Tuition Assistance Program: Competitive program including $40 per class at the Harvard Extension School and reduced tuition through other participating Harvard graduate schools.
  • Tuition Reimbursement: Program that provides 75% to 90% reimbursement up to $5,250 per calendar year for eligible courses taken at other accredited institutions.
  • Professional Development: Programs and classes at little or no cost, including through the Harvard Center for Workplace Development and LinkedIn Learning.
  • Commuting and Transportation: Various commuter options handled through the Parking Office, including discounted parking, half-priced public transportation passes and pre-tax transit passes, biking benefits, and more.
  • Harvard Facilities Access, Discounts and Perks: Access to Harvard athletic and fitness facilities, libraries, campus events, credit union, and more, as well as discounts to various types of services (legal, financial, etc.) and cultural and leisure activities throughout metro-Boston.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.