Bioinformatics Data Manager

  • New York, NY
  • Posted 11 days ago | Updated 4 hours ago

Overview

On Site
140000
Full Time

Skills

Attention to detail
Quality control
Data cleansing
Data Science
Version control
Data Analysis
Organizational skills
Data processing
Technical writing
Cloud storage
Google Cloud Platform
Bioinformatics
Data
Management
Research
Meta-data management
Cloud computing
Linux
Bash
HPC
Python
Git
GitHub
Statistics
Science
Collaboration
Genomics
Writing
Amazon Web Services
SQL
Privacy
Health care

Job Details

We are currently partered with an amazing non-profit, based in New York, who need to hire a Data Manager.

ESSENTIAL FUNCTIONS/RESPONSIBILITIES

Data releases
  • Maintain the Bioinformatics quality control pipeline, used to verify consistency with phenotypic data and ensure the integrity of released genomic data
  • Execute existing variant calling pipelines, which will be included in data releases
  • Package whole-exome and whole-genome data for regular curated and rapid releases
  • Manage ad hoc genetic data releases for various cohorts and datasets

Data organization
  • Coordinate data receipt from external investigators, vendors, and research groups
  • Perform incoming data cleaning, including deidentifying sample identifiers, organizing data files, and ensuring consistent metadata
  • Manage the organization of raw, cleaned, and released data on our local cluster environment
  • Harmonize our large collection of heterogeneous datasets hosted on our Base


Data sharing and support
  • Support data sharing for investigators and collaborations
  • Respond promptly to dataset questions from external investigators
  • Support data access for cloud platforms

MINIMUM QUALIFICATIONS

Education
  • B.S. or M.S. in data science, bioinformatics, or a related discipline

Required Experience
  • At least 4+ years' relevant work experience
  • Extensive experience with Linux/bash
  • Experience working in an HPC environment
  • Experience with Python
  • Experience with version control using git / GitHub
  • Basic skills in data analysis and statistics
  • Strong organizational skills and outstanding attention to detail
  • Effective oral and written communicator
  • Ability to thrive in collaborative environments

Desired Experience
  • Enthusiasm for open science and collaboration
  • Experience with genomics data processing and analysis
  • Experience writing technical documentation
  • Experience with cloud storage solutions (AWS, Google Cloud, Terra)
  • Working knowledge of SQL
  • Familiarity with data privacy and security regulations in the healthcare or research domain