Overview
On Site
140000
Full Time
Skills
Attention to detail
Quality control
Data cleansing
Data Science
Version control
Data Analysis
Organizational skills
Data processing
Technical writing
Cloud storage
Google Cloud Platform
Bioinformatics
Data
Management
Research
Meta-data management
Cloud computing
Linux
Bash
HPC
Python
Git
GitHub
Statistics
Science
Collaboration
Genomics
Writing
Amazon Web Services
SQL
Privacy
Health care
Job Details
We are currently partered with an amazing non-profit, based in New York, who need to hire a Data Manager.
ESSENTIAL FUNCTIONS/RESPONSIBILITIES
Data releases
Data organization
Data sharing and support
MINIMUM QUALIFICATIONS
Education
Required Experience
Desired Experience
ESSENTIAL FUNCTIONS/RESPONSIBILITIES
Data releases
- Maintain the Bioinformatics quality control pipeline, used to verify consistency with phenotypic data and ensure the integrity of released genomic data
- Execute existing variant calling pipelines, which will be included in data releases
- Package whole-exome and whole-genome data for regular curated and rapid releases
- Manage ad hoc genetic data releases for various cohorts and datasets
Data organization
- Coordinate data receipt from external investigators, vendors, and research groups
- Perform incoming data cleaning, including deidentifying sample identifiers, organizing data files, and ensuring consistent metadata
- Manage the organization of raw, cleaned, and released data on our local cluster environment
- Harmonize our large collection of heterogeneous datasets hosted on our Base
Data sharing and support
- Support data sharing for investigators and collaborations
- Respond promptly to dataset questions from external investigators
- Support data access for cloud platforms
MINIMUM QUALIFICATIONS
Education
- B.S. or M.S. in data science, bioinformatics, or a related discipline
Required Experience
- At least 4+ years' relevant work experience
- Extensive experience with Linux/bash
- Experience working in an HPC environment
- Experience with Python
- Experience with version control using git / GitHub
- Basic skills in data analysis and statistics
- Strong organizational skills and outstanding attention to detail
- Effective oral and written communicator
- Ability to thrive in collaborative environments
Desired Experience
- Enthusiasm for open science and collaboration
- Experience with genomics data processing and analysis
- Experience writing technical documentation
- Experience with cloud storage solutions (AWS, Google Cloud, Terra)
- Working knowledge of SQL
- Familiarity with data privacy and security regulations in the healthcare or research domain