Senior Data Engineer

  • Columbus, OH
  • Posted 7 hours ago | Updated 7 hours ago

Overview

Hybrid
Depends on Experience
Accepts corp to corp applications
Contract - Independent
Contract - W2

Skills

Apache Hive
Machine Learning (ML)
Python
Public Health
Cloudera
Data Collection
Git
Innovation
Jupyter

Job Details

Seeking a Senior Data Engineer with experience building data collection, ingestion, and curation processes using Cloudera Machine Learning (CML) based services, and experience collecting and managing Public Health data from public sources using APIs. The role also requires experience developing data automation jobs with Python-based Jupyter notebooks to collect data from remote sources and curate it into Apache Hive tables.

Interview: Teams

Location: Columbus, OH

Posting: 766795

Work: ON-SITE

Role and Experience

  • Experience collecting and managing Public Health data from public sources, including the US Census and the Centers for Disease Control and Prevention (CDC), using APIs
  • Experience collecting and managing Public Health data from agencies via the Innovation Platform (IOP)
  • Experience using Git-based projects in the IOP Cloudera Machine Learning (CML) environment to manage Public Health data
  • Experience building data collection, ingestion, and curation processes using CML-based services and libraries
  • Experience developing data automation jobs with Python-based Jupyter notebooks to collect data from remote sources and curate it into Apache Hive tables