Data Engineer

Overview

Remote
Full Time

Skills

Information Technology
Writing
Scripting
Web Scraping
SQL
Data Retention
Data Engineering
Computer Science
Data Management
Attention To Detail
Statistics
Apache Spark
Apache Kafka
C
Perl
JavaScript
Programming Languages
Data Cleansing
Visualization
Reporting
MapReduce
Natural Language Processing
Information Retrieval
Unstructured Data
Machine Learning (ML)
Apache Mahout
Data Mining
Production Support

Job Details

Data Engineer
Information Technology

Job Description:

Data Engineer

Data Engineer Roles and Responsibilities:
  • Collect and process raw data at scale for a variety of projects and initiatives
  • Design and develop data applications using selected tools and frameworks as required and requested for a variety of teams and projects
  • Read, extract, transform, stage and load data to selected tools and frameworks as required and requested
  • Perform tasks such as writing scripts, web scraping, calling APIs, write SQL queries, etc
  • Work closely with the engineering team to integrate your work into our production systems
  • Process unstructured data into a form suitable for analysis
  • Analyze processed data
  • Support business decisions with ad hoc analysis as needed
  • Monitoring data performance and modifying infrastructure as needed
  • Define data retention policies

Data Engineer Skills/Requirements:
  • 5-7 years of recent experience in data engineering
  • Bachelor's Degree or more in Computer Science or a related field
  • A solid track record of data management showing your flawless execution and attention to detail
  • Strong knowledge of and experience with statistics
  • Programming experience, ideally inPython, Spark, Kafka orJava, and a willingness to learn new programming languages to meet goals and objectives
  • Experience in C, Perl, JavaScript or other programming languages is a plus
  • Knowledge of data cleaning, wrangling, visualization and reporting, with an understanding of the best, most efficient use of associated tools and applications to complete these tasks
  • Experience in MapReduce is a plus
  • Deep knowledge of data mining, machine learning, natural language processing, or information retrieval
  • Experience processing large amounts of structured and unstructured data, including integrating data from multiple sources
  • Experience with machine learning toolkits including, H2O, SparkML or Mahout
  • A willingness to explore new alternatives or options to solve data mining issues, and utilize a combination of industry best practices, data innovations and your experience to get the job done
  • Experience in production support and troubleshooting

#JoinCSI

Job Details

City :
REMOTE

State :
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.