Data Engineer

Overview

Remote

Full Time

Skills

Information Technology

Writing

Scripting

Web Scraping

SQL

Data Retention

Data Engineering

Computer Science

Data Management

Attention To Detail

Statistics

Apache Spark

Apache Kafka

Perl

JavaScript

Programming Languages

Data Cleansing

Visualization

Reporting

MapReduce

Natural Language Processing

Information Retrieval

Unstructured Data

Machine Learning (ML)

Apache Mahout

Data Mining

Production Support

Job Details

Data Engineer
Information Technology

Job Description:

Data Engineer

Data Engineer Roles and Responsibilities:

Collect and process raw data at scale for a variety of projects and initiatives
Design and develop data applications using selected tools and frameworks as required and requested for a variety of teams and projects
Read, extract, transform, stage and load data to selected tools and frameworks as required and requested
Perform tasks such as writing scripts, web scraping, calling APIs, write SQL queries, etc
Work closely with the engineering team to integrate your work into our production systems
Process unstructured data into a form suitable for analysis
Analyze processed data
Support business decisions with ad hoc analysis as needed
Monitoring data performance and modifying infrastructure as needed
Define data retention policies

Data Engineer Skills/Requirements:

5-7 years of recent experience in data engineering
Bachelor's Degree or more in Computer Science or a related field
A solid track record of data management showing your flawless execution and attention to detail
Strong knowledge of and experience with statistics
Programming experience, ideally inPython, Spark, Kafka orJava, and a willingness to learn new programming languages to meet goals and objectives
Experience in C, Perl, JavaScript or other programming languages is a plus
Knowledge of data cleaning, wrangling, visualization and reporting, with an understanding of the best, most efficient use of associated tools and applications to complete these tasks
Experience in MapReduce is a plus
Deep knowledge of data mining, machine learning, natural language processing, or information retrieval
Experience processing large amounts of structured and unstructured data, including integrating data from multiple sources
Experience with machine learning toolkits including, H2O, SparkML or Mahout
A willingness to explore new alternatives or options to solve data mining issues, and utilize a combination of industry best practices, data innovations and your experience to get the job done
Experience in production support and troubleshooting

#JoinCSI

Job Details

City :
REMOTE

State :

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Job Details

Share