Data Engineer

Overview

Remote
Full Time

Skills

Attention to detail
Natural language processing
Information Technology
Web Scraping
SQL
Unstructured data
Data retention
Data engineering
Computer science
Data management
Programming languages
Data cleansing
Data mining
Machine Learning (ML)
Information retrieval
Production support
Data
Design
Writing
Scripting
Policies
Statistics
Software development
Apache Spark
Apache Kafka
C
Perl
JavaScript
Visualization
Reporting
MapReduce
Apache Mahout

Job Details

Data Engineer
Information Technology

Job Description:

Data Engineer

Data Engineer Roles and Responsibilities:
  • Collect and process raw data at scale for a variety of projects and initiatives
  • Design and develop data applications using selected tools and frameworks as required and requested for a variety of teams and projects
  • Read, extract, transform, stage and load data to selected tools and frameworks as required and requested
  • Perform tasks such as writing scripts, web scraping, calling APIs, write SQL queries, etc
  • Work closely with the engineering team to integrate your work into our production systems
  • Process unstructured data into a form suitable for analysis
  • Analyze processed data
  • Support business decisions with ad hoc analysis as needed
  • Monitoring data performance and modifying infrastructure as needed
  • Define data retention policies

Data Engineer Skills/Requirements:
  • 5-7 years of recent experience in data engineering
  • Bachelor's Degree or more in Computer Science or a related field
  • A solid track record of data management showing your flawless execution and attention to detail
  • Strong knowledge of and experience with statistics
  • Programming experience, ideally inPython, Spark, Kafka orJava, and a willingness to learn new programming languages to meet goals and objectives
  • Experience in C, Perl, JavaScript or other programming languages is a plus
  • Knowledge of data cleaning, wrangling, visualization and reporting, with an understanding of the best, most efficient use of associated tools and applications to complete these tasks
  • Experience in MapReduce is a plus
  • Deep knowledge of data mining, machine learning, natural language processing, or information retrieval
  • Experience processing large amounts of structured and unstructured data, including integrating data from multiple sources
  • Experience with machine learning toolkits including, H2O, SparkML or Mahout
  • A willingness to explore new alternatives or options to solve data mining issues, and utilize a combination of industry best practices, data innovations and your experience to get the job done
  • Experience in production support and troubleshooting

#JoinCSI

Job Details

City :
REMOTE

State :