Overview
Skills
Job Details
Data Engineer
Charlotte, NC (Need Onsite day 1, hybrid 3 days from office).
Job Description:
We're looking for a Data Engineer with strong technical skills and a programmer's mindset to support both legacy systems and an upcoming data lake initiative.
Responsibilities
Use agile software development processes to make iterative improvements to our back-end systems.
Model front-end and back-end data sources to help draw a more comprehensive picture of user flows throughout the system and to enable powerful data analysis.
Build data pipelines that clean, transform, and aggregate data from disparate sources.
Develop models that can be used to make predictions and answer questions for the overall business.
Develops and maintains scalable data pipelines and builds out new API integrations to support continuing increases in data volume and complexity.
Collaborates with analytics and business teams to improve data models that feed business intelligence tools, increasing data accessibility and fostering data-driven decision making across the organization.
Implements processes and systems to monitor data quality, ensuring production data is always accurate and available for key stakeholders and business processes that depend on it.Writes unit/integration tests, contributes to engineering wiki, and documents work.
Performs data analysis required to troubleshoot data related issues and assist in the resolution of data issues.
Works closely with a team of frontend and backend engineers, product managers, and analysts.
Defines company data assets (data models), spark, sparkSQL, and hiveSQL jobs to populate data models.
Requirements:
Legacy Pipeline (Immediate ramp-up):
SQL, Liquibase, SSIS, TFS, AutoSys
New Data Lake Pipeline (Upcoming project):
Hive, Dremio, Python, PySpark, Airflow, Data Lake Architecture
Ideal Candidate Should Have:
Strong hands-on experience in some or most of the above technologies
Excellent documentation skills
Ability to read and understand compiled codebases (Java, C#); ability to write in these languages is a plus
A programmer's mindset to solve complex data and integration challenges
Preferred, but not required:
Experience in building or maintaining ETL processes