Lead Data Engineer

Overview

On Site
USD 70-85
Full Time
Part Time
Accepts corp-to-corp applications
Contract - W2
Contract - Independent

Skills

SQL
Python
Spark
CI/CD
AWS
Azure
ML
Data Architecture
Data Solutions

Job Details

Position: Lead Data Engineer

Location: Charlotte, NC 28217 (2 days/week onsite)

Duration: 6-month contract-to-hire

Requirements:

  • Python, SQL, and PySpark
  • Azure preferred, but open to other cloud platforms (Google Cloud Platform, AWS)
  • Databricks, Snowflake, and Fabric
  • Building batch and real-time streaming pipelines (see the sketch after this list)
  • Gathering requirements from the business
  • Mentoring Data Engineers on the team who have room for growth
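To illustrate the batch and streaming pipeline work called out above, here is a minimal PySpark sketch. It is not part of the job description: the paths, Kafka topic, and column names are hypothetical, chosen only to show the shape of a batch aggregation and its real-time counterpart.

# A minimal, illustrative PySpark sketch (not from the job description).
# Paths, the Kafka topic, and column names are hypothetical.
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("orders-pipeline").getOrCreate()

# Batch: load raw orders, aggregate revenue per region per day, write Parquet.
orders = spark.read.csv("/data/raw/orders", header=True, inferSchema=True)
daily = (
    orders.withColumn("order_date", F.to_date("order_ts"))
    .groupBy("region", "order_date")
    .agg(F.sum("amount").alias("revenue"))
)
daily.write.mode("overwrite").partitionBy("order_date").parquet("/data/curated/daily_revenue")

# Streaming: the real-time counterpart reads from Kafka instead of files
# (requires the spark-sql-kafka connector on the classpath).
stream = (
    spark.readStream.format("kafka")
    .option("kafka.bootstrap.servers", "broker:9092")
    .option("subscribe", "orders")
    .load()
    .selectExpr("CAST(value AS STRING) AS payload")
)
query = (
    stream.writeStream.format("console")
    .option("checkpointLocation", "/tmp/orders-checkpoint")
    .start()
)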

Nice-to-have skills that would make a candidate exceptional: certifications, 2+ years of data architecture experience, familiarity with data engineering patterns, Git experience, a self-starter and go-getter attitude, a software engineering background, and ML experience.



Job Description: As the Technical Lead Data Engineer, your primary responsibility will be to lead the design, development, and implementation of data solutions that enable our organization to derive actionable insights from complex datasets. You will guide a team of data engineers, foster collaboration with cross-functional teams, and drive initiatives to strengthen our data infrastructure, CI/CD pipelines, and analytics capabilities.



Responsibilities:

  • Apply advanced knowledge of Data Engineering principles, methodologies, and techniques to design and implement data loading and aggregation frameworks across broad areas of the organization.
  • Gather and process raw, structured, semi-structured, and unstructured data using batch and real-time data processing frameworks.
  • Implement and optimize data solutions in enterprise data warehouses and big data repositories, focusing primarily on movement to the cloud.
  • Drive new and enhanced capabilities for Enterprise Data Platform partners to meet the needs of product, engineering, and business teams.
  • Build enterprise systems using Databricks, Snowflake, and cloud platforms such as Azure, AWS, and Google Cloud Platform.
  • Leverage strong Python, Spark, and SQL programming skills to construct robust pipelines for efficient data processing and analysis.
  • Implement CI/CD pipelines to automate build, test, and deployment processes and accelerate the delivery of data solutions (a test sketch follows this list).
  • Implement data modeling techniques to design and optimize data schemas, ensuring data integrity and performance.
  • Drive continuous improvement initiatives to enhance the performance, reliability, and scalability of our data infrastructure.
  • Collaborate with data scientists, analysts, and other stakeholders to understand business requirements and translate them into technical solutions.
  • Implement best practices for data governance, security, and compliance to ensure the integrity and confidentiality of our data assets.
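As an illustration of the CI/CD responsibility above, the kind of automated check such a pipeline would run on every commit might look like the following pytest sketch. The add_revenue transformation and its column names are hypothetical, not part of the job description.

# A minimal sketch of a unit test a CI/CD pipeline could run before deploying
# a data job. The transformation under test (add_revenue) is hypothetical.
import pytest
from pyspark.sql import SparkSession, functions as F

def add_revenue(df):
    # Hypothetical transformation: revenue = quantity * unit_price.
    return df.withColumn("revenue", F.col("quantity") * F.col("unit_price"))

@pytest.fixture(scope="session")
def spark():
    return SparkSession.builder.master("local[1]").appName("ci-tests").getOrCreate()

def test_add_revenue_computes_row_totals(spark):
    df = spark.createDataFrame([(2, 5.0), (3, 4.0)], ["quantity", "unit_price"])
    result = add_revenue(df).orderBy("quantity").collect()
    assert [row["revenue"] for row in result] == [10.0, 12.0]

In a CI pipeline, a build step would run tests like this automatically before promoting the job to production.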

Qualifications:

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
  • Proven experience (8+ years) in a data engineering role, with expertise in designing and building data pipelines, ETL processes, and data warehouses.
  • Strong proficiency in SQL, Python, and Spark.
  • Strong experience with cloud platforms such as AWS, Azure, or Google Cloud Platform is a must.
  • Hands-on experience with big data technologies such as Hadoop, Spark, and Kafka, and with distributed computing frameworks.
  • Knowledge of data lake and data warehouse solutions (Databricks, Snowflake, Amazon Redshift, Google BigQuery) and orchestration tools such as Azure Data Factory and Airflow.
  • Experience in implementing CI/CD pipelines for automating build, test, and deployment processes.
  • Solid understanding of data modeling concepts, data warehousing architectures, and data management best practices.