Principal Data engineer // NYC, NY // Onsite


Alliance IT
Dice Job Match Score™
🤯 Applying directly to the forehead...
Job Details
Skills
- Analytics
- Apache Flink
- Apache HTTP Server
- Business Intelligence
- Cloud Computing
- Collaboration
- Computer Science
- Data Architecture
- Data Engineering
- Data Governance
- Data Modeling
- Data Processing
- Data Quality
- Data Science
- Data Storage
- ELT
- Extract, Transform, Load
- Java
- Machine Learning (ML)
- Management
- Orchestration
- Python
- Real-time
- Regulatory Compliance
- SQL
- Scala
- Streaming
- Unstructured Data
- Workflow
- amazon dynamodb
- Airflow
- bigdata
Summary
Job Description
We are seeking Data Engineers in a contract capacity to join our talented Data Engineering team.
In this role, you''ll contribute to building robust data pipelines that ingest and/or transform over 30TB of data each day.
Job Overview
We are seeking experienced Data Engineers (Contract) to join our dynamic Data Engineering team. In this role, you will design, build, and maintain scalable data pipelines that ingest, process, and transform 30+ TB of data daily. You will work closely with data scientists, analysts, and product teams to ensure reliable, high-quality data is available for analytics, machine learning, and business intelligence.
The ideal candidate has strong experience with large-scale distributed data systems, cloud platforms, and ETL/ELT pipelines, along with a passion for building efficient and reliable data infrastructure.
Key Responsibilities
Design, develop, and maintain scalable data pipelines to ingest and process large volumes of structured and unstructured data.
Build and optimize ETL/ELT workflows for high-volume data processing (30TB+ daily).
Work with batch and streaming data pipelines to ensure real-time and near-real-time data availability.
Implement data quality, validation, and monitoring frameworks to ensure data reliability.
Collaborate with data scientists, analysts, and software engineers to understand data requirements and deliver high-quality datasets.
Optimize data storage, partitioning, and query performance across large datasets.
Manage and orchestrate workflows using pipeline orchestration tools.
Ensure data governance, security, and compliance standards are maintained.
Troubleshoot and resolve data pipeline failures and performance bottlenecks.
Document data architecture, workflows, and engineering best practices.
Required Qualifications
Bachelor’s or Master’s degree in Computer Science, Engineering, Data Science, or related field.
15+ years of experience in Data Engineering or related roles.
Strong programming skills in Python, Scala, or Java.
Expertise in SQL and data modeling techniques.
Experience building large-scale data pipelines handling TB-scale datasets.
Hands-on experience with distributed data processing frameworks such as:
Apache Spark
Apache Hadoop
Apache Flink (optional)
- Dice Id: 10494622
- Position Id: 8909676
- Posted 19 hours ago
Company Info
About Alliance IT
Similar Jobs
It looks like there aren't any Similar Jobs for this job yet.
Search all similar jobs