Overview
On Site
Depends on Experience
Full Time
Skills
Amazon Web Services
Apache Flink
Apache Kafka
Apache Spark
Data Engineering
Data Storage
Kronos
Machine Learning (ML)
NumPy
Pandas
PyTorch
Python
Streaming
Time Series
Workflow
scikit-learn
Job Details
Job Title: Senior Data Engineer
Location: New York City
Job Description:
We are seeking a highly skilled Senior Data Engineer with a proven track record in designing, building, and maintaining large-scale, high-throughput data systems. The ideal candidate will have deep expertise in working with time series data, specialized storage systems, and strong experience in both batch and streaming pipelines. This role is critical to integrating robust data infrastructure with advanced machine learning workflows.
Must-Have Qualifications:
- 5+ years of hands-on experience in data engineering, focusing on scalable and high-performance systems
- Extensive experience with time series data and purpose-built storage systems such as KDB+, TimeSet, or Kronos
- Strong expertise in building and maintaining streaming and batch data pipelines using technologies like Apache Kafka, AWS Glue, Apache Flink, or Apache Spark
- Proficiency in Python, with experience integrating pipelines with ML libraries such as pandas, NumPy, scikit-learn, or PyTorch
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.