Data Engineer - (Python OR Java C#), Data Modelling, database design, ETL, SQL, 12+ Mths Cont NYC
JPC - 3507
Level 4: ( 8 to 15 yrs of industry Exp)
Loc: NYC ( Hybrid 4 days a week onsite)
Duration : 12+ months
Title: Data Engineer - (Python OR Java C#), Data Modelling, database design, ETL, SQL, 12+ Mths Cont NYC
Description:
Overview
We are looking for a
Data Engineer to join our team with growing requirement of Data & Analytics. The hire will be responsible for building and optimizing our data storage, ingestion and consumption architecture to provide a strong base for our data platform. Also, they will collaborate with technology partners across the Firm to integrate with new data sources and consumers. Furthermore, they will work on a variety of projects around Sales Data Integrations, Analytics, Digital Marketing Integration, Campaign Management, Lead Generation, Visualization, and Advanced Analytics (Artificial Intelligence and Machine Learning). The ideal candidate is an experienced software developer who enjoys optimizing data and analytics systems and building them from the ground up. They must be self-directed and comfortable supporting the needs of multiple systems and projects. Candidates will be expected to be very hands-on and pick up domain knowledge. We focus on solving the problem and not getting tied down by technological choices, so it is extremely important that candidates are open to learning & working with technologies that are new for them. We support what we build, and the candidate will be responsible for post-production support on a rotation basis.
Responsibilities
Build and optimize data pipelines to consume data from various sources, transform & model it, and then distribute it to multiple consumers.
Collaborate with technology partners across the Firm to understand and integrate with new data sources and consumers.
Work with multiple data sources (including structured and unstructured) to create an optimize data cubes for consumption by analytical dashboards and / or downstream systems.
Identify, design, and implement improvements / automations for optimizing data processing, storage, and availability, while maintaining scalability, etc.
Develop and support analytics & machine learning projects.
Support migration of the data platform from on-premises to cloud infrastructure.
Work with various stakeholders to assist with data & analytics related projects and issues.
Qualification
We are looking for a candidate with 7+ years of experience in Data Engineering role, who has attained a bachelor's or master's degree in computer science, computer engineering, information technology/systems or another relevant technical field.
Advanced working knowledge and experience with data modelling, data warehousing concepts, database design principles, query authoring (SQL) and performance optimization.
A successful history of manipulating, transforming, processing, and extracting value from large, disconnected datasets.
Hands-on programming experience with Python, Java, or C#.
Hands-on experience with ETL tools.
Working knowledge of data transformation, processing, analysis, and machine learning using Python (Pandas, NumPy, etc.)
Excellent problem-solving and analytical skills with attention to details.
Strong project management and organizational skills.
Experience supporting and working with cross-functional teams in a dynamic environment.
Excellent communication and collaboration skills.
Skillset and Experience
Experience with Relational database systems like Sybase, SQL Server, Postgres, etc.
Experience with object-oriented and/or function scripting languages: Python, Java, C++, or Scala, etc.
Experience with ETL tools (Informatica, etc.) and frameworks.
Experience with machine learning / artificial intelligence techniques like regression, classification, clustering, time series, text mining, large language models, etc.
Experience of visualization tools like Tableau, QlikView, Power BI etc. is a plus.
Experience with Cloud technologies like Snowflake, Microsoft Azure, Databricks, etc is a plus.
Experience with data-science tools / workbench like Dataiku, Jupyter, Rapid Miner, etc. is a plus.
Experience with Big Data tools / technologies: Hadoop, Pig, Hive, Hue, PySpark etc. is a plus.
MAJOR SKILLS LOOKING FOR:
Advanced working knowledge and experience with data modelling, data warehousing concepts, database design principles, query authoring (SQL) and performance optimization.
Hands-on programming experience in one language (Python, Java, or C#).
Hands-on experience with ETL (Informatica, etc.) tools and frameworks.
Hands-on experience of data transformation, processing, analysis, and machine learning using Python (Pandas, NumPy, etc.)