Overview
Skills
Job Details
Strong proficiency in Python for data manipulation and pipeline development.
Solid understanding of JSON structures and handling semi-structured data.
Proficiency in SQL for data querying, joining, and aggregation.
Experience with NumPy for numerical and matrix operations.
Knowledge of relational databases (e.g., PostgreSQL, MySQL) and data modeling.
Familiarity with version control systems (e.g., Git).
Excellent problem-solving skills and attention to detail.
[Optional] Experience with cloud platforms like AWS, Google Cloud Platform, or Azure is a plus.
Design, develop, and maintain robust data pipelines and ETL processes using Python.
Extract, transform, and load data from various sources into our data warehouse.
Work with large volumes of structured and semi-structured data (e.g., JSON).
Optimize and write complex SQL queries for data analysis and reporting.
Perform data validation and ensure data integrity across systems.
Utilize NumPy and Python libraries for data wrangling and computation.
Collaborate with data scientists, analysts, and other engineers to support data needs.