Overview
Hybrid
Up to $64
Contract - W2
Contract - 12 Month(s)
Skills
Python
PySpark
GCP
ETL
BI
Job Details
Lead Data Platform Architect
Hybrid onsite in Dearborn, MI: 2-3 days per week onsite, increasing to 4 days per week in September.
12-month contract.
Job Description:
Skills Required:
- Design data solutions in the cloud or on premises, using the latest data services, products, technology, and industry best practices
- Experience migrating legacy data environments with a focus on performance and reliability
- Data Architecture contributions include assessing and understanding data sources, data models and schemas, and data workflows
- Ability to assess, understand, and design ETL jobs, data pipelines, and workflows
- BI and Data Visualization responsibilities include assessing, understanding, and designing reports, creating dynamic dashboards, and setting up data pipelines in support of dashboards and reports
- Data Science responsibilities focus on designing machine learning and AI applications and MLOps pipelines
- Addressing technical inquiries concerning customization, integration, enterprise architecture, and general features/functionality of data products
- Experience crafting data lakehouse solutions in Google Cloud Platform, including relational and vector databases, data warehouses, data lakes, and distributed data systems
- Must have PySpark API processing knowledge, utilizing resilient distributed datasets (RDDs) and DataFrames
Skills Preferred:
- Ability to write Bash, Python, and Groovy scripts to help configure and administer tools
- Experience installing applications on VMs, monitoring performance, and tailing logs on Unix
- PostgreSQL database administration skills
- Experience with Python and with developing REST APIs
Experience Required:
- 10+ years
Education Required:
- Bachelor's degree in Computer Science or Computer Information Systems, or equivalent experience
Education Preferred:
- Master's degree in Data Science preferred