Overview
On Site
Depends on Experience
Full Time
100% Travel
Skills
Machine Learning (ML)
PySpark
Data Visualization
Data Warehouse
Google Cloud Platform
Job Details
- Design data solutions in the cloud or on premises, using the latest data services, products, technology, and industry best practices
- Experience migrating legacy data environments with a focus performance and reliability
- Data Architecture contributions include assessing and understanding data sources, data models and schemas, and data workflows
- Ability to assess, understand, and design ETL jobs, data pipelines, and workflows
- BI and Data Visualization include assessing, understanding, and designing reports, creating dynamic dashboards, and setting up data pipelines in support of dashboards and reports
- Data Science focus on designing machine learning, AI applications, MLOps pipelines
- Addressing technical inquiries concerning customization, integration, enterprise architecture and general feature / functionality of data products
- Experience in crafting data lake house solutions in Google Cloud Platform. This includes relational & vector databases, data warehouses, data lakes, and distributed data systems.
- Must have PySpark API processing knowledge utilizing resilient distributed datasets (RDSS) and data framesSkills Required:
- Design data solutions in the cloud or on premises, using the latest data services, products, technology, and industry best practices
- Experience migrating legacy data environments with a focus performance and reliability
- Data Architecture contributions include assessing and understanding data sources, data models and schemas, and data workflows
- Ability to assess, understand, and design ETL jobs, data pipelines, and workflows
- BI and Data Visualization include assessing, understanding, and designing reports, creating dynamic dashboards, and setting up data pipelines in support of dashboards and reports
- Data Science focus on designing machine learning, AI applications, MLOps pipelines
- Addressing technical inquiries concerning customization, integration, enterprise architecture and general feature / functionality of data products
- Experience in crafting data lake house solutions in Google Cloud Platform. This includes relational & vector databases, data warehouses, data lakes, and distributed data systems.
- Must have PySpark API processing knowledge utilizing resilient distributed datasets (RDSS) and data framesSkills
Preferred:
- Ability to write bash, python and groovy scripts to help configure and administer tools
- Experience installing applications on VMs, monitoring performance, and tailing logs on Unix
- PostgreSQL Database administration skills are preferred
- Python experience and experience developing REST APIs
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.