Role: Data Engineer with Google Cloud Platform
Location: Dallas, TX
Exp: 10+
A Data Engineer specializing in Dataiku DSS and Google Cloud Platform builds scalable data pipelines, integrates cloud data services, and enables analytics and machine learning solutions. This role bridges visual data preparation (Dataiku) with cloud-native engineering (Google Cloud Platform).
⸻
Key Responsibilities
• Design and build end-to-end data pipelines using Dataiku and Google Cloud Platform services
• Develop and maintain datasets, flows, and recipes in Dataiku
• Ingest and process structured & unstructured data from multiple sources
• Build scalable pipelines using Google Cloud Platform tools like:
• BigQuery (data warehousing)
• Cloud Storage (data lake)
• Dataflow (ETL pipelines)
• Pub/Sub (real-time ingestion)
• Optimize SQL queries and data workflows for performance
• Automate pipelines using Dataiku scenarios and Google Cloud Platform scheduling tools
• Ensure data quality, governance, and security compliance
• Collaborate with data scientists for ML model deployment
• Monitor, debug, and troubleshoot data pipelines
⸻
Required Skills
Core Dataiku Skills
• Strong experience with Dataiku DSS
• Building:
• Flows, Datasets, Recipes (Visual + Code)
• Scenario automation and job orchestration
• Managing connections (especially Google Cloud Platform integrations)