Centralizing data pipeline management with Matillion Hub means using the Data Productivity Cloud to unify ETL/ELT tasks, visibility, and security in a single, browser-based interface.
The Hub acts as a central control plane, connecting to various data sources and cloud data platforms (Snowflake, Databricks, Redshift) while using agents for execution.
Here are the key requirements, components, and steps to centralize management:
Centralized Infrastructure (Control Plane)
- Deployment Options: Choose between full SaaS (Matillion hosts and manages infrastructure) or hybrid-SaaS (you deploy and manage agents in your private cloud).
- Agent Management: Use the Hub to manage, monitor, and configure data plane agents that run your pipelines.
- Git Integration: Enable Git features within the Hub to manage version control, allowing teams to contribute, push transformations, and manage data products centrally.
Data Source and Destination Integration
Operational Requirements (Management)
Development & Transformation
- Low-Code/High-Code Interface: Use the Designer interface within the Hub for visual, low-code transformation, or incorporate Python/SQL scripting where code is a better fit.
- DataOps Enabled: Use Matillion's Data Pipeline Language (DPL) to automate the creation of folders, projects, and pipelines, and integrate with tools such as Azure DevOps.
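
The automation idea in the DataOps item can be sketched in Python. This is a minimal illustration, not Matillion's API or the DPL schema: the `LAYOUT` mapping, the `scaffold` function, the project, folder, and pipeline names, and the `.yaml` placeholder files are all hypothetical. It only shows how a script run from a CI tool such as Azure DevOps might create a consistent folder and pipeline layout in a Git working tree before the real pipeline definitions are authored.

```python
import tempfile
from pathlib import Path

# Hypothetical layout: project -> folder -> pipeline names.
# None of these names come from Matillion; they only illustrate the idea.
LAYOUT = {
    "sales_analytics": {
        "ingest": ["load_orders", "load_customers"],
        "transform": ["build_daily_revenue"],
    },
}


def scaffold(root: Path, layout: dict) -> list[Path]:
    """Create project/folder directories and one placeholder file per pipeline.

    Returns the files that were newly created, so a second run is a no-op.
    """
    created = []
    for project, folders in layout.items():
        for folder, pipelines in folders.items():
            target = root / project / folder
            target.mkdir(parents=True, exist_ok=True)
            for name in pipelines:
                path = target / f"{name}.yaml"
                if not path.exists():
                    # Placeholder only: real pipeline content would be
                    # authored in Designer or written in Matillion's format.
                    path.write_text(f"# placeholder for pipeline '{name}'\n")
                    created.append(path)
    return created


if __name__ == "__main__":
    # Use a temporary directory so the sketch leaves nothing behind.
    with tempfile.TemporaryDirectory() as tmp:
        for path in scaffold(Path(tmp), LAYOUT):
            print(path.relative_to(tmp))
```

Because `scaffold` skips files that already exist, it can be rerun safely on every CI build; in practice the root would be the checked-out repository rather than a temporary directory.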