Dataflux Architect

Remote • Posted 2 hours ago • Updated 17 minutes ago
Contract Independent
Contract W2
Contract Corp To Corp
Remote
Company Branding Image
Fitment

Dice Job Match Score™

🤯 Applying directly to the forehead...

Job Details

Skills

  • Pyspark
  • GAP Analysis
  • Mentoring
  • Amazon Web Services
  • databricks
  • Data Warehousing
  • Data Quality
  • Data Architecture
  • Data Governance
  • Microsoft Azure
  • Enterprise Architecture
  • Data Modelling
  • Architecture
  • Infrastructure Management
  • Parsing
  • GDPR
  • Consulting
  • workflows
  • SQL databases
  • Reference Data
  • Leadership
  • Safety Principles
  • Solution Architecture
  • Knowledge of Finance
  • Architectural Design
  • Data Lakes
  • Health Care
  • Health Insurance Portability and Accountability Act Compliance
  • Stock Control
  • Apache Hive
  • Tooling Assembly and Dismantling
  • Role-Based Access Control
  • Sarbanes-Oxley Act (SOX) Compliance
  • Client-facing
  • Blueprinting
  • Data Vault Modelling
  • Control Framework
  • Meta-Data Management
  • DataFlux
  • IBM InfoSphere (ETL Tools)
  • Phonetics
  • Reverse Engineering
  • Offshore Geotechnical Engineering
  • Professional Services

Summary

Title: Dataflux Architect
Location: NYC, NY(Remote)

Job Type: Contract

Job Description :

Architect (ATC)Engagement OverviewOur client is undertaking a strategic modernization initiative to migrate their enterprise data quality, MDM, anddata integration workloads from SAS DataFlux (dfPower Studio, Data Management Studio, and the DataFlux DataManagement Server) to the Databricks Lakehouse Platform.

We are seeking a Senior Onshore DataFlux SolutionArchitect to lead the architectural strategy, target-state design, and migration blueprint for this multi-phaseprogram.

This role is a hands-on, client-facing leadership position responsible for translating legacy DataFluxlogic, business rules, and MDM constructs into a modern, scalable Databricks-native architecture leveraging DeltaLake, Unity Catalog, and Delta Live Tables.

Key Responsibilities

Lead end-to-end solution architecture for the DataFlux to Databricks migration, including current-stateassessment, gap analysis, target-state design, and migration roadmap.

Reverse-engineer and document existing DataFlux jobs, data services, business rules, QKBs (QualityKnowledge Bases), and MDM hub configurations to produce a complete logical inventory.

Design the target Databricks Lakehouse architecture (medallion: bronze/silver/gold) with Delta Lake, UnityCatalog governance, and Delta Live Tables pipelines that replicate or improve upon DataFlux DQ and MDMfunctionality.

Define the strategy for migrating standardization, parsing, matching, clustering, and survivorship logic from DataFlux into Databricks-native patterns

(PySpark, SQL, and partner tools such as Reltio, Informatica CDQ,or Zingg where appropriate).

Architect MDM target-state for party, product, location, and reference data domains; define golden recordlogic, hierarchy management, and stewardship workflows on the Lakehouse.

Establish data quality frameworks (DQ rules, scorecards, exception handling) using Delta Live Tablesexpectations, Great Expectations, or Databricks Lakehouse Monitoring as DataFlux replacements.

Partner with the client's enterprise architecture, data governance, and security teams to align on UnityCatalog design, lineage, RBAC, and PII handling.

Provide technical leadership and mentorship to a blended onshore/offshore engineering team; conduct designreviews and enforce engineering standards.

Serve as the senior client-facing technical advisor - present architecture decisions, trade-offs, and migrationprogress to Director and VP-level stakeholders.

Own technical risk identification and mitigation across the migration lifecycle, including cutover strategy,parallel run validation, and decommissioning of DataFlux infrastructure.

Required Qualifications

DataFlux Expertise (Non-Negotiable)

10+ years of enterprise data architecture experience, with a minimum of 5 years of hands-on experiencedesigning and deploying solutions on SAS DataFlux (dfPower Studio and/or Data Management Studio).

Deep working knowledge of DataFlux Data Management Server, Architect jobs, Profile jobs, data services,and the QKB (Quality Knowledge Base) - including authoring custom definitions, regex libraries, phonetics,and locale-specific rules.

Demonstrated experience with DataFlux match codes, clustering, entity resolution, and survivorship rule design at enterprise scale.

Proven ability to reverse-engineer complex, undocumented DataFlux job flows and translate them intomodern equivalents.Master Data Management (MDM)

Strong architectural experience across MDM domains - Customer/Party, Product, Location, Vendor, Employee, and Reference Data.

Hands-on experience with at least one enterprise MDM platform in addition to DataFlux: Informatica MDM,Reltio, Profisee, IBM InfoSphere MDM, or Stibo STEP.

Expertise in match/merge logic, golden record creation, hierarchy management, cross-reference (XREF)design, and data stewardship workflows.Databricks & Modern Data Stack

Production experience architecting solutions on Databricks, including Delta Lake, Unity Catalog, Delta LiveTables, Workflows, and the medallion architecture pattern.

Strong PySpark and Spark SQL skills; able to design performant patterns for large-scale matching, deduplication, and DQ workloads.

Working knowledge of cloud platforms (Azure, AWS, or Google Cloud Platform) and modern ingestion tools (Fivetran, ADF,Airflow, dbt).Data Domains & Governance

Broad fluency across data quality, data governance, data modeling (3NF, dimensional, Data Vault), and metadata management.

Experience implementing data governance tooling (Collibra, Alation, Atlan, or Unity Catalog-native governance).

Familiarity with regulatory and privacy frameworks (HIPAA, GDPR, CCPA, SOX) and their impact on MDM and DQ design.

Preferred Qualifications

Prior experience leading at least one DataFlux modernization or sunset program.

Databricks certifications (Data Engineer Professional, Solutions Architect Professional).

Experience in healthcare payer, financial services, or insurance verticals.

Background in consulting or professional services - comfortable with SOW-driven delivery and billableutilization expectations.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 91121487
  • Position Id: 2026-2374
  • Posted 2 hours ago

Company Info

About Edge Global

Edge Global is currently accepting resumes for a variety of positions. Please review the database of positions that we are seeking to fill and contact us for additional information about any specific opportunity.

Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

It looks like there aren't any Similar Jobs for this job yet.

Search all similar jobs