MPI Fabric Data And ML Engineer

Overview

Remote
Depends on Experience
Contract - W2

Skills

Data Lakehouse
Microsoft Fabric
Synapse
MLOPS
Python
Pyspark
CRM
Finance
Azure ML

Job Details

Job Role: MPI Fabric Data & ML Engineer Location: 100% Remote
Duration: 6+ Months Contract

Summary:
Hiring a Data & ML Engineer to support the person matching and identity resolution workflows of the MPI initiative, leveraging Microsoft Fabric, Synapse, and ML capabilities. This role involves creating data pipelines, cleansing and linking records, and operationalizing ML-based entity resolution models.

Key Responsibilities:

  • Build data pipelines and ML workflows within Microsoft Fabric for entity matching and deduplication across data domains.
  • Implement and optimize MLOps pipelines (training, scoring, and retraining).
  • Integrate data from multiple sources: CRM, EHRs, finance, HR, etc.
  • Develop reusable modules for fuzzy matching, rule-based, and ML-based identity resolution.
  • Collaborate with data scientists and SMEs to operationalize models using SynapseML, PySpark, or Azure ML.

Required Experience:

  • 5+ years of experience in data engineering and machine learning in the Azure ecosystem.
  • Proficient with Microsoft Fabric (Lakehouse, Pipelines, Notebooks), Synapse, and Azure ML.
  • Solid understanding of identity resolution techniques, especially ML-based approaches.
  • Strong programming skills in Python and PySpark.
  • Familiarity with data privacy, governance, and ethics in ML.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.