Requirements
5+ years of data engineering or MDM technical experience; 2+ years working with an enterprise MDM platform in production.
Hands-on experience with Tamr (strongly preferred) or a comparable ML-driven MDM platform (Reltio, Informatica MDM Multidomain, or equivalent).
Proficiency in Python for data pipeline development; experience with REST API integration and JSON data.
Experience building and maintaining data integration pipelines between source systems and a cloud data platform.
Understanding of entity resolution, deduplication, and record linkage concepts.
Strong documentation skills; able to produce architecture diagrams, runbooks, and integration specs.
Preferred Qualifications
Tamr platform certification or formal Tamr training.
Experience integrating MDM platforms with Azure Data Lakehouse or Databricks.
Familiarity with active learning and human-in-the-loop ML workflows.
Background in manufacturing, distribution, or field services industries with complex product and asset data.
Experience with multi-domain MDM relationship configuration (e.g., party-address-contact hierarchies).
Success Metrics
All 5 domains onboarded to Tamr with production-grade ingestion pipelines operational.
Match precision and recall within defined thresholds for each domain, with a documented baseline established.
Golden record output pipeline live and publishing to the lakehouse on defined SLA for all active domains.
ML model retraining process documented and operational; active learning workflow in place with Master Data Steward.
Platform uptime at 99.5%+ for production Tamr environment; zero critical incidents unresolved beyond SLA.
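As an illustration of the match precision and recall metric above, the following is a minimal sketch of how pairwise match quality could be checked against per-domain thresholds. It assumes matches are compared as unordered record-ID pairs against a steward-labeled sample. The function names, record IDs, and threshold values are hypothetical and are not part of Tamr or any other MDM platform's API.

```python
from typing import FrozenSet, Set, Tuple

# An unordered pair of record IDs that are claimed to be the same entity.
Pair = FrozenSet[str]


def pair(a: str, b: str) -> Pair:
    """Build an order-independent pair so (a, b) equals (b, a)."""
    return frozenset((a, b))


def precision_recall(predicted: Set[Pair], labeled: Set[Pair]) -> Tuple[float, float]:
    """Compute pairwise precision and recall.

    predicted: pairs the matching model linked together.
    labeled:   pairs a reviewer confirmed as true matches (assumed ground truth).
    """
    true_positives = len(predicted & labeled)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(labeled) if labeled else 0.0
    return precision, recall


def within_thresholds(precision: float, recall: float,
                      min_precision: float, min_recall: float) -> bool:
    """Return True when both metrics meet the (hypothetical) domain thresholds."""
    return precision >= min_precision and recall >= min_recall


# Hypothetical sample for one domain.
predicted_pairs = {pair("c1", "c2"), pair("c3", "c4"), pair("c5", "c6"), pair("c7", "c8")}
labeled_pairs = {pair("c2", "c1"), pair("c3", "c4"), pair("c5", "c6"),
                 pair("c9", "c10"), pair("c11", "c12")}

p, r = precision_recall(predicted_pairs, labeled_pairs)
print(f"precision={p:.2f} recall={r:.2f}")
print("meets thresholds:", within_thresholds(p, r, min_precision=0.70, min_recall=0.80))
```

In this sample, three of the four predicted pairs are correct and three of the five labeled pairs are found, so precision meets the illustrative 0.70 threshold while recall falls short of 0.80. Actual thresholds and the labeled sample would come from each domain's documented baseline.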