Apply Now

Data Quality Manager

Remote • Posted 1 day ago • Updated 1 day ago

Contract Corp To Corp

Contract W2

Contract Independent

6 Months

No Travel Required

Remote

Depends on Experience

Fitment

Dice Job Match Score™

📊 Calculating match score...

Job Details

Skills

Data Quality Manager

Summary

JOB DESCRIPTION

Technical Lead - Azure Databricks

Remote Contractor | Legal Industry | Hourly Rate

Role Summary

The Technical Lead is the senior-most technical authority overseeing 7+ Senior Data Engineers designing and delivering enterprise-grade Azure Databricks solutions for the legal industry. Accountable for solution architecture, code quality, ingestion pipeline standards across three patterns (on-premises SQL Server, API, and flat file), and overall engineering excellence. Reports to the Senior Engineering Manager.

Engagement Details

Role Title	Technical Lead - Azure Databricks
Engagement Type	Remote Contractor, Hourly Rate
Work Location	Fully Remote
Industry	Legal
Team Size	7+ Senior Data Engineers
Reports To	Senior Engineering Manager

Duties and Responsibilities

Technical Leadership & Architecture

Serve as the primary technical authority for Azure Databricks solution design, architecture decisions, and implementation standards.
Define and enforce Medallion/Lakehouse (Bronze/Silver/Gold) architecture patterns, Delta Lake schema design, Unity Catalog governance, and cluster configuration best practices.
Drive design reviews and technical trade-off analyses, ensuring alignment with client objectives, timelines, and compliance requirements.

Team Leadership & People Management

Lead 7+ Senior Data Engineers with day-to-day technical direction, prioritization, and removal of blockers.
Conduct rigorous code reviews ensuring adherence to coding standards, performance best practices, and maintainability guidelines.
Mentor senior engineers on Databricks features, python/spark optimization, data modeling, and software engineering excellence.
Collaborate with the Senior Engineering Manager on effort estimates, risk identification, sprint planning, and stakeholder communications.

Ingestion Pipeline Design & Delivery

Architect and oversee delivery of all ingestion pipelines across three core patterns: on-premises SQL Server (JDBC/linked services, CDC), REST/SOAP API integrations (pagination, auth, rate-limiting), and flat file ingestion (CSV, JSON, XML, SFTP/blob-based landing zones).
Establish reusable, metadata-driven ingestion frameworks in PySpark and Python that support all three patterns with consistent error handling, logging, and retry logic.
Ensure robust integration with ADLS Gen2, Azure Key Vault, and Azure DevOps across all ingestion workflows.
Champion Delta Lake best practices: schema evolution, incremental loads, partition strategies, and optimize/ZORDER tuning.
Lead CI/CD pipeline implementation, automated testing, and security scanning for reliable, repeatable deployments.

Data Quality, Governance & Compliance

Establish data quality frameworks (Great Expectations or DQX) covering validation, automated monitoring, SLA alerting, and lineage tracking across all ingestion patterns.
Lead Unity Catalog adoption including workspace access controls, column-level security, and audit logging; ensure compliance with legal data confidentiality and privilege-protection requirements.

Performance, Optimization & Troubleshooting

Own platform performance: diagnose and resolve Spark, Delta Lake, and cluster issues; optimize job runtimes, resource utilization, and cost efficiency.
Set monitoring and observability standards using Databricks system tables and Azure Monitor; serve as escalation point for critical production issues.

Education and/or Experience

Required

Bachelor's degree in Computer Science, Engineering, Data Science, or a related field.
8+ years of hands-on data engineering experience including large-scale ETL/ELT and Lakehouse platform delivery.
3+ years on Databricks including production deployments, cluster management, and enterprise integrations.
Demonstrated experience technically leading teams of senior engineers, driving architecture direction and code quality.
Deep expertise in Databricks components: Delta Lake, Databricks SQL, Apache Spark, Unity Catalog, Databricks Workflows, and Notebooks.
Proven experience designing and building ingestion pipelines across all three patterns: on-premises SQL Server (JDBC, CDC), REST/SOAP APIs (auth flows, pagination, rate-limiting), and flat files (CSV, JSON, XML via blob storage or SFTP).
Advanced proficiency in PySpark, Spark SQL, and Python; working knowledge of Scala.
Strong Azure ecosystem experience: ADLS Gen2, Azure Data Factory, Azure DevOps, Azure Key Vault, and Azure Monitor.
Proven track record establishing engineering best practices: code reviews, CI/CD, test automation, and documentation standards.
Strong data governance and compliance understanding including RBAC, data classification, and legal data confidentiality requirements.
Excellent communication skills with ability to engage both engineering teams and non-technical legal stakeholders.

Preferred

"Proficiency with VS Code for development workflows including Python, Databricks Connect, and remote development extensions; familiarity with Claude Code for AI-assisted coding, code review support, and accelerating engineering productivity."
Databricks Certified Data Engineer Professional or Microsoft Certified: Azure Data Engineer Associate or equivalent.
Master's degree in Computer Science, Engineering, or a related field.
Experience with data quality frameworks such as Great Expectations or Databricks DQX for automated validation and expectation suites within Databricks pipelines.
Hands-on experience with Infrastructure as Code (IaC) using Terraform or Bicep for Azure resource provisioning.
Familiarity with enterprise data modeling tools (e.g., ERwin) for logical and physical data model interpretation.
Experience migrating legacy on-premises data warehouses or SQL Server environments to Azure Databricks Lakehouse architectures.
Exposure to ML workloads on Databricks (MLflow, Feature Store) and feature engineering pipeline collaboration.
Familiarity with Agile/Scrum delivery using Azure DevOps Boards or Jira.

This job description outlines the general nature and level of work expected. It is not exhaustive and may be updated as business needs evolve.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 91123676
Position Id: 15003-1598-
Posted 1 day ago

Contact the job poster

Gopi Krishna Kancharla

Recruiter @ Robustware

View Profile

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Databricks SME - W2 only

Remote

•

Today

Key Responsibilities Champion Best Practices: Establish, document, and promote best-in-class approaches for data architecture, integration, and modelling. Pipeline Ownership: Oversee the design, development, and maintenance of robust data pipelines and data architectures that support large-scale, enterprise data needs. Drive Excellence: Initiate and manage efforts to improve data quality, operational efficiency, and process scalability. Data Governance: Consult on, design, and implement gove

Easy Apply

Contract

Depends on Experience

Data Engineer (Databricks | Healthcare Analytics)

Remote

•

23d ago

Data Engineer (Databricks | Healthcare Analytics)The Data Engineer is responsible for designing, building, and maintaining scalable data pipelines and data platforms that support enterprise analytics, business intelligence, and clinical reporting. This role plays a critical part in enabling data-driven decision-making by integrating data from Electronic Medical Record (EMR) systems and other healthcare sources into a modern data ecosystem. The ideal candidate has strong experience withDatabricks

Easy Apply

Third Party, Contract

60 - 70

Senior azure data Engineer

Remote

•

15d ago

Key ResponsibilitiesDesign, build, and orchestrate scalabledata pipelinesusing Azure-native services.Develop and optimizebatch and near real-time data processing solutionsusing Spark and Databricks.Implement and manageenterprise-grade Lakehouse architectures(Medallion architecture).Performdata processing, validation, and automationusing Python (PySpark, Pandas, NumPy).Optimize performance of large-scale data systems, includingSpark tuning and query optimization.Work withCDC (Change Data Capture)

Easy Apply

Contract

45 - 50

Sr. Data Scientist / Engineer

Remote

•

20d ago

Due to the nature of the work, Our Client is unable to work with candidates on H1-B visa or EAD. Our Client is looking for someone who blends data science + data engieering. Youll architect and build pipelines in Databricks, design data lake systems, and dig into data to find trends, outliers, and actual causal insights. As an Applied Data Scientist, you will bridge the gap between raw data and actionable insights. You are responsible for building robust data pipelines in Databricks and applyin

Easy Apply

Contract

75 - 90

Search all similar jobs