Apply Now

Data Engineering Lead

Hybrid in Pittsburgh, PA, US • Posted 5 hours ago • Updated 5 hours ago

Full Time

No Travel Required

Hybrid

Depends on Experience

Fitment

Dice Job Match Score™

📊 Calculating match score...

Job Details

Skills

Python

Summary

Job Title: Data Engineering Lead
Location: Pittsburgh, PA / Dallas, TX / Cleveland, OH (Hybrid)
Role Overview
We are seeking an experienced Data Engineering Lead to own the design, development, and delivery of enterprise-scale data pipelines and platforms within a large financial services environment. This role combines deep hands-on engineering expertise with strong project leadership capabilities — you will drive end-to-end delivery of data engineering workstreams while serving as the primary point of contact for business stakeholders, BSAs, product owners, and cross-functional delivery teams.
You will lead a team of data engineers and QA analysts, manage delivery timelines, govern quality standards, and translate complex business requirements into scalable technical solutions built on PySpark, Informatica IDMC, SQL, Hadoop, and Python.
Key Responsibilities
Data Pipeline Design & Engineering
• Design, develop, and optimize large-scale batch and near-real-time data pipelines using PySpark and Spark on OCP/Kubernetes in a Hadoop ecosystem.
• Build and maintain robust ETL/ELT workflows using Informatica IDMC (mappings, mapping tasks, taskflows, DQ rules) aligned to enterprise data standards.
• Develop reusable Python-based transformation utilities, data quality frameworks, and automation scripts to accelerate pipeline delivery.
• Write complex SQL for data transformations, validation, reconciliation, and performance tuning across Teradata, Hive, and ANSI-compliant databases.
• Implement medallion architecture patterns (Bronze / Silver / Gold) ensuring traceability, quality, and auditability at each layer.
• Integrate data pipelines with enterprise platforms including Kafka event streams, REST APIs, and file-based ingestion channels.
Project Leadership & Delivery Management
• Lead end-to-end delivery of data engineering workstreams from requirements intake through production deployment, managing scope, timelines, and risk.
• Own sprint planning, backlog grooming, and delivery governance in an Agile/Scrum model — coordinating onshore and offshore team members.
• Maintain and enforce delivery checklists, change request (CR) governance, and release cadence standards (e.g., bi-weekly release cycles, 10-day CR windows).
• Proactively identify delivery blockers, escalate risks, and drive resolution across engineering, QA, platform, and business stakeholders.
• Prepare and present delivery status, pipeline architecture overviews, and milestone updates to senior leadership and program sponsors.
Stakeholder Engagement & Business Partnership
• Serve as the primary technical liaison for Product Owners, BSAs, and business domain leads — translating requirements into technical designs and driving sign-off.
• Collaborate with data architects to ensure pipeline implementations align with enterprise architecture standards, data contracts, and governance policies.
• Facilitate data requirement workshops, technical walkthroughs, and design reviews with both technical and non-technical audiences.
• Communicate pipeline health, data quality metrics, and operational issues clearly to stakeholders across business and technology functions.
• Partner with data governance, security, and risk teams to ensure regulatory compliance (e.g., BCBS 239, data lineage, audit trails).
Team Leadership & Mentorship
• Lead a team of data engineers (onshore and offshore) and QA analysts, providing technical direction, code reviews, and hands-on mentorship.
• Define and enforce engineering standards — coding conventions, unit test coverage, CI/CD integration, and documentation practices.
• Drive a culture of quality and continuous improvement through retrospectives, root-cause analysis, and iterative process refinement.
• Support team capacity planning, onboarding, and skill development aligned to the project technology stack.
• Coordinate QA lead and QA team activities to ensure comprehensive test coverage, defect triage, and UAT readiness.
Data Quality, Observability & Governance
• Embed data quality checks (Great Expectations or equivalent) at ingestion, transformation, and output layers of all pipelines.
• Implement lineage tracking and metadata cataloging (e.g., Alation) to support governance and auditability requirements.
• Monitor pipeline health using observability tooling, define SLAs for data freshness and quality, and manage incident resolution.
• Enforce data access controls and masking/tokenization standards in collaboration with security and compliance teams (e.g., Protegrity).
Required Qualifications
• 8+ years of experience in data engineering, with at least 3 years in a lead or senior technical role.
• Proven track record delivering complex data pipeline projects in financial services or similarly regulated industries.
• Strong leadership and communication skills with demonstrated experience managing cross-functional delivery teams.
• Experience working directly with BSAs, product owners, and business stakeholders in an Agile delivery model.
• Bachelor''s degree in Computer Science, Engineering, Information Systems, or a related field.
Technical Skills
Category Technologies & Skills
Core Processing PySpark, Apache Spark (Standalone, OCP/Kubernetes), Spark Streaming, Spark SQL
ETL & Integration Informatica IDMC (mappings, taskflows, DQ rules), REST API ingestion, Kafka-based pipelines
Languages Python (pandas, PySpark, automation scripting), SQL (Teradata, Hive, ANSI SQL)
Data Platforms Hadoop (HDFS, YARN, Hive), Teradata, Hive Metastore, lakehouse architectures
Data Quality & Governance Great Expectations (or equivalent), Alation, Protegrity, data lineage, metadata management
DevOps & Delivery Git, CI/CD pipelines, Agile/Scrum, JIRA, release governance, CR management
Observability Pipeline monitoring, ELK stack (preferred), alerting and SLA management
Visualization & Reporting Executive status reporting, architecture diagrams (draw.io), delivery dashboards
Preferred Qualifications
• Banking or financial services experience (risk data, core banking, deposits, transactions, regulatory reporting).
• Familiarity with BCBS 239 or similar regulatory data compliance frameworks.
• Experience with graph databases (Neo4j) for relationship-centric data modeling.
• Exposure to cloud platforms (Azure, AWS, or Google Cloud Platform) and hybrid on-prem/cloud architectures.
• Experience with data mesh or domain-oriented operating models.
• Knowledge of data catalog and data lineage tools (Alation, Collibra, or similar).
• PMP, PMI-ACP, or equivalent project/program management certification.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 91093425
Position Id: 8989740
Posted 5 hours ago

Contact the job poster

Dhruba Bhusal

Recruiter @ TUPPL Technology Inc

View Profile

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Manager - Data Engineering

Pittsburgh, Pennsylvania

•

Today

Job Description Title: Manager - Data Engineering Reports To: Director - Data Engineering Location: Pittsburgh, PA or New York, NY American Eagle is a youth culture brand grounded in denim. Our purpose extends beyond making the best jeans-we embrace self expression, culture, optimism and connection. Through the brand platform Live Your Life, we empower our community to be who they want to be all while wearing the clothes that make them most confident. Get to Know the Role: We are seeking

Full-time

Azure Data Engineer

Hybrid in Pittsburgh, Pennsylvania

•

7d ago

This role is Direct Hire on W2, no C2C or third party candidates The Data/ Cloud Platform Manager is a senior technical leadership role responsible for designing, building, optimizing, and managing scalable enterprise data platforms within a cloud-based Azure ecosystem. This position plays a critical role in driving enterprise data engineering, analytics, machine learning enablement, and business intelligence initiatives through the development of modern Lakehouse architectures and high-performa

Easy Apply

Full-time

Depends on Experience

Data Engineer (Data Pipelines & Modeling)

Warrendale, Pennsylvania

•

Today

Responsibilities: Design and implement robust data ingestion pipelines from multiple sources (APIs, databases, files, streaming systems). Support C4C offline database migration, ensuring data accuracy and consistency. Integrate data from enterprise systems into centralized data platforms. Design and implement data models for Workforce planning. Service operations forecasting. Develop optimized schemas for reporting and analytics. Ensure data quality, integrity, and consistency across models. Req

Full-time

Staff Tech Lead Manager, ML Data Services

Pittsburgh, Pennsylvania

•

Today

The ML Data Service team is seeking a highly experienced and motivated Staff Tech Lead Manager (TLM) to lead the development and operation of our core machine learning data infrastructure. This critical role requires a blend of deep technical expertise in machine learning systems, large-scale data processing, and proven leadership ability to manage both individual contributors and technical direction. The ML Data Service team is responsible for providing reliable, high-quality, and easily acces

Full-time

USD 172,000.00 - 229,000.00 per year

Search all similar jobs