Data Engineering Lead

Westlake, TX, US • Posted 30+ days ago • Updated 2 hours ago
Full Time
On-site
USD $157,000.00 - 205,000.00 per year
Fitment

Dice Job Match Score™

👤 Reviewing your profile...

Job Details

Skills

  • Creative Problem Solving
  • Finance
  • Mutual Funds
  • Regulatory Compliance
  • EAGLE
  • Asset Management
  • Software Asset Management
  • Collaboration
  • ELT
  • Scalability
  • Advanced Analytics
  • Generative Artificial Intelligence (AI)
  • Use Cases
  • Data Architecture
  • Workflow
  • FOCUS
  • Data Modeling
  • Securities
  • Reference Data
  • Access Control
  • Change Management
  • Internal Communications
  • IC
  • Integrated Circuit
  • Design Review
  • Decision-making
  • Mentorship
  • Computer Science
  • Data Engineering
  • Stacks Blockchain
  • Google Cloud Platform
  • Google Cloud
  • Amazon Web Services
  • Cloud Computing
  • Data Warehouse
  • Snow Flake Schema
  • Apache Spark
  • Extract
  • Transform
  • Load
  • Orchestration
  • Apache Airflow
  • Semantics
  • SQL
  • Python
  • Modeling
  • Investments
  • Data Quality
  • Data Governance
  • Meta-data Management
  • Management
  • Analytics
  • Business Intelligence
  • Artificial Intelligence
  • Machine Learning (ML)
  • IT Management

Summary

Your Opportunity

At Schwab, you're empowered to make an impact on your career. Here, innovative thought meets creative problem solving, helping us "challenge the status quo" and transform the finance industry together.

Schwab Asset Management (SAM) is a leading asset manager supporting mutual funds, ETFs, and managed account products governed under stringent regulatory and compliance requirements. SAM operates in a multi-cloud, multi-custodian, multi-vendor ecosystem, relying on a diverse set of external platforms such as Vestmark, Aladdin, Eagle, and others to serve its investment, operational, and regulatory functions.

We are seeking a Lead Data Engineer to drive the design and development of the cloud-native Data Platform for Schwab Asset Management (SAM). In this role, you will design and deliver end-to-end data solutions, not just pipelines-spanning raw data ingestion, curated data layers, enterprise data hubs, and the APIs and services that power downstream applications and analytics. You will work across a modern cloud data stack built on Snowflake and Google Cloud Platform (Google Cloud Platform to build scalable, resilient, and reusable platform capabilities.

Key Responsibilities:

Cloud-Native Data Engineering & Data Warehousing

Design, build, and operate cloud-native data pipelines using Google Cloud Platform and/or AWS.

Lead development of scalable ELT/ETL workflows supporting investment, operational, regulatory, and analytics use cases.

Serve as a Snowflake subject-matter expert, designing advanced data models, transformations, and performance-optimized workloads.

Engineer and curate data within cloud data warehouses and cloud-native data platforms, ensuring data is analytics-ready and AI-ready.

Design data hubs and domain data products that serve as authoritative sources for shared datasets, reducing duplication and ensuring consistent enterprise-wide data usage.

Optimize data solutions for performance, scalability, reliability, and cost efficiency.

Modern Data Architecture

Design and implement medallion data architectures (Bronze / Silver / Gold).

Build and evolve semantic data layers that provide consistent, reusable business metrics.

Design and curate AI-ready datasets to support advanced analytics, machine learning, and generative-AI use cases.

Leverage Snowflake's AI capabilities, including Snowflake Cortex and native Snowflake AI solutions, as part of the modern data architecture to enable intelligent data access, enrichment, and downstream AI workflows.

Ensure architectural alignment between curated data, semantic layers, and AI-enabled consumption patterns.

Data Modeling, Quality & Governance (Investment Domain Focus)

Lead complex data-modeling efforts across investment domains, including holdings, positions, transactions, securities, portfolios, benchmarks, performance, and reference data.

Apply investment domain knowledge to ensure models accurately represent real-world investment behavior and lifecycle events.

Define, implement, and enforce data quality standards, including validation rules, completeness checks, reconciliations, and anomaly detection.

Apply data governance principles, including metadata management, lineage, access controls, and policy enforcement.

Design and implement data contracts to define schema expectations, ownership, SLAs, and change-management between data producers and consumers.

Technical Leadership (IC Role)

Act as a technical lead for complex data-engineering initiatives and investment-domain data products.

Drive architecture discussions, design reviews, and technical decision-making.

Mentor junior and mid-level engineers through code reviews and technical guidance.

Partner closely with platform engineering, architecture, analytics, and business stakeholders.

What you have

Required Qualifications:

  • Bachelor's degree in computer science, Engineering, or related field (or equivalent practical experience).
  • 6-8+ years of experience in cloud-native data engineering.
  • Strong experience working on modern cloud data stacks using Google Cloud Platform and/or AWS.
  • Deep, hands-on experience with cloud data warehouses (Snowflake preferred) and Apache Spark based data pipeline development
  • Strong experience in data pipeline orchestration leveraging platforms like Apache Airflow
  • Proven experience designing and delivering:
  • Medallion data architectures
  • Semantic data layers
  • Analytics-ready and AI-ready datasets
  • Expert-level SQL and strong Python skills.
  • Ability to operate independently and lead technically without formal authority.

Preferred Qualifications:
  • Hands-on experience modeling investment data domains and building curated Investments data products for consumption across Investments management business functions.
  • Designing and enforcing data quality frameworks at scale.
  • Implementing data governance capabilities, including metadata, lineage, and controlled access.
  • Defining and managing data contracts between upstream producers and downstream consumers.
  • Supporting analytics, BI, and AI / ML workloads.
  • Acting as a technical lead on complex data initiatives.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 90989465
  • Position Id: 21279d3ae173a7d0dbf8d608940ec3a9
  • Posted 30+ days ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Southlake, Texas

Today

Full-time

USD 145,000.00 - 165,000.00 per year

Westlake, Texas

Today

Full-time

USD 92,872.00 - 127,900.00 per year

Southlake, Texas

Today

Full-time

USD 125,000.00 - 140,000.00 per year

Hybrid in Coppell, Texas

Today

Full-time

Search all similar jobs