AI/ML Developer with Data Engineering

Hybrid in Philadelphia, PA, US • Posted 6 hours ago • Updated 6 hours ago
Full Time
No Travel Required
Hybrid
$80 - $90/hr
Fitment

Dice Job Match Score™

📋 Comparing job requirements...

Job Details

Skills

  • AI
  • ML
  • llm
  • data engineerking sql
  • unstructured data

Summary

AI/ML Engineer with strong Data engineering backround

Must have investment services, investment banking exposure

* If they can work 3 days in the office in Philadelphia, PA
* If they have 4-8 ye
ars of software development/engineering with AI and Data Engineering experience
* If they have worked in the investment management, investment banking area processing FINANCIAL MARKET DATA pipelines, RAG, Vector databases
* If they are fluent with Python and API development and streaming systems like Kafka or similar
*
Prefer people who have worked at BlackRock, Fidelity Investments, Vanugard, State Street Global Advisors, ETrade, Charles Schwab, etc.
Title - Senior Data Engineer / AI Engineer (Agentic AI Platform Financial Data) Location: On-site 2-3 days hybrid URGENT
Experience: 4 8+ years
Type: Contracting
About the Role RECRUITERS MUST RUN CHECKLISTS, KEYWORDS UNDERLINED
We are building a platform that converts unstructured financial data ( emails, corporate actions, index announcements ) into high-quality, structured datasets used by financial institutions.
This is not a typical LLM wrapper role.
You will work on systems that:

  • Extract data from noisy, inconsistent sources
  • Validate and reconcile outputs across multiple inputs
  • Ensure correctness, traceability, and auditability

The challenge is not just applying LLMs it s making them reliable in production for financial workflows.
What You ll Work On

  • Designing pipelines that process high-volume financial documents (batch + near real-time)
  • Building LLM-powered extraction workflows ( classification, parsing, summarization )
  • Implementing validation layers (rule-based + model-based) to reduce hallucinations
  • Developing retrieval systems using embeddings and vector search
  • Architecting end-to-end systems: ingestion processing storage serving
  • Ensuring data quality, observability, and fault tolerance
  • Collaborating with product to turn messy data into usable financial intelligence

Core Requirements

  • Strong Python and backend/data engineering experience
  • Experience building production data pipelines (ETL, streaming, or async systems)
  • Solid understanding of distributed systems and failure modes
  • Experience working with LLM-based systems in production:
    • Prompt design
    • Output validation
    • Retry/fallback strategies
    • Evaluation and monitoring
  • Experience with data storage systems (SQL + NoSQL)
  • Familiarity with cloud infrastructure (AWS or similar)

Preferred Experience

  • Experience with RAG / vector search systems
  • Background in financial data or capital markets
  • Experience with streaming systems (Kafka, etc.)
  • Experience building multi-step or agent-style workflows

What Makes This Role Interesting

  • Work on high-accuracy AI systems where correctness matters
  • Solve real problems around:
    • LLM reliability and hallucination mitigation
    • Data consistency across conflicting sources
    • Real-time vs correctness tradeoffs
  • Build systems used in financial decision-making workflows
  • High ownership over core architecture in an early-stage environment

Nice to Know (but not required)

  • Experience with orchestration tools ( Airflow, etc.)
  • Exposure to evaluation frameworks for LLMs
  • Experience working with large-scale document processing

Tech Stack (Representative, not exhaustive)

  • Python, APIs, async processing
  • LLM APIs + embeddings
  • SQL / NoSQL databases
  • Cloud infrastructure (AWS)
  • Data pipelines and streaming systems

Vector Databases

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: sanny001
  • Position Id: aldata
  • Posted 6 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Hybrid in Philadelphia, Pennsylvania

3d ago

Easy Apply

Contract

Depends on Experience

Philadelphia, Pennsylvania

Today

Full-time

USD 120,000.00 - 150,000.00 per year

Philadelphia, Pennsylvania

Today

Full-time

USD 105,000.00 - 145,000.00 per year

Philadelphia, Pennsylvania

Today

Full-time

Search all similar jobs