Principal Data Engineer

New York, NY, US • Posted 19 hours ago • Updated 6 hours ago
Full Time
On-site
USD $230,000.00 - 250,000.00 per year
Fitment

Dice Job Match Score™

✨ Finding the perfect fit...

Job Details

Skills

  • Core Data
  • Reporting
  • FOCUS
  • Unstructured Data
  • Meta-data Management
  • Semantics
  • Normalization
  • Data Deduplication
  • Enterprise Search
  • Discounted Cumulative Gain
  • Data Governance
  • Documentation
  • Workflow
  • Product Marketing
  • Data Science
  • Streaming
  • Messaging
  • Amazon SQS
  • Neo4j
  • Data Analysis
  • Use Cases
  • Data Architecture
  • TypeScript
  • Python
  • SQL
  • Apache Spark
  • Apache Kafka
  • Node.js
  • Cloud Computing
  • Amazon Web Services
  • Google Cloud
  • Google Cloud Platform
  • Microsoft Azure
  • Machine Learning (ML)
  • Named-Entity Recognition (NER)
  • Text Classification
  • Semantic Search
  • Artificial Intelligence
  • Management
  • Mentorship
  • Offshoring
  • Communication
  • Business Strategy
  • Digital Media
  • Publishing
  • Computer Science
  • Data Engineering
  • Generative Artificial Intelligence (AI)
  • OLAP
  • OLTP
  • Training
  • Life Insurance
  • Television
  • Media
  • Cabling
  • Business-to-business
  • Analytics
  • Aviation
  • Military
  • Law
  • Finance
  • Training And Development
  • Legal
  • Collaboration

Summary

Job Description

The Enterprise Corporate Data Team is looking for a Principal Data Engineer, a senior technical leader responsible for architecting the core data infrastructure and platforms that power enterprise-scale AI applications. Reporting to the VP of Engineering, this role will focus on building systems to surface content, audience, products etc through semantic search capabilities to support personalization, audience discovery, and intelligent content discovery. The Principal Data Engineer will lead the end-to-end design and implementation of scalable pipelines,platforms and systems that support semantic search across massive volumes of structured/semi-structured data using GeN AI technology. This individual will also co-ordinate with a team of off-shore engineers, ensuring consistent delivery, code quality, and alignment with business and technical goals. The ideal candidate will possess an entrepreneurial ethos, an ability to operate in a dynamic environment, and a working knowledge of the current digital media landscape. The candidate should be an expert/knowledgeable with Search systems including but not limited to Similarity, hybrid and semantic search. This role is based in New York City.

Key Responsibilities:

Lead the design and implementation of high-performance OLAP and OLTP systems to support similarity and semantic search.
Architect scalable data platforms that integrate structured and unstructured data,including behavioral signals, content metadata, and user engagement data for Gen AI use cases.
Build systems that enable semantic enrichment of content through entity recognition, disambiguation, normalization and deduplication techniques.
Lead the design and build of high throughput, low latency and highly relevant Enterprise search systems using Vectors, Graph and other search strategies.
Familiar with relevance measurement techniques like DCG, NDCG etc.
Partner closely with other Data engineers, ML engineers and data scientists to deploy and operationalize models for content and audience intelligence.
Oversee and co-ordinate with an offshore engineering team, providing technical guidance, code reviews, and project oversight to ensure timely, high-quality
deliverables.
Ensure best practices in data governance, quality, observability, and documentation across all engineering workflows.
Collaborate with stakeholders across product, marketing, and data science to translate business needs into scalable AI data systems.
Well versed in architecting, designing and developing large scale OLTP and OLAP systems.
Experience building and operating streaming systems using messaging systems like Kafka, Pub/sub, SQS etc.
Experience building an RAG/Graph RAG system with Google, OpenAI or another Gen AI platform.
Experience building a knowledge graph using Neo4j, Spanner, Neptune or another tool is a plus

Qualifications:

10+ years of experience in data engineering, with significant experience building large-scale, distributed data systems to support Data analysis, AI/ ML and key business use cases.
Proven expertise in Search and search related sub systems like Query understanding, search suggest, ranking, relevance with modern strategies like similarity search, hybrid search etc.
Strong coding and data architecture skills using Typescript, Python, SQL, and tools like Apache Spark, Kafka, Airflow, Node Js, and cloud-native platforms (e.g., AWS, Google Cloud Platform, or Azure).
Hands-on experience integrating ML models into production environments for tasks such as entity extraction, text classification, or semantic search.
Familiar with AI grounding strategies including but not limited to Entity graph
Experience managing and mentoring distributed/offshore engineering teams, with a track record of driving execution across time zones.
Excellent communication and collaboration skills, with the ability to bridge technical execution and business strategy.

Preferred Qualifications:

Experience in digital media, publishing, ad tech, or content platforms.
Bachelor's , Master's or Ph.D. in Computer Science, Data Engineering, or a related field.
Knowledge of LLMs and generative AI in applied settings (e.g., content summarization, auto-tagging, retrieval augmentation).
Working experience with OLAP and OLTP systems is a plus

In accordance with applicable law, Hearst is required to include a reasonable estimate of the compensation for this role if hired in New York City. The reasonable estimate, if hired in New York City, is $230,000 to $250,000. Please note this information is specific to those hired in New York City. For candidates outside New York City, the salary range will be aligned with the specific location. A final decision on the successful candidate's starting salary will be based on a number of permissible, non-discriminatory factors, including but not limited to skills, experience, training, certifications, and education. Hearst provides a competitive benefits package, including medical, dental, vision, disability, and life insurance, 401(k), paid holidays and paid time off, employee assistance programs, and more.

About Us

Hearst is a leading global, diversified information, services, and media company dedicated to innovating, informing audiences and leading with purpose, integrity and a culture of care.
Our portfolio includes more than 360 businesses worldwide. On the consumer side, we operate 35 television stations, 28 daily newspapers and publish more than 200 magazine editions featuring many of the most iconic brands in media. We also hold ownership stakes in leading cable networks such as A&E, HISTORY, Lifetime and ESPN. On the business-to-business side, our companies include Fitch Group, a global leader in financial information and analytics; Hearst Health, which provides intelligence and software that improve care outcomes; and Hearst Transportation, which delivers data and software for aviation, automotive and trucking.

Our strength lies in our people. We value the diverse perspectives that move us forward. We are an Equal Opportunity Employer and makes employment decisions without regard to race, color, religion, national origin, sex or gender, sexual orientation, gender identity, gender expression, age, disability, military or veteran status or any other status protected by federal, state, or local law. We also provide reasonable accommodations to applicants and employees consistent with applicable law.

About the Team

Our Corporate Teams deliver essential programs and services that support the entire Hearst enterprise. Spanning communications, employee benefits, finance, learning and development, legal, technology, and more, these teams lead initiatives that advance Hearst's mission to inform and inspire. Here, you'll find opportunities to grow, collaborate and make a lasting impact.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: RTX15f5c8
  • Position Id: 70cf8cca62dd1ef22a1f540ced7213e3
  • Posted 19 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

New York, New York

5d ago

Full-time

USD 160,160.00 - 240,240.00 per year

Jersey City, New Jersey

Today

Full-time

USD 176,720.00 - 265,080.00 per year

New York, New York

Today

Full-time

USD 90,000.00 - 150,000.00 per year

Remote or New York, New York

Today

Full-time

USD 198,000.00 - 220,000.00 per year

Search all similar jobs