AI Data Engineer

Overview

Hybrid
$75 - $85
Contract - W2
Contract - 6 Month(s)

Skills

Amazon Neptune
Amazon Web Services
Artificial Intelligence
Computer Science
Data Engineering
Data Processing
Data Quality
Data Warehouse
Generative Artificial Intelligence (AI)
Google Cloud Platform
Information Engineering
Insurance
Language Models
Large Language Models (LLMs)
Mentorship
Machine Learning (ML)
Workflow
Vertex
Vector Databases
Use Cases
Unstructured Data
Storage
Software Engineering
Graph Databases
Snow Flake Schema
Python
Prompt Engineering
Optimization
Neo4j

Job Details

Job ID: H#12850-1 - AI Data Engineer

****Hybrid in Hartford, CT or Charlotte, NC

PLEASE NOTE: This is a 6 month contract to hire and needs to meet Client full-time conversion policies. Those dependent on a work permit sponsor now or anytime in the future (ie H1B, OPT, CPT, etc) do not meet Client requirements for this opening.

We are seeking a talented and motivated AI Data Engineer to join our innovative team. The ideal candidate will gain strong expertise in generative AI technologies, experience in implementing AI pipelines, and knowledge of vector and graph databases. We're looking for someone with some level of hands-on experience in prompt engineering, unstructured data processing, and agentic workflow implementation. As an AI Data Engineer, you will contribute to the development of advanced AI systems that leverage state-of-the-art generative models, implement efficient RAG (Retrieval-Augmented Generation) architectures, and integrate with our data infrastructure. Familiarity with Snowflake integration and insurance industry use cases is a plus.

Primary Job Responsibilities:

  • Design, develop, and implement complex data pipelines for AI/ML, including those supporting RAG architectures, using technologies such as Python, Snowflake, AWS, Google Cloud Platform, and Vertex AI.
  • Implement on end-to-end generative AI pipelines, from data ingestion to pipeline deployment and monitoring.
  • Build and maintain data pipelines that ingest, transform, and load data from various sources (structured, unstructured, and semi-structured) into data warehouses, data lakes, vector databases (e.g., Pinecone, Weaviate, Faiss - consider specifying which ones you use or are exploring), and graph databases (e.g., Neo4j, Amazon Neptune - same consideration as above).
  • Develop and implement data quality checks, validation processes, and monitoring solutions to ensure data accuracy, consistency, and reliability.
  • Implement end-to-end generative AI data pipelines, from data ingestion to pipeline deployment and monitoring.
  • Develop complex AI systems, adhering to best practices in software engineering and AI development.
  • Work with cross-functional teams to integrate AI solutions into existing products and services.
  • Keep up-to-date with AI advancements and apply new technologies and methodologies to our systems.
  • Assist in mentoring junior AI/data engineers in AI development best practices.
  • Implement and optimize RAG architectures and pipelines.
  • Develop solutions for handling unstructured data in AI pipelines.
  • Implement agentic workflows for autonomous AI systems.
  • Develop graph database solutions for complex data relationships in AI systems.
  • Integrate AI pipelines with Snowflake data warehouse for efficient data processing and storage.
  • Apply GenAI solutions to insurance-specific use cases and challenges.

Required Qualifications:

  • Bachelor's in Computer Science, Artificial Intelligence, or a related field.
  • 6+ years of experience in data engineering
  • Awareness of data engineering, with at least some hands on with generative AI technologies.
  • Ability to showcase implementation of production-ready enterprise-grade GenAI pipelines.
  • Experience & awareness of prompt engineering techniques for large language models.
  • Experience & awareness in implementing Retrieval-Augmented Generation (RAG) pipelines, integrating retrieval mechanisms with language models.
  • Knowledge of vector databases and graph databases, including implementation and optimization.
  • Experience & awareness in processing and leveraging unstructured data for GenAI applications.
  • Proficiency in implementing agentic workflows for AI systems
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.