Job Title: Data AI Engineer with Vector Databases
Location: Plano, TX (5 days Onsite role )
Duration: 12 months (extendable/potential conversion)
Contract: W2
Exp: 9+
Key Responsibilities:
Design and build ETL/ELT pipelines and data processing workflows
Develop batch and real-time data pipelines using modern frameworks
Work with Python and SQL for data transformation and analytics
Implement GenAI data architectures, including RAG pipelines and vector indexing
Manage and optimize Vector Databases for embedding storage and similarity search
Build secure data solutions on AWS, ensuring data quality and compliance
Support analytics, reporting, and data modernization initiatives
Required Skills:
Strong experience in Python and SQL
Hands-on experience with ETL/ELT and data pipelines
Mandatory: Experience with Vector Databases
Experience with GenAI / LLM frameworks (LangChain or LangGraph)
Experience with Big Data frameworks (Apache Spark, Apache Kafka)
Workflow orchestration using Apache Airflow
Experience with data platforms like Databricks or Snowflake
AWS services: S3, Glue, Redshift
Nice to Have:
Experience with ML frameworks (Scikit-learn, PyTorch)
Knowledge of RAG architectures and embedding pipelines
Experience in financial services / fintech environments