Gen AI (Data Scientist/ ML Engineer) - 15 + Years

Remote • Posted 30+ days ago • Updated 5 days ago

Contract Corp To Corp

Contract Independent

Contract W2

50% Travel Required

Remote

$60 - $70/hr

Fitment

Dice Job Match Score™

🫥 Flibbertigibetting...

Job Details

Skills

ElasticSearch
OpenSearch
Python
LLM
GenAI
Semantic Search
Re-Ranking
AWS
Search Engineer

Summary

Lead AI Engineer with (Search Modernization) with 15+ Years of experience

Experience required :- 15+ years !

Mandatory Skills: ElasticSearch,OpenSearch,Python,LLM,GenAI,Semantic Search,Re-Ranking,AWS,Search Engineer

Location: Austin, TX (3 days work from office)

During the discovery stage, it will be 5 days working from office for the first 4 weeks of discovery

Job Description:

We are looking for an AI Engineer to modernize and enhance our existing regex/keyword-based ElasticSearch system by integrating state-of-the-art semantic search, dense retrieval, and LLM-powered ranking techniques.
This role will drive the transformation of traditional search into an intelligent, context-aware, personalized, and high-precision search experience.

The ideal candidate has hands-on experience with ElasticSearch internals, information retrieval (IR), embedding-based search, BM25, re-ranking, LLM-based retrieval pipelines, and AWS cloud deployment.

Roles & Responsibilities

Modernizing the Search Platform

Analyze limitations in current regex & keyword-only search implementation on ElasticSearch.
Enhance search relevance using:

BM25 tuning
Synonyms, analyzers, custom tokenizers
Boosting strategies and scoring optimization

Introduce semantic / vector-based search using dense embeddings.

2. LLM-Driven Search & RAG Integration

Implement LLM-powered search workflows including:

Query rewriting and expansion
Embedding generation (OpenAI, Cohere, Sentence Transformers, etc.)
Hybrid retrieval (BM25 + vector search)
Re-ranking using cross-encoders or LLM evaluators

Build RAG (Retrieval Augmented Generation) flows using ElasticSearch vectors, OpenSearch, or AWS-native tools.

3. Search Infrastructure Engineering

Build and optimize search APIs for latency, relevance, and throughput.
Design scalable pipelines for:

Indexing structured and unstructured text
Maintaining embedding stores
Real-time incremental updates

Implement caching, failover, and search monitoring dashboards.

4. AWS Cloud Delivery

Deploy and operate solutions on AWS, leveraging:

OpenSearch Service or EC2-managed ElasticSearch
Lambda, ECS/EKS, API Gateway, SQS/SNS
SageMaker for embedding generation or re-ranking models

Implement CI/CD for search models and pipelines.

5. Evaluation & Continuous Improvement

Develop search evaluation metrics (nDCG, MRR, precision@k, recall).
Conduct A/B experiments to measure improvements.
Tune ranking functions and hybrid search scoring.
Partner with product teams to refine search behaviors with real usage patterns.

Required Skills & Qualifications

5–10 years of experience in AI/ML, NLP, or IR systems, with hands-on search engineering.
Strong expertise in ElasticSearch/OpenSearch: analyzers, mappings, scoring, BM25, aggregations, vectors.
Experience with semantic search:

Embeddings (BERT, SBERT, Llama, GPT-based, Cohere)
Vector databases or ES vector fields
Approximate nearest neighbor (ANN) techniques

Working knowledge of LLM-based retrieval and RAG architectures.
Proficient in Python; familiarity with Java/Scala is a plus.
Hands-on AWS experience (OpenSearch, SageMaker, Lambda, ECS/EKS, EC2, S3, IAM).
Experience building and deploying APIs using FastAPI/Flask and containerizing with Docker.
Familiar with typical IR metrics and search evaluation frameworks.

Preferred Skills

Knowledge of cross-encoder and bi-encoder architectures for re-ranking.
Experience with query understanding, spell correction, autocorrect, and autocomplete features.
Exposure to LLMOps / MLOps in search use cases.
Understanding of multi-modal search (text + images) is a plus.
Experience with knowledge graphs or metadata-aware search.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 91132356
Position Id: 8837505
Posted 30+ days ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Gen AI (Data Scientist/ ML Engineer) - 15 + Years

Dice Job Match Score™

Job Details

Skills

Summary

Similar Jobs