Lead AI Engineer

Overview

Accepts corp to corp applications

Contract - W2

Contract - Independent

Skills

Generative Artificial Intelligence (AI)

Art

Regular Expression

Optimization

Semantics

Workflow

Unstructured Data

Real-time

Caching

Failover

Dashboard

Cloud Computing

API

Amazon SQS

Continuous Integration

Continuous Delivery

Continuous Improvement

Artificial Intelligence

Machine Learning (ML)

Natural Language Processing

Search Engineering

Okapi BM25

Semantic Search

BERT

Vector Databases

Elasticsearch

Neural Network

Python

Java

Scala

Amazon Web Services

Amazon SageMaker

Amazon EC2

Amazon S3

Flask

Docker

Information Retrieval

International Relations

Investor Relations

Evaluation

Business Intelligence

Machine Learning Operations (ML Ops)

Use Cases

Meta-data Management

IMG

Job Details

This is Vamshi ,from Software Technology We have a job opening with our client for position Lead AI Engineer If you are available and looking for any new opportunities, please send me your updated resume for below position ASAP.

Job Title: Lead AI Engineer
Location: Austin, Texas (Hybrid)

Duration: Longterm Contract

Lead AI Engineer (Search Modernization)

Mandatory Skills: Elastic Search, OpenSearch, Python, LLM, GenAI, Semantic Search, Re-Ranking, AWS, Search Engineer

Job Description:

We are looking for an AI Engineer to modernize and enhance our existing regex/keyword-based Elastic Search system by integrating state-of-the-art semantic search, dense retrieval, and LLM-powered ranking techniques.
This role will drive the transformation of traditional search into an intelligent, context-aware, personalized, and high-precision search experience.

The ideal candidate has hands-on experience with Elastic Search internals, information retrieval (IR), embedding-based search, BM25, re-ranking, LLM-based retrieval pipelines, and AWS cloud deployment.

Roles & Responsibilities

Modernizing the Search Platform

Analyze limitations in current regex & keyword-only search implementation on ElasticSearch.
Enhance search relevance using:
- BM25 tuning
- Synonyms, analyzers, custom tokenizers
- Boosting strategies and scoring optimization
Introduce semantic / vector-based search using dense embeddings.

2. LLM-Driven Search & RAG Integration

Implement LLM-powered search workflows including:
- Query rewriting and expansion
- Embedding generation (OpenAI, Cohere, Sentence Transformers, etc.)
- Hybrid retrieval (BM25 + vector search)
- Re-ranking using cross-encoders or LLM evaluators
Build RAG (Retrieval Augmented Generation) flows using ElasticSearch vectors, OpenSearch, or AWS-native tools.

3. Search Infrastructure Engineering

Build and optimize search APIs for latency, relevance, and throughput.
Design scalable pipelines for:
- Indexing structured and unstructured text
- Maintaining embedding stores
- Real-time incremental updates
Implement caching, failover, and search monitoring dashboards.

4. AWS Cloud Delivery

Deploy and operate solutions on AWS, leveraging:
- OpenSearch Service or EC2-managed ElasticSearch
- Lambda, ECS/EKS, API Gateway, SQS/SNS
- SageMaker for embedding generation or re-ranking models
Implement CI/CD for search models and pipelines.

5. Evaluation & Continuous Improvement

Develop search evaluation metrics (nDCG, MRR, precision@k, recall).
Conduct A/B experiments to measure improvements.
Tune ranking functions and hybrid search scoring.
Partner with product teams to refine search behaviors with real usage patterns.

Required Skills & Qualifications

5 10 years of experience in AI/ML, NLP, or IR systems, with hands-on search engineering.
Strong expertise in ElasticSearch/OpenSearch: analyzers, mappings, scoring, BM25, aggregations, vectors.
Experience with semantic search:
- Embeddings (BERT, SBERT, Llama, GPT-based, Cohere)
- Vector databases or ES vector fields
- Approximate nearest neighbor (ANN) techniques
Working knowledge of LLM-based retrieval and RAG architectures.
Proficient in Python; familiarity with Java/Scala is a plus.
Hands-on AWS experience (OpenSearch, SageMaker, Lambda, ECS/EKS, EC2, S3, IAM).
Experience building and deploying APIs using FastAPI/Flask and containerizing with Docker.
Familiar with typical IR metrics and search evaluation frameworks.

Preferred Skills

Knowledge of cross-encoder and bi-encoder architectures for re-ranking.
Experience with query understanding, spell correction, autocorrect, and autocomplete features.
Exposure to LLMOps / MLOps in search use cases.
Understanding of multi-modal search (text + images) is a plus.
Experience with knowledge graphs or metadata-aware search.

Thanks,

Vamshi Thangadpalli

Technical Recruiter

Email: | Web:

100 Overlook Center, Suite 200

Princeton, NJ 08540.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Job Details

Share