Lead AI Engineer

• Posted 14 days ago • Updated 6 days ago
Contract W2
Contract Independent
Contract Corp To Corp
Fitment

Dice Job Match Score™

✨ Finding the perfect fit...

Job Details

Skills

  • Generative Artificial Intelligence (AI)
  • Art
  • Regular Expression
  • Optimization
  • Semantics
  • Workflow
  • Unstructured Data
  • Real-time
  • Caching
  • Failover
  • Dashboard
  • Cloud Computing
  • API
  • Amazon SQS
  • Continuous Integration
  • Continuous Delivery
  • Continuous Improvement
  • Artificial Intelligence
  • Machine Learning (ML)
  • Natural Language Processing
  • Search Engineering
  • Okapi BM25
  • Semantic Search
  • BERT
  • Vector Databases
  • Elasticsearch
  • Neural Network
  • Python
  • Java
  • Scala
  • Amazon Web Services
  • Amazon SageMaker
  • Amazon EC2
  • Amazon S3
  • Flask
  • Docker
  • Information Retrieval
  • Investor Relations
  • International Relations
  • Evaluation
  • Business Intelligence
  • Machine Learning Operations (ML Ops)
  • Use Cases
  • Meta-data Management
  • IMG

Summary

This is Vamshi ,from Software Technology We have a job opening with our client for position Lead AI Engineer If you are available and looking for any new opportunities, please send me your updated resume for below position ASAP.

Job Title: Lead AI Engineer
Location: Austin, Texas (Hybrid)

Duration: Longterm Contract

Lead AI Engineer (Search Modernization)

Mandatory Skills: Elastic Search, OpenSearch, Python, LLM, GenAI, Semantic Search, Re-Ranking, AWS, Search Engineer

Job Description:

We are looking for an AI Engineer to modernize and enhance our existing regex/keyword-based Elastic Search system by integrating state-of-the-art semantic search, dense retrieval, and LLM-powered ranking techniques.
This role will drive the transformation of traditional search into an intelligent, context-aware, personalized, and high-precision search experience.

The ideal candidate has hands-on experience with Elastic Search internals, information retrieval (IR), embedding-based search, BM25, re-ranking, LLM-based retrieval pipelines, and AWS cloud deployment.

Roles & Responsibilities

Modernizing the Search Platform

  • Analyze limitations in current regex & keyword-only search implementation on ElasticSearch.
  • Enhance search relevance using:
    • BM25 tuning
    • Synonyms, analyzers, custom tokenizers
    • Boosting strategies and scoring optimization
  • Introduce semantic / vector-based search using dense embeddings.

2. LLM-Driven Search & RAG Integration

  • Implement LLM-powered search workflows including:
    • Query rewriting and expansion
    • Embedding generation (OpenAI, Cohere, Sentence Transformers, etc.)
    • Hybrid retrieval (BM25 + vector search)
    • Re-ranking using cross-encoders or LLM evaluators
  • Build RAG (Retrieval Augmented Generation) flows using ElasticSearch vectors, OpenSearch, or AWS-native tools.

3. Search Infrastructure Engineering

  • Build and optimize search APIs for latency, relevance, and throughput.
  • Design scalable pipelines for:
    • Indexing structured and unstructured text
    • Maintaining embedding stores
    • Real-time incremental updates
  • Implement caching, failover, and search monitoring dashboards.

4. AWS Cloud Delivery

  • Deploy and operate solutions on AWS, leveraging:
    • OpenSearch Service or EC2-managed ElasticSearch
    • Lambda, ECS/EKS, API Gateway, SQS/SNS
    • SageMaker for embedding generation or re-ranking models
  • Implement CI/CD for search models and pipelines.

5. Evaluation & Continuous Improvement

  • Develop search evaluation metrics (nDCG, MRR, precision@k, recall).
  • Conduct A/B experiments to measure improvements.
  • Tune ranking functions and hybrid search scoring.
  • Partner with product teams to refine search behaviors with real usage patterns.

Required Skills & Qualifications

  • 5 10 years of experience in AI/ML, NLP, or IR systems, with hands-on search engineering.
  • Strong expertise in ElasticSearch/OpenSearch: analyzers, mappings, scoring, BM25, aggregations, vectors.
  • Experience with semantic search:
    • Embeddings (BERT, SBERT, Llama, GPT-based, Cohere)
    • Vector databases or ES vector fields
    • Approximate nearest neighbor (ANN) techniques
  • Working knowledge of LLM-based retrieval and RAG architectures.
  • Proficient in Python; familiarity with Java/Scala is a plus.
  • Hands-on AWS experience (OpenSearch, SageMaker, Lambda, ECS/EKS, EC2, S3, IAM).
  • Experience building and deploying APIs using FastAPI/Flask and containerizing with Docker.
  • Familiar with typical IR metrics and search evaluation frameworks.

Preferred Skills

  • Knowledge of cross-encoder and bi-encoder architectures for re-ranking.
  • Experience with query understanding, spell correction, autocorrect, and autocomplete features.
  • Exposure to LLMOps / MLOps in search use cases.
  • Understanding of multi-modal search (text + images) is a plus.
  • Experience with knowledge graphs or metadata-aware search.

Thanks,

Vamshi Thangadpalli

Technical Recruiter

Email:

100 Overlook Center, Suite 200

Princeton, NJ 08540.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10235988
  • Position Id: 2025-38521/215602
  • Posted 14 days ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Hybrid in Austin, Texas

27d ago

Easy Apply

Contract, Third Party

Depends on Experience

Remote

27d ago

Easy Apply

Contract

Depends on Experience

Cary, North Carolina

Today

Full-time

USD 110,000.00 - 188,200.00 per year

Remote

Today

Full-time

Search all similar jobs