Apply Now

AI Lead Engineer ? Generative AI & LLM Applications

Plano, TX, US • Posted 10 hours ago • Updated 1 hour ago

Full Time

On-site

USD90000.0/ANNUAL - USD130000.0/ANNUAL

Fitment

Dice Job Match Score™

⭐ Evaluating experience...

Job Details

Skills

Python
AI/ML
LLM
GenAI
LangChain
LangGraph
AWS Bedrock
and Vertex AI
Azure OpenAI
Claude
RAG
(OpenSearch
Pinecone
Qdrant
Weaviate
pgvector
FAISS.)
GCP

Summary

Role: AI Lead Engineer - Generative AI & LLM Applications

Location - Plano, TX (onsite)

Full Time

Experience Required: 8-15 years in AI/ML development, with 3+ years specialized in Generative AI and LLM applications.

Role Overview

The AI Lead Engineer will design, build, and operate production-grade Generative AI solutions for complex enterprise scenarios. The role focuses on scalable LLM-powered applications, robust RAG pipelines, and multi-agent systems with MCP deployed across major cloud AI platforms.

Key Responsibilities

Technical Leadership & Development

Design and implement enterprise-grade GenAI solutions using LLMs (GPT, Claude, Llama and similar families).
Build and optimize production-ready RAG pipelines including chunking, embeddings, retrieval tuning, query rewriting, and prompt optimization.
Develop single- and multi-agent systems using LangChain, LangGraph, LlamaIndex and similar orchestration frameworks.
Design agentic systems with robust tool calling, memory management, and reasoning patterns.
Author MCP (Model Context Protocol) servers, tools, and resources, and integrate them with Cursor, Claude, Codex, Copilot, and internal enterprise systems.
Build plugins and extensions for Claude, Codex, Cursor and GitHub Copilot ecosystems.
Building AI Agents and Sub-Agents, Agent Skills for tools like Claude Code, Codex, and GitHub Copilot.
Build scalable Python + FastAPI/Flask or MCP microservices for AI-powered applications, including integration with enterprise APIs.
Implement model evaluation frameworks using RAGAS, DeepEval, or custom metrics aligned to business KPIs.
Implement agent-based memory management using Mem0, LangMem or similar libraries.
Fine-tune and evaluate LLMs for specific domains and business use cases.
Deploy and manage AI solutions on Azure (Azure OpenAI, Azure AI Studio, Copilot Studio), AWS (Bedrock, SageMaker, Comprehend, Lex), and Google Cloud Platform (Vertex AI, Generative AI Studio).
Implement observability, logging, and telemetry for AI systems to ensure traceability and performance monitoring.
Ensure scalability, reliability, security, and cost-efficiency of production AI applications.
Deep understanding of RAG architectures, hybrid retrieval, and context engineering patterns.
Translate business requirements into robust technical designs, architectures, and implementation roadmaps.
Drive innovation by evaluating new LLMs, orchestration frameworks, and cloud AI capabilities (including Copilot Studio for copilots and workflow automation).

Required Skills & Experience

Core Technical

Programming: Expert-level Python with production-quality code, testing, and performance tuning.
GenAI Frameworks: Strong hands-on experience with LangChain, LangGraph, LlamaIndex, agentic orchestration libraries.
LLM Integration: Practical experience integrating OpenAI, Anthropic Claude, Azure OpenAI, AWS Bedrock, and Vertex AI models via APIs/SDKs.
RAG & Search: Deep experience designing and operating RAG workflows (document ingestion, embeddings, retrieval optimization, query rewriting).
Vector Databases: Production experience with at least two of OpenSearch, Pinecone, Qdrant, Weaviate, pgvector, FAISS.

Cloud & AI Services

Azure: Azure OpenAI, Azure AI Studio, Copilot Studio, Azure Cognitive Search.
AWS: Bedrock, SageMaker endpoints, AWS Nova, AWS Transform etc.
Google Cloud Platform: Vertex AI (models, endpoints), Agentspace, Agent Builder.

Preferred Qualifications

Master's degree in Computer Science, AI/ML, Data Science, or related field.
Experience with multi-agent systems, Agent-to-Agent (A2A) communication, and MCP-based ecosystems.
Familiarity with LLMOps / observability platforms such as LangSmith, Opik, Azure AI Foundry.
Experience integrating graph databases and knowledge graphs to enhance retrieval and reasoning.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 10365788
Position Id: W3GEXT-57600
Posted 10 hours ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Gen AI Engineer

Plano, Texas

•

Today

Job Description Gen AI Engineer + Strong DE Experience productionizing AI/ML or LLM-powered workflows (using LangGraph, LangChain, CrewAI, etc.) with a focus on reliability, reproducibility, and auditability Strong understanding of LLMOps fundamentals, including prompt/model/config versioning and traceable run metadata. Hands-on experience in Python, PySpark/Spark, NO-SQL (MongoDB), SQL- PostgreSQL, Redis, Databricks, Delta Lake, Azure Cloud Ability to build and maintain an LLM evaluation frame

Full-time

AI Solutions Lead - Agentic AI

Plano, Texas

•

Today

Overview As an AI Engineer specializing in AI Agents, you will play a pivotal role in our organization's transformation strategies by designing and developing domain-specific AI agents and solutions. Prototyping, iterating, and taking these solutions to production, you serve as the technical complement to the business side of the AI Solutions Lead. Your work will involve close collaboration with transformation teams, business stakeholders, and AI platform teams to create scalable, cross-domain

Full-time

USD 110,700.00 - 185,250.00 per year

AI Engineer, RAG, Agentic AI Azure Foundry, FastAPI

Carrollton, Texas

•

5d ago

We have been retained by our client in Dallas, Texas, a leader in AI and analytics, to deliver an AI Engineer (we need one Junior-Intermediate and second Senior Level AI Engineer) to work on RAG, Azure AI Factory, Agentic AI on a regular full-time, direct-hire basis. Work onsite Dallas location. Provide full-stack AI and analytics services & solutions to empower systems that achieve real outcomes and value at scale. This team is on a mission to push the boundaries of what AI and analytics can

Easy Apply

Full-time

135000 - 160000 /yr

DS with GEN AI

Plano, Texas

•

Today

DS with GEN AI (Location: Plano, TX ) Job Description: Advanced proficiency in Python, including experience with asynchronous programming, data structures, and object-oriented design. DS/ML algorithms and model working knowledge - XGBoost, Linear Regression, Clustering, Decicion Tree, KNN, SVN, etc. LangChain & LangGraph: Hands-on experience building, deploying, and maintaining applications using LangChain and LangGraph frameworks. Large Language Models (LLMs): In-depth understanding and practi

Full-time

Search all similar jobs