Senior AI Engineer, Agentic and RAG Systems

Remote • Posted 30+ days ago • Updated Just Now

Full Time

Remote

Job Details

Skills

Workflow
SAFE
Analytics
Orchestration
Streaming
Regression Analysis
Optimization
Accounting
Build Vs Buy
Data Engineering
Semantics
Software Engineering
Shipping
Research
Python
LangChain
LlamaIndex
Autogen
Stacks Blockchain
Caching
Vector Databases
Microsoft Azure
Routing
Kubernetes
Docker
Cloud Computing
Amazon Web Services
LangSmith
Evaluation
Continuous Integration
Continuous Delivery
Artificial Intelligence
GitHub
Jenkins
English
Databricks
Unity
PySpark
SQL
Microsoft Certified Professional
Servers
Generative Artificial Intelligence (AI)
Hardening
Machine Learning (ML)
Natural Language Processing
BERT
Time Series

Summary

We are seeking a hands-on Senior AI Engineer who designs, builds, and operates production GenAI systems - agentic workflows, RAG pipelines, and LLM-backed services with real users and real SLAs. This is an engineering role, not a research role. The bar is reliability, latency, cost, observability, and safe deployment at scale, with end-to-end ownership from architecture through on-call. Typical workloads include enterprise knowledge platforms, conversational analytics, agentic automation, and LLM-augmented data products.

Responsibilities

Design agent orchestration (graph/state, conditional routing, tool calling, memory, checkpointing) in LangGraph / LangChain or equivalent
Build production RAG end-to-end: chunking, embeddings, vector stores, hybrid retrieval, reranking, caching, and grounded synthesis
Own Python / FastAPI services - async, SSE streaming, session handling, and structured error contracts
Instrument with tracing and evaluation harnesses (MLflow, OpenTelemetry, or equivalent) for accuracy, cost, and regression
Ship on Docker + Kubernetes (EKS/AKS/GKE) via CI/CD with test, eval, and canary gates
Drive LLM cost engineering - model routing, prompt optimization, caching, token accounting, and build-vs-buy decisions
Apply GenAI safety & governance: hallucination control, prompt-injection defense, PII handling, and HITL where required
Partner with data engineering on semantic layers and pipelines (PySpark / SQL where applicable)

Requirements

5+ years in software engineering, with 2+ years shipping production LLM / agentic systems (not POCs or research)
Proficiency in Python and FastAPI (async, REST, SSE)
Production expertise in LangChain and LangGraph (or equivalent serious production experience with LlamaIndex, AutoGen, or MCP stacks)
Background in production RAG: embeddings, chunking, and hybrid retrieval with reranking and caching
Skills in vector databases such as Pinecone, Weaviate, pgvector, OpenSearch, or Databricks Vector Search
Knowledge of at least one major LLM provider in production - AWS Bedrock (preferred), OpenAI / Azure OpenAI, or Anthropic - with model selection and routing trade-offs
Competency in Kubernetes and Docker in real production environments (EKS/AKS/GKE)
Expertise in cloud engineering on AWS
Familiarity with observability and tracing tools (MLflow, LangSmith, OpenTelemetry), evaluation harnesses, and latency/cost ownership
Capability to build CI/CD for AI systems (GitHub Actions, Jenkins, or equivalent) with test/eval gates
Strong written and spoken English (B2 level); able to own design discussions with engineering and business stakeholders independently

Nice to have

Databricks depth - MLflow (tracking & serving), Vector Search, Unity Catalog / Metric Views, PySpark / SQL
Experience with LLM fine-tuning - PEFT, LoRA, QLoRA
Understanding of MCP servers and tool integration
Qualifications in GenAI governance & FinOps - auditability, prompt-injection hardening, PII, and token cost in regulated environments
Background in classical ML / DL - NLP, BERT-family, time-series, and CV

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 10330481
Position Id: 683a0a9659a32be2ed024c64d83b6f28
