Apply Now

AI Application Engineer

Santa Clara, CALIFORNIA, US • Posted 4 hours ago • Updated 4 hours ago

Contract W2

On-site

DOE

Fitment

Dice Job Match Score™

👤 Reviewing your profile...

Job Details

Skills

Management
Regression Testing
Optimization
Collaboration
Continuous Improvement
Software Engineering
Python
Machine Learning (ML)
Application Development
Prompt Engineering
Training
LangChain
LlamaIndex
Vector Databases
NIM
Customer Facing
Evaluation
Orchestration
ServiceNow
API
Artificial Intelligence

Summary

JOB SUMMARY This role is for an AI Application Engineer to support the development and delivery of next-generation AI-powered applications. The position will concentrate on production-grade LLM application engineering, RAG quality, prompt engineering, AI safety, and the orchestration of complex multi-step AI pipelines. The engineer will design, develop, and optimize AI applications, ensuring AI quality, RAG accuracy, prompt engineering, and AI safety. Key activities include developing and maintaining orchestration pipelines, implementing and optimizing RAG pipelines, designing conversational AI experiences, integrating NVIDIA technologies, building automated evaluation pipelines, performing latency profiling, and implementing AI safety guardrails. Collaboration with global teams and support for production deployments are also integral to the role. Key Responsibilities Design, develop, and optimize production-grade LLM-powered applications. Own AI quality, RAG accuracy, prompt engineering, and AI safety across multiple applications. Develop and maintain multi-step LLM orchestration pipelines using LangChain, LlamaIndex, or custom frameworks. Implement and optimize RAG pipelines including chunking strategies, embedding selection, reranking, and hybrid search. Design multi-turn conversational AI experiences with context management and session memory. Integrate NVIDIA technologies including NIM, NeMo, NeMoGuardrails, and Riva into enterprise AI applications. Build automated evaluation pipelines for model quality, hallucination detection, regression testing, and release gating. Perform latency profiling and optimization across multi-step LLM call chains. Implement AI safety guardrails including prompt injection prevention, jailbreak mitigation, and topical control. Collaborate with globally distributed engineering and product teams to deliver scalable AI solutions. Support deployment, monitoring, and continuous improvement of AI applications in production environments. Required Qualifications 47 years of software engineering experience with at least 2 years focused on production LLM application development. Expert-level experience with Python for AI/ML application development and async programming. Strong expertise in prompt engineering including system prompts, few-shot prompting, and instruction tuning. 3+ years of hands-on experience with multi-step LLM orchestration frameworks such as LangChain or LlamaIndex. 3+ years of experience designing and optimizing RAG pipelines and retrieval systems. 3+ years of experience with vector databases, similarity search tuning, and reranking techniques. 3+ years of hands-on experience with NVIDIA NIM, NeMo, NeMoGuardrails, and Riva. 3+ years of experience implementing AI safety and guardrails for customer-facing applications. Strong knowledge of automated AI evaluation frameworks such as RAGAS or TruLens. 3+ years of experience profiling and optimizing latency in multi-step AI pipelines. Ability to work onsite in Santa Clara, CA. Preferred Qualifications Experience with adaptive learning systems or recommendation engines. Knowledge graph integration experience with RAG architectures. Experience with multi-agent orchestration patterns. ServiceNow API integration experience. Prior experience building AI products on NVIDIA infrastructure. Experience with streaming LLM response handling and real-time AI applications. Education: Bachelors Degree

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: compun
Position Id: KUMDC5806377
Posted 4 hours ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

AI Application Engineer || CA.

Santa Clara, California

•

Today

Role: AI Application Engineer Location: Santa Clara, CA (3 days onsite in a week) Duration: 13 months contract Required Skills: Minimum 13+ years of overall IT experience. 47 years of software engineering experience with at least 2 years focused on production LLM application development Expert-level experience with Python for AI/ML application development and async programming Strong expertise in prompt engineering including system prompts, few-shot prompting, and instruction tuning 3

Easy Apply

Contract, Third Party

Depends on Experience

Python Developer LLM / AI Applications (W2 Onsite)

Sunnyvale, California

•

20d ago

Maxonic maintains a close and long-term relationship with our direct client. In support of their needs, we are looking for a Python Developer LLM / AI Applications. Job Description: Job Title: Python Developer LLM / AI Applications Job Type: Contract Job Location: Sunnyvale, CA Work Schedule: On-site Description: We are looking for a Python developer to design, build, and deploy applications powered by large language models (LLMs). You ll work on integrating models like OpenAI API, Hugging Face

Easy Apply

Contract

Depends on Experience

AI Solution Engineer

Santa Clara, California

•

12d ago

Immediate need for a talented AI Solution Engineer. This is a 12+months contract opportunity with long-term potential and is located in Cupertino/Santa Clara Valley, CA (Onsite). Please review the job description below and contact me ASAP if you are interested. Job ID:26-14453 Pay Range: $45 - $52/hour. Employee benefits include, but are not limited to, health insurance (medical, dental, vision), 401(k) plan, and paid sick leave (depending on work location). Key Requirements and Technology E

Easy Apply

Contract

$45 - $52

Applied AI developer

Hybrid in Palo Alto, California

•

13d ago

Applied AI Developer Engineering (Enterprise AI Team) Location: Hybrid (Palo Alto, CA) 3 days onsite, 2 days remote. Role Type: Contract (W2) About Our Client Our client is a fast-growing, publicly traded technology company operating in the cybersecurity and data management space. They provide a modern platform focused on protecting, managing, and securing data across cloud, hybrid, and on-prem environments. Their solutions help enterprise organizations defend against cyber threats, ensure comp

Easy Apply

Contract

Depends on Experience

Search all similar jobs