Apply Now

AI Application Engineer

Hybrid in Santa Clara, CA, US • Posted 12 hours ago • Updated 12 hours ago

Contract W2

Contract Corp To Corp

6 Months

No Travel Required

Hybrid

Depends on Experience

VDart, Inc.

Fitment

Dice Job Match Score™

⭐ Evaluating experience...

Job Details

Skills

AI
LLM
RAG
NVIDIA
Langchain

Summary

Role: AI Application Engineer

Location: Santa Clara, CA (Hybrid)

Type: Contract

Overview:

AI Application Engineer to support the development and delivery of next-generation AI-powered applications built on NVIDIA infrastructure. This role will focus on production-grade LLM application engineering, RAG quality, prompt engineering, AI safety, and orchestration of complex multi-step AI pipelines.

Day-to-Day Responsibilities

Design, develop, and optimize production-grade LLM-powered applications
Own AI quality, RAG accuracy, prompt engineering, and AI safety across multiple applications
Develop and maintain multi-step LLM orchestration pipelines using LangChain, LlamaIndex, or custom frameworks
Implement and optimize RAG pipelines including chunking strategies, embedding selection, reranking, and hybrid search
Design multi-turn conversational AI experiences with context management and session memory
Integrate NVIDIA technologies including NIM, NeMo, NeMoGuardrails, and Riva into enterprise AI applications
Build automated evaluation pipelines for model quality, hallucination detection, regression testing, and release gating
Perform latency profiling and optimization across multi-step LLM call chains
Implement AI safety guardrails including prompt injection prevention, jailbreak mitigation, and topical control
Collaborate with globally distributed engineering and product teams to deliver scalable AI solutions
Support deployment, monitoring, and continuous improvement of AI applications in production environments

Basic Qualifications:

4–7 years of software engineering experience with at least 2 years focused on production LLM application development
Expert-level experience with Python for AI/ML application development and async programming
Strong expertise in prompt engineering including system prompts, few-shot prompting, and instruction tuning
3+ Years of Hands-on experience with multi-step LLM orchestration frameworks such as LangChain or LlamaIndex
3+ Years of Experience designing and optimizing RAG pipelines and retrieval systems
3+ Years of Experience with vector databases, similarity search tuning, and reranking techniques
3+ Years of Hands-on experience with NVIDIA NIM, NeMo, NeMoGuardrails, and Riva
3+ Years of Experience implementing AI safety and guardrails for customer-facing applications
Strong knowledge of automated AI evaluation frameworks such as RAGAS or TruLens
3+ Years of Experience profiling and optimizing latency in multi-step AI pipelines
Ability to work onsite in Santa Clara, CA
Preferred Qualifications
Experience with adaptive learning systems or recommendation engines
Knowledge graph integration experience with RAG architectures
Experience with multi-agent orchestration patterns
ServiceNow API integration experience
Prior experience building AI products on NVIDIA infrastructure
Experience with streaming LLM response handling and real-time AI applications

Technology Stack

Python
LangChain
LlamaIndex
NVIDIA NIM
NeMo
NeMoGuardrails
NVIDIA Riva
Vector Databases
RAGAS / TruLens
LLM APIs and orchestration frameworks

Education

Bachelor’s degree in Computer Science, Engineering, Artificial Intelligence, or equivalent work experience.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 10330808
Position Id: 97827-5195-
Posted 12 hours ago

Company Info

About VDart, Inc.

VDart, headquartered in Atlanta, GA, is a global leader in digital talent solutions and IT staffing, delivering top technology professionals to businesses worldwide. With a strong presence across North America, Europe and Asia, we specialize in helping organizations navigate complex technology landscapes with the right expertise.

Through a strategic, client-focused approach, we have placed over 20,000 professionals across key industries and advanced technology solutions. Whether placing top talent in cutting-edge roles or providing strategic digital workforce solutions, our network of 4,000 specialists across 13 countries is committed to excellence, agility and impact.

Backed by 18 years of industry experience, we go beyond staffing to build long-term partnerships that accelerate digital transformation and drive sustained growth. Whether you need a technology partner to fuel innovation or specialized workforce solutions to maintain a competitive edge, VDart delivers the right people, skills and mindset to create a lasting impact in a digital-first world.

Go to company profile

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Hybrid in Santa Clara, California

•

Yesterday

Role: Platform Engineer (AI/LLM Infrastructure) Location: Santa Clara, CA (Hybrid) Type: Contract Day to Day Job Duties: Lead the design, implementation, and operation of scalable infrastructure platforms supporting AI/LLM-based solutions for enterprise clients Act as a hands-on technical lead (player-coach), contributing to development while guiding a team of engineers Own end-to-end infrastructure architecture below the application layer, including compute, container orchestration, CI/C

Easy Apply

Contract, Third Party

Depends on Experience

Platform Engineer

Remote

•

16d ago

Platform Engineer Remote Contract The Platform LLM Infrastructure Engineer operates at the intersection of capacity planning GPU capacity optimization quota management model lifecycle management and production reliability This role is critical in ensuring scalable efficient and resilient infrastructure for large language model LLM platforms Key Responsibilities Manage shared GPU resources across regions including handling LLM capacity requests and optimizing utilization Depl

Easy Apply

Contract, Third Party

Depends on Experience

Full Stack Develoer

Hybrid in Bellevue, Washington

•

Today

Job Title: Full Stack Developer Location: Bellevue, WA Duration: / Term: 6+ months Job Description: Experience Desired: 8+ Years Job Description: Full Stack designs, develops, and delivers scalable software solutions that enable scalable, highly available, and secure systems across the enterprise. This role partners closely with architects, product owners, data engineers, and privacy stakeholders to build services and platforms that support regulatory compliance while maintaining p

Easy Apply

Third Party, Contract

$60 - $65

Software Engineer (Kafka)

Frisco, Texas

•

Today

Job Title: Software Engineer (Kafka) Location: Bellevue, WA / Frisco, TX Duration: / Term: 6+ months Job Description: Experience Desired: 8+ Years Job Description: Designs, develops, and delivers scalable software solutions that enable highly available and secure systems across the enterprise. Collaborating closely with data engineers, this role architect implements complex data pipelines, real-time event-driven messaging frameworks, and distributed microservices communication us

Easy Apply

Contract, Third Party

$60 - $65

Search all similar jobs

AI Application Engineer

VDart, Inc.

Dice Job Match Score™

Job Details

Skills

Summary

Company Info

About VDart, Inc.

Similar Jobs