Job Title: Prompt Engineer
Location: Jersey City, NJ (50% Remote / Hybrid)
Job Description
We are seeking a Prompt Engineer to design, test, govern, and continuously optimize prompts, system instructions, conversation flows, and interaction patterns for Large Language Model (LLM) applications. The ideal candidate will ensure AI outputs are accurate, grounded, secure, compliant, and aligned with business objectives.
Key Responsibilities
Design prompts for chatbots, copilots, RAG systems, document analysis, summarization, workflow agents, and knowledge assistants.
Develop system prompts, few-shot examples, tool-use instructions, response templates, and conversation policies.
Optimize prompts for accuracy, relevance, groundedness, safety, compliance, latency, token efficiency, and consistency.
Build reusable prompt libraries and enterprise prompt templates.
Evaluate prompt performance using metrics such as task success, hallucination rate, groundedness, completeness, and user satisfaction.
Partner with engineering teams to implement prompt versioning, testing, deployment, and monitoring.
Improve RAG quality by evaluating retrieval context, chunking strategies, source citations, and response synthesis.
Perform adversarial testing for prompt injection, jailbreaks, instruction conflicts, sensitive data leakage, and unsafe outputs.
Required Skills
Strong understanding of LLMs, prompt engineering, tokenization, context windows, RAG, embeddings, and model limitations.
Hands-on experience with OpenAI APIs, Azure OpenAI, Anthropic, LangChain, LlamaIndex, Semantic Kernel, or similar AI platforms.
Experience debugging LLM outputs through structured testing and iterative prompt refinement.
Knowledge of prompt security, including prompt injection, jailbreaks, hallucinations, and data leakage.
Excellent communication, analytical, and stakeholder management skills.
Preferred Qualifications
Background in NLP, Conversational AI, UX Writing, Technical Writing, Product Design, Knowledge Management, or Business Analysis.
Experience in financial services, legal, compliance, risk, operations, customer support, or enterprise knowledge domains.
Familiarity with prompt registries, A/B testing, human review workflows, and evaluation frameworks.