Apply Now

Senior Software Engineer - Retrieval-Augmented Generation (RAG)

Philadelphia, PA, US • Posted 9 hours ago • Updated 9 hours ago

Full Time

On-site

USD $95,300.00 - 158,800.00 per year

Fitment

Dice Job Match Score™

🤯 Applying directly to the forehead...

Job Details

Skills

Health Care
API
Collaboration
Testing
Management
Fusion
Vector Databases
Version Control
Access Control
Caching
Healthcare Information Technology
Regulatory Compliance
Authentication
Authorization
Data Masking
Auditing
Software Engineering
Python
Node.js
Natural Language Processing
Machine Learning (ML)
Machine Learning Operations (ML Ops)
IaaS
Amazon Web Services
Google Cloud
Google Cloud Platform
Microsoft Azure
Docker
Kubernetes
Continuous Integration
Continuous Delivery
Problem Solving
Conflict Resolution
Communication
Data Governance
Privacy
Workflow
Prompt Engineering
Evaluation
Quality Assurance
Dashboard
Data Processing
SQL
Pandas
Apache Spark
Performance Tuning
New Relic
Optimization
Security Controls
Artificial Intelligence
Data Analysis
STM
Research
Science
Social Sciences
Jersey
Recruiting

Summary

Job title: Senior Software Engineer II - Retrieval-Augmented Generation (RAG) System

About the role, we are seeking an experienced engineer to work with a team to build and support a healthcare centered production-scale RAG system that combines document retrieval with response generation to deliver accurate, context-aware answers. This engineer we be expected to design, implement, and operate end-to-end RAG pipelines- LLM interaction, API creation, and high-performance, secure delivery of knowledge-grounded capabilities. You will collaborate with data engineers, platform teams, and product partners to ship reliable, scalable, and observable systems.

About the team; This collaborative team is entrusted with building the Next Generation Health Solutions through the utilization of cutting-edge technology.

Role and responsibilities

Architecting, implementing, testing, and operating end-to-end RAG workflows:
Ingesting and normalizing documents from diverse sources
Generating and managing embeddings; index and query vector databases
Retrieve relevant passages, apply reranking or fusion strategies, and feed prompts to LLMs
Building scalable, low-latency services and APIs (Python preferred; other languages acceptable) and ensure production-grade reliability (monitoring, tracing, alerting)
Integrating with vector databases and embedding pipelines and optimize for latency, throughput, and cost
Designing and implementing ML Ops workflows: model/version management, experiments, feature stores, CI/CD for ML-enabled services, rollback plans
Developing robust data pipelines and governance around ingestion, provenance, quality checks, and access controls
Collaborating with data engineers to improve retrieval quality (embedding strategies, reranking, cross-encoder models, prompt engineering) and implement evaluation metrics (precision/recall, MRR, QA accuracy, user-centric metrics)
Implementing monitoring and observability for RAG components (latency, success rate, cache hit rate, retrieval quality, data drift)
Ensuring security, privacy, and compliance (authentication, authorization, data masking, PII handling, audit logging)

Required qualifications

5+ years of professional software engineering experience designing and delivering production systems
Strong programming skills (Python required; NodeJs a plus)
Deep understanding of retrieval-augmented or application-scale NLP systems and practical experience building RAG-like pipelines
Hands-on experience with ML workflow tooling and MLOps concepts (model serving, versioning, experiments, feature stores, reproducibility)
Proficiency with cloud infrastructure and modern software practices (AWS/Google Cloud Platform/Azure; Docker; Kubernetes; CI/CD)
Strong problem-solving skills, excellent communication, and ability to work with cross-functional teams
Familiarity with data governance, privacy, and security best practices

Preferred qualifications

Experience with agentic workflow tools (LangGraph) and familiarity with prompt engineering for LLMs
Exposure to working with and evaluating different LLMs
Knowledge of evaluation methodologies for retrieval and QA systems and the ability to set up A/B tests and dashboards
Experience with data processing frameworks (SQL, Pandas, Spark) and working with large-scale data pipelines
Background in performance optimization for low-latency AI services (MLflow)
Experience with monitoring and logging via New Relic, K9s, Portkey, etc
Experience with minimizing token usage and cost optimization
Comfortable with design and implementation of security controls for data-intensive AI systems

Elsevier is a renowned global information analytics company that primarily focuses on providing scientific, technical, and medical (STM) research content, tools, and services. It is one of the largest publishers of academic journals and scholarly literature in the world.

Elsevier operates in various domains, including science, technology, medicine, social sciences, and more. They publish a vast number of peer-reviewed journals covering a wide range of disciplines. These journals act as platforms for researchers and academics to share their findings and contribute to the advancement of knowledge in their respective fields.
U.S. National Base Pay Range: $95,300 - $158,800. Geographic differentials may apply in some locations to better reflect local market rates.If performed in New Jersey, the base pay range is $107,646 - $171,954.This job is eligible for an annual incentive bonus.
We know your well-being and happiness are key to a long and successful career. We are delighted to offer country specific benefits. Click here to access benefits specific to your location.

We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form or please contact 1-.

Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here

Please read our Candidate Privacy Policy.

We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law.

USA Job Seekers:

EEO Know Your Rights.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: RTX152721
Position Id: 55739254e240d90edac97ab864cd94de
Posted 9 hours ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Senior ML Ops Engineer

Philadelphia, Pennsylvania

•

Today

Are you a collaborative Machine Learning Ops Engineer looking to work for a mission driven global organization? Are you looking to drive cutting edge products that have a true societal impact? About the team, this team that powers Elsevier's Health platforms: Clinical Key AI, Sherpath AI, and AI-driven automated clinical and content workflows. You will bridge Data Science and Engineering to turn experimental NLP/IR/GenAI models into secure, reliable, and scalable services. Our systems operate

Full-time

USD 95,300.00 - 158,800.00 per year

AI Engineer

Philadelphia, Pennsylvania

•

Today

AI Engineer remote for an Educational Company in PA! Amazing pay and benefits! Must have exp. building AI systems into existing platforms! This Jobot Job is hosted by: Alicia Blake Are you a fit? Easy Apply now by clicking the "Apply Now" button and sending us your resume. Salary: $105,000 - $145,000 per year A bit about us: We continue to innovate and improve how we fulfill the evolving needs of the healthcare community. This commitment starts and ends with the people at NBME. By recruiting

Full-time

USD 105,000.00 - 145,000.00 per year

AI Software Engineering Lead

Philadelphia, Pennsylvania

•

Today

Are you a collaborative Software Engineering Lead looking to work for a mission driven global organization? About the role - As Engineering Lead-You will lead and manage a small team of engineers, fostering their growth and ensuring delivery excellence. You will be entrusted with making sure your team is set up for success as they deliver products. This position serves as a subject matter expert for a specific team of Software Engineers. In addition to writing code on complex systems and applic

Full-time

USD 115,400.00 - 192,300.00 per year

Principal AI Software Engineer

Philadelphia, Pennsylvania

•

Today

Job Description Are you a collaborative Principal AI Engineering Lead looking to work for a mission driven global organization? About the role - As a Principal AI Engineer- This position serves as a subject matter expert for a specific team of Software Engineers. In addition to writing code on complex systems and applications, this position provides direction on project plans, schedules, and methodologies. About the team, This team supports CK AI Service team. The new Lead Engineer will have

Full-time

USD 115,400.00 - 192,300.00 per year

Search all similar jobs