Hybrid Memory-Retrieval Model: Enhancing Trust in Medical Chatbots

Abstract

Large language model-based medical chatbots face two major challenges: hallucination, where models generate plausible but incorrect information, and context loss in multi-turn conversations. These issues reduce trust and safety in healthcare applications. This work presents a hybrid memory-retrieval architecture that enhances factual accuracy and conversational continuity. The system combines a dual-retriever pipeline using BM25 and MedCPT with long-term memory retrieval via ChromaDB. Retrieved documents and past interactions are fused using Reciprocal Rank Fusion and passed to a compact language model (Phi-2) for response generation. When no relevant context is retrieved, the model follows fallback instructions rather than producing a hallucinated answer. Evaluation on the MedQuAD dataset shows strong semantic alignment (BERTScore F1 = 0.8644), improved fluency, and significantly faster response times than baseline retrieval-augmented models. This approach demonstrates the effectiveness of integrating structured memory with selective retrieval to build more trustworthy and reliable medical chatbots.
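As a rough illustration of the fusion step described in the abstract, the sketch below shows how Reciprocal Rank Fusion could combine ranked lists from the two retrievers and the memory store. The constant k = 60, the function name, and the example document IDs are illustrative assumptions, not details reported in the paper.

```python
from collections import defaultdict

def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs with Reciprocal Rank Fusion.

    Each document scores sum(1 / (k + rank)) over every list it appears in;
    higher fused scores rank first. k = 60 is a commonly used default, not
    a value taken from the paper.
    """
    scores = defaultdict(float)
    for ranked_ids in rankings:
        for rank, doc_id in enumerate(ranked_ids, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

# Hypothetical ranked outputs from the three sources named in the abstract:
bm25_hits = ["doc3", "doc1", "doc7"]    # sparse lexical retriever (BM25)
medcpt_hits = ["doc1", "doc5", "doc3"]  # dense biomedical retriever (MedCPT)
memory_hits = ["doc9", "doc1"]          # past interactions from ChromaDB

fused = reciprocal_rank_fusion([bm25_hits, medcpt_hits, memory_hits])
print(fused)  # doc1 ranks first because it appears in all three lists
```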