Retrieval-Augmented Generation

Abstract

Retrieval-augmented generation (RAG) is a hybrid architecture that combines the generative power of large language models (LLMs) with the factual reliability of information retrieval systems. Although LLMs have significantly improved performance on natural language understanding and generation tasks, they often suffer from information distortion, outdated knowledge, and a lack of transparency. RAG addresses these limitations by introducing an external retrieval mechanism into the generation process. RAG systems follow a retrieve-then-generate paradigm: relevant documents are retrieved from knowledge sources and supplied as input to the language model, enabling it to produce more accurate, well-grounded, and up-to-date responses. RAG has become a foundational technique for knowledge-intensive natural language processing (NLP) and LLM applications. In this review, we describe the basic architecture of RAG systems, analyze key components such as retrievers and generators, compare mainstream implementations, and evaluate their performance across a range of tasks. We also discuss challenges in the RAG pipeline, including latency, hallucination, context filtering, and knowledge freshness. Finally, we highlight future research directions in scalability, personalization, and integration with structured knowledge sources.
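
To make the retrieve-then-generate paradigm concrete, the following minimal Python sketch ranks documents with a simple bag-of-words cosine similarity and assembles a context-augmented prompt. The corpus, query, and generate() stub are illustrative assumptions rather than the implementation of any system surveyed here; a real pipeline would use a learned dense retriever and an actual LLM in place of these stand-ins.

    # Minimal sketch of retrieve-then-generate (illustrative, not from the paper).
    from collections import Counter
    import math

    # Hypothetical toy knowledge source; real systems index large document stores.
    corpus = [
        "RAG combines a retriever with a generative language model.",
        "Dense retrievers embed queries and documents in a shared vector space.",
        "Hallucination occurs when a model generates unsupported claims.",
    ]

    def bow(text):
        # Bag-of-words term frequencies: a crude stand-in for a learned embedding.
        return Counter(text.lower().split())

    def cosine(a, b):
        # Cosine similarity between two sparse term-frequency vectors.
        dot = sum(a[t] * b.get(t, 0) for t in a)
        norm = math.sqrt(sum(v * v for v in a.values())) * \
               math.sqrt(sum(v * v for v in b.values()))
        return dot / norm if norm else 0.0

    def retrieve(query, k=2):
        # Step 1 (retrieve): rank documents by similarity to the query, keep top k.
        q = bow(query)
        return sorted(corpus, key=lambda d: cosine(q, bow(d)), reverse=True)[:k]

    def generate(query, context):
        # Step 2 (generate): condition generation on the retrieved context.
        # The LLM call is stubbed out; a real system would send this prompt to a model.
        return "Context:\n" + "\n".join(context) + f"\n\nQuestion: {query}\nAnswer:"

    query = "What does RAG combine?"
    print(generate(query, retrieve(query)))

The key design point the sketch illustrates is the division of labor: the retriever filters the knowledge source down to a small, query-relevant context, and the generator conditions on that context rather than relying solely on parametric memory, which is what mitigates outdated or distorted information.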