Graph-Based RAG for Manuscript Collections: A LangGraph Approach


Abstract

This paper introduces a conversational agent designed for querying digitized historical manuscript collections, developed as part of the MAGIC project. The system incorporates hybrid sparse and dense retrieval, a Neo4j knowledge graph, ALTO XML-based visual grounding, and a multi-step LangGraph ReAct agent powered by a Llama-3.3-70B backend. To identify the most effective retrieval strategy, a benchmark of 100 hand-annotated queries was constructed, covering six query types across two 15th-century incunabula. Multiple retrieval methods were evaluated, including BM25, dense retrieval, hybrid Reciprocal Rank Fusion (RRF), cross-encoder reranking, graph-augmented retrieval, and Hypothetical Document Embeddings (HyDE), using standard information retrieval metrics and latency measurements. The results indicate that Hybrid RRF achieves the most favorable precision–latency trade-off and is positioned on the Pareto frontier for interactive applications. However, no single method demonstrates optimal performance across all query types. Graph-based expansion substantially enhances catalog and complex queries that require relational reasoning, but reduces effectiveness for semantic and entity queries. This outcome supports the adoption of a query-adaptive retrieval strategy within the agent. Furthermore, HyDE consistently underperforms on historical text due to temporal distribution mismatch, resulting in increased latency without corresponding improvements in retrieval quality. A Retrieval-Augmented Generation (RAG) evaluation across three generation backends demonstrates near-perfect faithfulness, indicating reliable grounding. However, answer relevance remains constrained by retrieval precision. These findings identify retrieval, rather than generation, as the primary bottleneck in historical manuscript question answering. All benchmark data, annotations, and system components are made available to support reproducibility and future research.
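The Reciprocal Rank Fusion method highlighted above combines sparse (e.g. BM25) and dense rankings by summing reciprocal ranks. As a minimal sketch, assuming rankings are best-first lists of document IDs (function and variable names here are illustrative, not from the MAGIC codebase):

```python
# Hypothetical sketch of Reciprocal Rank Fusion (RRF); names are
# illustrative and not taken from the system described in the paper.

def rrf_fuse(rankings, k=60):
    """Fuse several ranked lists of document IDs.

    rankings: iterable of lists, each ordered best-first.
    k: smoothing constant (60 in the original RRF formulation).
    Returns document IDs sorted by fused score, best first.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            # Each list contributes 1 / (k + rank) for every document it ranks.
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

# Example: fuse a sparse (BM25) ranking with a dense ranking.
bm25_ranking = ["d1", "d2", "d3"]
dense_ranking = ["d3", "d1", "d4"]
fused = rrf_fuse([bm25_ranking, dense_ranking])
```

Because RRF uses only ranks, not raw scores, it needs no score normalization across retrievers, which helps explain its favorable latency in the reported benchmark.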
