Biomedical Hypothesis Explainability with Graph-Based Context Retrieval

Ilya Tyagin
Saeideh Valipour
Aliaksandra Sikirzhytskaya
Michael Shtutman
Ilya Safro

Abstract

We introduce an explainability method for biomedical hypothesis generation systems, built on the novel Hypothesis Generation Context Retriever framework. Our approach combines semantic graph-based retrieval and relevant data-restrictive training to simulate real-world discovery constraints. Integrated with large language models (LLMs) via retrieval-augmented generation, the system explains hypotheses with contextual evidence drawn from published scientific literature. We also propose a novel feedback loop approach, which iteratively identifies and corrects flawed parts of LLM-generated explanations, refining both the evidence paths and the supporting context. We demonstrate the performance of our method with multiple large language models and evaluate explanation and context retrieval quality through both expert-curated assessment and large-scale automated analysis. Our code is available at: https://github.com/IlyaTyagin/HGCR.
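
The feedback loop described above can be illustrated with a minimal sketch. This is not the authors' implementation (see the repository linked above for that). It assumes a toy semantic graph whose edges carry one supporting sentence each, and it replaces both the LLM explainer and the flaw detector with trivial stand-ins. All names here (`EDGES`, `find_path`, `critique`, `explain_with_feedback`) and all graph contents are hypothetical.

```python
from collections import deque

# Hypothetical toy semantic graph: (term_a, term_b) -> supporting sentence.
# An empty string marks an edge with no usable literature context.
EDGES = {
    ("drug_x", "protein_a"): "Drug X inhibits protein A in vitro.",
    ("protein_a", "disease_y"): "",
    ("drug_x", "pathway_b"): "Drug X modulates pathway B.",
    ("pathway_b", "protein_c"): "Pathway B regulates protein C expression.",
    ("protein_c", "disease_y"): "Protein C is elevated in disease Y.",
}

def neighbors(node, banned):
    """Yield (neighbor, edge) pairs for an undirected view of EDGES."""
    for (a, b) in EDGES:
        if (a, b) in banned:
            continue
        if a == node:
            yield b, (a, b)
        elif b == node:
            yield a, (a, b)

def find_path(source, target, banned):
    """Breadth-first search; returns the shortest path as a list of edges."""
    queue = deque([(source, [])])
    seen = {source}
    while queue:
        node, path = queue.popleft()
        if node == target:
            return path
        for nxt, edge in neighbors(node, banned):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, path + [edge]))
    return None

def explain(path):
    """Stand-in for the LLM call: concatenates the retrieved context."""
    return " ".join(EDGES[e] for e in path)

def critique(path):
    """Stand-in for flaw detection: flags edges that lack context."""
    return [e for e in path if not EDGES[e]]

def explain_with_feedback(source, target, max_rounds=3):
    """Retrieve a path, check it, drop flawed edges, and retry."""
    banned = set()
    for _ in range(max_rounds):
        path = find_path(source, target, banned)
        if path is None:
            return None, None
        flawed = critique(path)
        if not flawed:
            return path, explain(path)
        banned.update(flawed)  # refine the evidence path on the next round
    return None, None

path, text = explain_with_feedback("drug_x", "disease_y")
```

In this toy run the shortest path goes through an edge with no supporting sentence, so the first round is rejected and the second round returns the longer, fully supported path. A real system would replace `explain` and `critique` with model calls and retrieve context from a literature-derived graph.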

Version published to 10.1101/2025.10.17.683161 on bioRxiv
Oct 19, 2025

Related articles

Tuning Knowledge Graph Embeddings in Clustering with LISE

This article has 5 authors:
1. Verdiana Schena
2. Simona Colucci
3. Donini Francesco Maria
4. Floriano Scioscia
5. Eugenio Di Sciascio
This article has no evaluations. Latest version: Dec 15, 2025

A Modular Framework for Automated Hypothesis Validation and Refinement in Scientific Research

This article has 5 authors:
1. Chenhao Chen
2. Taiga Masuda
3. Tsubasa Hirakawa
4. Takayoshi Yamashita
5. Hironobu Fujiyoshi
This article has no evaluations. Latest version: Jan 19, 2026

DiLLaB: Discussion Labeling with LLMs for Building Datasets

This article has 6 authors:
1. Ludimila Gonçalves
2. Márcia Lima
3. André Carvalho
4. Walter Nakamura
5. Igor Steinmacher
6. Tayana Conte
This article has no evaluations. Latest version: Jan 28, 2026