Biological Database Mining for LLM-Driven Alzheimer’s Disease Drug Repurposing

Rico Andre Schmitt
Konstantin Buelau
Leon Martin
Christoph Huettl
Michael Schirner
Leon Stefanovski
Petra Ritter

Abstract

BACKGROUND

This study presents a software pipeline that uses large language models (LLMs) to apply knowledge stored in natural language (such as pharmacological texts) and in ontologies within a transparent Drug Repurposing (DR) information structure.

METHODS

Alzheimer’s Disease (AD)-related entries in Gene Ontology and DrugBank were integrated into a Knowledge Graph database to inform LLM prompts. The LLM Llama3:8b screened 16,581 drugs for their DR potential. The vector embedding representation of the drugs in the LLM was investigated to assess whether LLMs store pharmacological information in alignment with domain experts' understanding of pharmacological groups. The performance of the DR pipeline was examined by quantitatively measuring the semantic similarity of drugs. A manual hallucination check was performed to assess the impact of the ontology-database combination on LLM hallucinations. The results were compared against registered clinical trials (RCTs) and medications proposed in meta-analyses to evaluate their predictive value.
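The contrast between a knowledge-informed prompt and a zero-shot control can be sketched as follows. This is a minimal illustration, not the study's pipeline: the `DrugRecord` container, the prompt wording, the 1 to 10 rating scale, and the placeholder drug, target, and ontology term are all assumptions made for the example, and no LLM is actually called.

```python
from dataclasses import dataclass, field


@dataclass
class DrugRecord:
    """Hypothetical container for what a knowledge graph query might return."""
    name: str
    targets: list = field(default_factory=list)
    go_terms: list = field(default_factory=list)


# Assumed task wording; the abstract does not give the actual prompt text.
QUESTION = (
    "Rate the potential of {name} for repurposing in Alzheimer's Disease "
    "on a scale from 1 (none) to 10 (high) and justify the rating briefly."
)


def zero_shot_prompt(drug_name):
    """Control prompt: the task question only, with no added knowledge."""
    return QUESTION.format(name=drug_name)


def ontology_prompt(record):
    """Prompt that prepends knowledge graph context to the same question."""
    context = [
        f"Drug: {record.name}",
        "Known targets: " + (", ".join(record.targets) or "none listed"),
        "Related ontology terms: " + (", ".join(record.go_terms) or "none listed"),
    ]
    return (
        "Context from the knowledge graph:\n"
        + "\n".join(context)
        + "\n\n"
        + QUESTION.format(name=record.name)
    )


# Placeholder values only; these are not real drugs, targets, or GO terms.
example = DrugRecord(
    name="ExampleDrug",
    targets=["TARGET_A"],
    go_terms=["GO:0000000 (placeholder term)"],
)
print(zero_shot_prompt(example.name))
print(ontology_prompt(example))
```

Both prompts end with the same question, so any difference in the model's answers can be attributed to the added context rather than to a change in the task wording.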

RESULTS

The embedding analysis showed that the vector representations of drugs in the LLM cluster in alignment with pharmacological groups. The output of the ontologically enhanced prompt was closer to domain expert proposals than that of a zero-shot control prompt without that knowledge. Responses to the ontology-based prompt also contained fewer hallucinations than responses to the zero-shot control prompt.
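One way to check whether embeddings cluster by pharmacological group is to compare the mean cosine similarity of drug pairs within a group against pairs from different groups. The sketch below assumes cosine similarity as the metric (the abstract does not name one) and uses made-up two-dimensional vectors and group labels; real embeddings would come from the LLM and have far more dimensions.

```python
import math
from itertools import combinations


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def _mean(values):
    # Return NaN instead of failing when there are no pairs of a given kind.
    return sum(values) / len(values) if values else float("nan")


def group_similarity(embeddings, groups):
    """Return (mean within-group, mean between-group) pairwise similarity."""
    within, between = [], []
    for d1, d2 in combinations(sorted(embeddings), 2):
        sim = cosine_similarity(embeddings[d1], embeddings[d2])
        (within if groups[d1] == groups[d2] else between).append(sim)
    return _mean(within), _mean(between)


# Toy vectors and labels, invented for illustration only.
toy_embeddings = {
    "drug_a": [1.0, 0.1],
    "drug_b": [0.9, 0.2],
    "drug_c": [0.1, 1.0],
    "drug_d": [0.2, 0.9],
}
toy_groups = {"drug_a": "X", "drug_b": "X", "drug_c": "Y", "drug_d": "Y"}

within, between = group_similarity(toy_embeddings, toy_groups)
print(f"within-group mean: {within:.3f}, between-group mean: {between:.3f}")
```

A within-group mean clearly above the between-group mean would be consistent with clustering by group; on real data, a permutation test over the group labels would help judge whether the gap exceeds chance.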

CONCLUSIONS

Ontology-augmented LLM interaction leads to fewer hallucinations and to output closer to expert assessment than a zero-shot control. We propose retrospective analyses of the highly rated drugs and their effect on AD patients as a starting point for further (prospective) research.

Version published to 10.1101/2024.12.04.626255 on bioRxiv
Dec 8, 2024

Related articles

Key Regulators of Alzheimer’s Disease: Network Biology and In-Silico Analysis with AChE and Glutamate Inhibitors

This article has 1 author:
1. Sayantan Das
This article has no evaluations. Latest version: Jan 21, 2026

The Impact of Structural Variation on Alzheimer’s Disease in the Alzheimer’s Disease Sequencing Project

This article has 13 authors:
1. Songmi Lee
2. Adam C English
3. Gina M Peloso
4. Joshua C Bis
5. Eric Boerwinkle
6. Seung Hoan Choi
7. Nancy L Heard-Costa
8. Honghuang Lin
9. Rui Xia
10. Sudha Seshadri
11. Anita L Destefano
12. Myriam Fornage
13. Fritz J Sedlazeck
This article has no evaluations. Latest version: Jan 13, 2026

Literature-Identified Serum miRNA Signatures for Cognitive Decline: Integrated Analysis and Machine-Learning Diagnostics in Alzheimer’s Disease

This article has 8 authors:
1. Zhiyan Chen
2. Yadi Liu
3. Huan Wang
4. Ke Liu
5. Yutong Li
6. Xiaohua Hu
7. Rui Li
8. Linjuan Sun
This article has no evaluations. Latest version: Jan 28, 2026
