Leveraging Multimodal Large Language Models to Extract Mechanistic Insights from Biomedical Visuals: A Case Study on COVID-19 and Neurodegenerative Diseases
Abstract
Background
The COVID-19 pandemic has intensified concerns about its long-term neurological impact, with growing evidence linking SARS-CoV-2 infection to neurodegenerative diseases (NDDs) such as Alzheimer's disease (AD) and Parkinson's disease (PD). Patients with these conditions not only face a higher risk of severe COVID-19 outcomes but may also undergo accelerated cognitive and motor decline following infection. Proposed mechanisms, ranging from neuroinflammation and blood–brain barrier disruption to abnormal protein aggregation, closely mirror core features of neurodegenerative pathology. Yet current knowledge is fragmented across text, figures, and pathway diagrams, hindering its integration into computational models capable of uncovering systemic patterns.
Results
To address this gap, we applied GPT-4 Omni (GPT-4o), a multimodal large language model, to extract mechanistic insights from biomedical figures. Over 10,000 images were retrieved through targeted searches on COVID-19 and neurodegeneration; after automated and manual filtering, a curated subset was analyzed. GPT-4o extracted biological relationships as semantic triples, which were grouped into six mechanistic categories (including microglial activation and blood–brain barrier disruption) using ontology-guided similarity and assembled into a Neo4j knowledge graph.
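The triple-to-graph step described above can be sketched as follows. This is a minimal illustration, not the authors' actual pipeline: the `Entity` node label, the predicate sanitizer, and the example triple are all assumptions; in practice the generated Cypher would be executed with the official `neo4j` Python driver via `session.run(query, **params)`.

```python
# Hypothetical sketch: turning one extracted semantic triple (subject,
# predicate, object) into a parameterized Cypher MERGE statement for a
# Neo4j knowledge graph. Schema details are illustrative assumptions.
import re

def triple_to_cypher(subject: str, predicate: str, obj: str):
    """Return a Cypher statement plus parameters for one (s, p, o) triple."""
    # Relationship types cannot be parameterized in Cypher, so the
    # predicate is sanitized and upper-cased into a safe identifier.
    rel = re.sub(r"\W+", "_", predicate.strip()).upper()
    query = (
        "MERGE (s:Entity {name: $subj}) "
        "MERGE (o:Entity {name: $obj}) "
        f"MERGE (s)-[:{rel}]->(o)"
    )
    return query, {"subj": subject, "obj": obj}

# Example triple of the kind GPT-4o might extract from a figure:
q, params = triple_to_cypher("SARS-CoV-2", "activates", "microglia")
```

MERGE (rather than CREATE) keeps the graph deduplicated when the same entity or relationship is extracted from multiple figures.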
Extraction accuracy was evaluated against a gold-standard dataset of expert-annotated images using BioBERT-based semantic matching. This evaluation also guided prompt tuning, threshold optimization, and hyperparameter assessment. The results demonstrate that GPT-4o recovers both established and novel mechanisms, yielding interpretable outputs that illuminate complex biological links between SARS-CoV-2 and neurodegeneration.
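The semantic-matching evaluation can be illustrated with a small sketch. In the real setup the vectors would be BioBERT embeddings of predicted and gold-standard triples (e.g. from a model such as `dmis-lab/biobert-v1.1`); here the embedding source and the 0.8 threshold are placeholder assumptions, and only the cosine-similarity decision rule is shown.

```python
# Minimal sketch of threshold-based semantic matching: a predicted triple
# counts as correct when its embedding is sufficiently similar to the
# embedding of a gold-standard triple. Embeddings and the threshold value
# are placeholder assumptions.
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def is_match(pred_vec, gold_vec, threshold=0.8):
    """Decision rule: accept the prediction if similarity clears the threshold."""
    return cosine(pred_vec, gold_vec) >= threshold
```

Sweeping the threshold against the expert annotations is one simple way to trade precision against recall during the optimization step mentioned above.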
Conclusions
This study showcases the potential of multimodal LLMs to mine biomedical visual data at scale. By complementing text mining and integrating figure-derived knowledge, our framework advances understanding of COVID-19–related neurodegeneration and supports future translational research.