Multimodal Deep Learning for Longitudinal Prediction of Glaucoma Progression Using Sequential RNFL, Visual Field, and Clinical Data
Abstract
Forecasting glaucoma progression remains a major challenge in preventing irreversible vision loss. We developed and validated a multimodal, longitudinal deep learning framework to predict future progression using a large retrospective cohort of 10,864 patients from Mass Eye and Ear. The model integrates sequential structural (OCT RNFL scans), functional (visual-field maps), and clinical data from a two-year observation window to forecast progression over the subsequent two- to four-year horizon. Four backbone architectures (ConvNeXt-V2, ViT, MobileNet-V2, EfficientNet-B0) were coupled with a bidirectional LSTM to capture temporal dynamics. The ConvNeXt-V2-based model achieved 0.97 AUC and 0.94–0.96 accuracy, outperforming the other backbones, with robust performance across sex and race subgroups and only modest attenuation in patients older than 70 years. Saliency maps localized to clinically relevant arcuate bundles, supporting biological plausibility. By effectively fusing multimodal data over time, this framework enables accurate, interpretable, and equitable long-horizon risk stratification, advancing personalized glaucoma management.
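To make the fusion scheme described above concrete, the sketch below illustrates the general pattern of concatenating per-visit multimodal features and passing the sequence through a bidirectional recurrence to a progression-probability readout. It is a minimal, self-contained toy in plain numpy, not the authors' implementation: all dimensions, the random parameters, and the simple tanh-RNN cells (standing in for the paper's backbone + bidirectional LSTM) are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical feature sizes (not from the paper): per-visit OCT-RNFL,
# visual-field, and clinical embeddings, hidden size, and visit count.
D_OCT, D_VF, D_CLIN, H = 8, 6, 4, 16
T = 4  # visits within the two-year observation window
D = D_OCT + D_VF + D_CLIN


def rnn_pass(xs, Wx, Wh, b):
    """Run a simple tanh RNN over a sequence; return the final hidden state.

    Stand-in for one direction of the paper's bidirectional LSTM.
    """
    h = np.zeros(H)
    for x in xs:
        h = np.tanh(Wx @ x + Wh @ h + b)
    return h


def predict_progression(oct_seq, vf_seq, clin_seq, params):
    # 1) Fuse modalities per visit by concatenation (one plausible choice).
    fused = [np.concatenate([o, v, c])
             for o, v, c in zip(oct_seq, vf_seq, clin_seq)]
    # 2) Bidirectional recurrence: a forward pass and a reversed pass.
    h_fwd = rnn_pass(fused, *params["fwd"])
    h_bwd = rnn_pass(fused[::-1], *params["bwd"])
    # 3) Sigmoid readout on the concatenated states -> progression probability.
    z = params["w_out"] @ np.concatenate([h_fwd, h_bwd]) + params["b_out"]
    return 1.0 / (1.0 + np.exp(-z))


# Randomly initialized (untrained) parameters, purely for illustration.
params = {
    "fwd": (rng.normal(size=(H, D)) * 0.1, rng.normal(size=(H, H)) * 0.1, np.zeros(H)),
    "bwd": (rng.normal(size=(H, D)) * 0.1, rng.normal(size=(H, H)) * 0.1, np.zeros(H)),
    "w_out": rng.normal(size=2 * H) * 0.1,
    "b_out": 0.0,
}

# Synthetic per-visit embeddings standing in for backbone outputs.
oct_seq = [rng.normal(size=D_OCT) for _ in range(T)]
vf_seq = [rng.normal(size=D_VF) for _ in range(T)]
clin_seq = [rng.normal(size=D_CLIN) for _ in range(T)]

p = predict_progression(oct_seq, vf_seq, clin_seq, params)
print(f"predicted progression probability: {p:.3f}")  # some value in (0, 1)
```

In the actual framework, the per-visit image embeddings would come from a trained backbone (e.g., ConvNeXt-V2) and the recurrence would be a learned bidirectional LSTM; this sketch only shows the data flow of fuse-per-visit, then aggregate-over-time.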