Deep Learning for Diabetic Retinopathy Detection: A Review of Multimodal Data Fusion Approaches
Abstract
Diabetic retinopathy (DR) is a diabetes-induced eye disease that affects the blood vessels of the retina, and a lack of timely DR detection can result in vision loss. Although deep learning (DL) has successfully analyzed single-modality medical data, DR diagnosis often requires interpreting diverse information such as retinal imaging and clinical data. Multimodal data fusion has the potential to combine robust and complementary information from these sources for more accurate diagnostic decisions. However, DR detection using deep learning-based multimodal fusion remains challenging and underdeveloped. This review investigates recent advances in applying DL techniques to multimodal DR detection, focusing on model architectures, modality combinations, fusion strategies, and performance metrics. Among these architectures, convolutional neural networks (CNNs) are the most popular, and the fusion of fundus images with optical coherence tomography (OCT) or electronic health record (EHR) data is the most common pairing. Early and joint fusion strategies dominate, while model performance is typically assessed using accuracy, AUC, sensitivity, and F1-score. Despite promising progress, the field still faces challenges including modality heterogeneity, a lack of standardized multimodal datasets, and limited model interpretability. Emerging trends point toward hybrid architectures, attention mechanisms, and self-supervised learning as potential solutions. This review highlights current developments and outlines future directions to support the design of scalable, generalizable, and clinically applicable multimodal DL systems for DR detection.
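The distinction between the early and joint fusion strategies mentioned above can be illustrated with a minimal sketch. This is not any specific model from the reviewed literature; the feature dimensions and linear "encoders" are hypothetical stand-ins (plain NumPy in place of a CNN branch for fundus images and an MLP branch for clinical records). Early fusion concatenates the raw per-modality features before a single predictor, while joint fusion first encodes each modality separately and fuses the learned representations.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical inputs: a fundus-image feature vector and a clinical-record vector.
fundus_features = rng.normal(size=512)   # e.g. embedding of a fundus image
clinical_features = rng.normal(size=16)  # e.g. age, HbA1c, diabetes duration, ...

# Early fusion: concatenate raw features, then apply one classifier head.
early_input = np.concatenate([fundus_features, clinical_features])  # shape (528,)
w_early = rng.normal(size=early_input.shape[0])
early_score = 1 / (1 + np.exp(-(early_input @ w_early)))  # sigmoid DR probability

# Joint fusion: encode each modality separately, then fuse the latent vectors.
w_img = rng.normal(size=(512, 32))   # stand-in for a CNN image encoder
w_clin = rng.normal(size=(16, 32))   # stand-in for an MLP clinical encoder
z_img = np.tanh(fundus_features @ w_img)
z_clin = np.tanh(clinical_features @ w_clin)
joint_input = np.concatenate([z_img, z_clin])  # shape (64,)
w_joint = rng.normal(size=joint_input.shape[0])
joint_score = 1 / (1 + np.exp(-(joint_input @ w_joint)))
```

In practice the joint variant is trained end to end, so the two encoders learn modality-specific representations whose fused dimensionality (here 64) is far smaller than the early-fusion input (here 528), which is one reason joint fusion handles heterogeneous modalities more gracefully.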