Benchmarking Deep Learning Models for Real-Time Diabetic Retinal Blood Vessel Segmentation
Abstract
Background
Diabetic retinopathy (DR) remains one of the leading causes of preventable blindness worldwide. Accurate segmentation of retinal blood vessels is essential for early DR detection, as vascular abnormalities provide key markers of disease onset and progression. Although recent deep learning (DL) methods have achieved strong segmentation accuracy, limited attention has been given to benchmarking their inference efficiency, a critical factor for real-time clinical deployment in large-scale screening and teleophthalmology.

Objective
This study systematically benchmarks U-Net, U-Net++, and SegFormer on the DRIVE dataset to jointly evaluate segmentation accuracy and inference time, thereby addressing the gap between performance reporting and practical clinical applicability.

Methods
All images were resized to 256×256 pixels, normalized, and augmented with rotations, flips, and scaling. U-Net and U-Net++ were implemented as convolutional encoder–decoder architectures with skip connections, while SegFormer employed a hierarchical Transformer backbone with a lightweight MLP decoder. Models were trained for 60 epochs using class-balanced cross-entropy loss. Evaluation metrics included pixel accuracy, Dice similarity coefficient (DSC), and per-image inference time.

Results
U-Net++ achieved the highest Dice score (DSC = 0.850; accuracy = 0.9778), narrowly outperforming U-Net (DSC = 0.847; accuracy = 0.9783), while SegFormer was less accurate (DSC = 0.637; accuracy = 0.9106) but delivered the fastest inference (0.67 s per image), roughly 11× faster than U-Net++ and 4.8× faster than U-Net. Qualitative analysis confirmed that U-Net++ best preserved thin vessels and vascular continuity, whereas SegFormer tended to thicken vessel boundaries and omit fine branches.
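The evaluation metrics named in the Methods (pixel accuracy, Dice similarity coefficient, and per-image inference time) can be sketched as follows. This is a minimal NumPy illustration, not the study's actual code; the function names and the eps smoothing term are assumptions made for this example.

```python
import time
import numpy as np

def dice_coefficient(pred, target, eps=1e-7):
    """Dice similarity coefficient for binary vessel masks (1 = vessel).

    DSC = 2|P ∩ T| / (|P| + |T|); eps avoids division by zero on empty masks
    (an assumed convention, not specified in the paper).
    """
    pred = pred.astype(bool)
    target = target.astype(bool)
    intersection = np.logical_and(pred, target).sum()
    return (2.0 * intersection + eps) / (pred.sum() + target.sum() + eps)

def pixel_accuracy(pred, target):
    """Fraction of pixels whose predicted class matches the ground truth."""
    return float((pred.astype(bool) == target.astype(bool)).mean())

def timed_inference(model_fn, image):
    """Run one forward pass and return (mask, elapsed seconds per image)."""
    start = time.perf_counter()
    mask = model_fn(image)  # model_fn is any callable producing a binary mask
    return mask, time.perf_counter() - start
```

In practice the per-image timing would wrap the trained model's forward pass (and, for GPU inference, a device synchronization) rather than an arbitrary callable.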
Conclusion
U-Net++ demonstrated superior segmentation accuracy, SegFormer provided a substantial runtime advantage, and U-Net offered a balanced trade-off between quality and efficiency. These findings highlight that model selection for retinal vessel segmentation should depend on the specific priorities of deployment: precision, inference speed, or a compromise between the two.