Developing and Evaluating Deep Learning Approaches for Visual Field Denoising in Glaucoma

Julia Seungjoo Baek
Anagha Lokhande
Didier Neuenschwander
Min Shi
Mengyu Wang


Abstract

Purpose

To compare the efficacy of nine distinct visual field (VF) denoising artificial intelligence (AI) methods and to evaluate a pathology-aware AI strategy designed to discourage over-correction of glaucomatous defects.

Design

Retrospective study.

Participants

87,940 paired visual field (VF) and optical coherence tomography (OCT) samples from a tertiary academic center.

Methods

Denoising models were trained on a separate VF-only dataset and evaluated on an independent structure-function dataset of paired VF-OCT samples. We implemented and evaluated nine distinct VF denoising strategies representing three broad categories: baseline measurements, self-supervised and image restoration models (including Noise2Noise, Noise2Void, and NAFNet), and latent variable compression-based models (autoencoders and variational autoencoders). All models were designed to reconstruct VF sensitivity maps. We then predicted retinal nerve fiber layer thickness (RNFLT) maps from the denoised VFs using a fixed, independently trained VF-to-RNFLT prediction model.
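
The evaluation protocol described above (vary the denoiser, hold the VF-to-RNFLT predictor fixed) can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the preprint: `evaluate_denoiser`, the flattened-list representation of a VF, the clipping range, and the toy predictor are stand-ins chosen only to make the control flow concrete.

```python
from typing import Callable, List, Sequence

# Assumption: a VF is represented as a flat list of sensitivities in dB.
Field = List[float]


def evaluate_denoiser(
    denoise: Callable[[Field], Field],
    predict_rnflt: Callable[[Field], List[float]],
    vf_samples: Sequence[Field],
) -> List[List[float]]:
    """Denoise each VF, then pass it through the same fixed predictor.

    Because predict_rnflt is never retrained, any change in the predicted
    RNFLT values is attributable to the denoising step alone.
    """
    return [predict_rnflt(denoise(vf)) for vf in vf_samples]


def identity_denoiser(vf: Field) -> Field:
    """Raw VF baseline: return the field unchanged."""
    return list(vf)


def clip_denoiser(vf: Field) -> Field:
    """Toy 'denoiser' (illustrative only): clip to an assumed 0-35 dB range."""
    return [min(max(v, 0.0), 35.0) for v in vf]


def toy_predictor(vf: Field) -> List[float]:
    """Toy stand-in for the fixed predictor: one value from the mean sensitivity.

    The coefficients are arbitrary and carry no clinical meaning.
    """
    mean_sensitivity = sum(vf) / len(vf)
    return [50.0 + 2.0 * mean_sensitivity]


if __name__ == "__main__":
    samples = [[30.0, 28.0], [40.0, -2.0]]
    print(evaluate_denoiser(identity_denoiser, toy_predictor, samples))
    print(evaluate_denoiser(clip_denoiser, toy_predictor, samples))
```

In a real setting the two callables would wrap trained networks; the point of the sketch is only that every denoising strategy is scored through the same downstream model.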

Main Outcome Measures

Predicted VF and RNFLT maps and resultant evaluation metrics.

Results

The raw VF baseline achieved a global R² of 0.5468 and MAE of 16.83 μm. Restoration-based models maintained or slightly improved concordance, with the pathology-aware NAFNet achieving the highest global R² (0.5485) and a comparable MAE (16.82 μm). In contrast, compression-based models degraded concordance, with CNN-VAE showing a significant reduction (R² ≈ 0.50).
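
For readers unfamiliar with the two reported metrics, the following sketch shows one common way to compute them. It assumes that "global R²" means the coefficient of determination pooled over all RNFLT values and that MAE is the mean absolute difference in μm; the abstract does not state the exact definitions, and the numbers in the usage example are invented for illustration.

```python
from typing import Sequence


def mean_absolute_error(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Mean absolute difference between measured and predicted values."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def r_squared(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when y_true has zero variance")
    return 1.0 - ss_res / ss_tot


if __name__ == "__main__":
    # Invented RNFLT values in micrometres, for illustration only.
    measured = [100.0, 80.0, 60.0, 40.0]
    predicted = [90.0, 80.0, 70.0, 40.0]
    print(mean_absolute_error(measured, predicted))
    print(r_squared(measured, predicted))
```

Under this definition, a denoiser that preserves the structure-function signal leaves both metrics close to the raw-VF baseline, while one that smooths away true defects lowers R² and raises MAE.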

In severe glaucoma, concordance decreased across all methods; however, compression architectures exhibited disproportionately greater degradation compared with restoration-based approaches.

Conclusions

We present a comparative benchmark of AI-based VF denoising strategies paired with structure–function evaluation. While restoration-based models can reduce variability without loss of biological signal, latent compression risks attenuating clinically meaningful defects. Visually smoother fields are not necessarily more biologically accurate.

Version published to medRxiv on Jun 1, 2026 (DOI: 10.64898/2026.05.29.26354019)
