Automated Quantification of Decreased FAF in Stargardt Disease: Validation of a Novel Method Compared to Manual Grading Standards

Mohamed I. Ahmed
Hikmet Yucel
Rubbia Afridi
Thales A.C. de Guimarães
Isabel Sendino-Tenorio
Nam V. Nguyen
Kiran Romaisa
Sidrah Khan
Ufaq Khan
Syeda Sharaf un Nisa
Mauro Campigotto
Amir Hariri
Michel Michaelides
Hendrik P.N. Scholl
Nathan Mata
Quan D. Nguyen
Yasir J. Sepah

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Purpose

To evaluate the repeatability and reproducibility of a novel automated method compared with manual segmentation for measuring decreased autofluorescence (DAF) and definitely decreased autofluorescence (DDAF) in fundus autofluorescence (FAF) images of patients with Stargardt disease.

Design

Cross-sectional reproducibility and agreement study.

Participants

A total of 316 eyes from 158 genetically confirmed Stargardt patients were analyzed. For intra-grader repeatability, 114 FAF images were reassessed in a masked, repeated-measures design.

Methods

DAF and DDAF lesion areas were independently quantified by five certified graders using either manual delineation with Heidelberg RegionFinder or a threshold-based automated algorithm. Agreement and repeatability were assessed using intraclass correlation coefficients (ICC), standard error of measurement (SEM), minimal detectable change (MDC), Lin’s concordance correlation coefficient (CCC), Bland–Altman plots, and Passing–Bablok regression. Both raw and square-root-transformed lesion areas were evaluated.

Main Outcome Measures

Repeatability (intra-grader ICC, SEM, MDC), reproducibility (inter-grader ICC), and agreement (CCC, bias in regression analysis) between and within manual and automated methods.

Results

The automated method achieved excellent intra-grader repeatability for both DAF and DDAF (ICCs ≥0.988, SEM ≤0.71 mm², MDC ≤1.98 mm²), with minimal operator influence. Manual measurements showed variable repeatability (DAF ICCs 0.909–0.974; DDAF ICCs as low as 0.837), with square-root transformation reducing SEM and MDC. Inter-grader reproducibility was highest for automated methods (ICC = 0.989–0.992), whereas manual methods ranged from 0.764–0.939 (raw) and 0.867–0.922 (transformed). Cross-method agreement was strong (CCC = 0.91–0.96), though minor proportional and constant bias was observed in raw DAF data.

Conclusions

The automated approach provides near-perfect repeatability and high agreement with manual grading, offering a scalable, objective alternative for quantifying hypo-autofluorescent lesions in Stargardt disease. Manual methods are generally reliable but more variable, especially for DDAF, and benefit from square-root transformation.

Version published to 10.1101/2025.07.07.25330927 on medRxiv
Jul 7, 2025

Automated Videotopography for Dry Eye Diagnostics: Analytical Performance, Spatial Dynamics, and a Novel Multivariate Model

This article has 1 author:
1. Daniela Oehring
This article has no evaluationsLatest version Jan 8, 2026
3D Ultrasound Assessment of the Central Sulcus in Very Preterm Infants: Feasibility and Reproducibility of Opening Metrics Study

This article has 7 authors:
1. Carmen Rodríguez Barrios
2. Irene Gutiérrez Rosa
3. Simon Pedro Lubian Fernández
4. Isabel Benavente Fernández
5. Joaquin Pizarro
6. Manuel Lubian Gutiérrez
7. Simon Pedro Lubián López
This article has no evaluationsLatest version Dec 18, 2025
In healthy eyes, the accuracy and interchangeability of corneal tomography measurements obtained by Galilei G6 and Pentacom HR systems are compared

This article has 3 authors:
1. Marrwan Mohammed
2. Mustafa Tawfeeq Halboos
3. Noor Khamees Hamad
This article has no evaluationsLatest version Dec 9, 2025

Discuss this preprint

Listed in

Abstract

Purpose

Design

Participants

Methods

Main Outcome Measures

Results

Conclusions

Article activity feed

Related articles

Automated Videotopography for Dry Eye Diagnostics: Analytical Performance, Spatial Dynamics, and a Novel Multivariate Model

3D Ultrasound Assessment of the Central Sulcus in Very Preterm Infants: Feasibility and Reproducibility of Opening Metrics Study

In healthy eyes, the accuracy and interchangeability of corneal tomography measurements obtained by Galilei G6 and Pentacom HR systems are compared