Comparison of Automated White Matter Lesion Segmentation Approaches for Use in Large, Multi-Site Data Analyses in Parkinson’s Disease

Sarah Al-Bachari
So Hoon Yoon
Phoebe Emson
Shauna Angell
John Cain
Aswin Abraham
Azeem Chughtai
Edward Sizer
Edwin Barnes
Maryam Al-Wardy
Siddarth Kannan
Rohan Paul-Thaper
Joanna Bright
Conor Owens-Walton
Corey T. McMillan
Johannes C. Klein
Ludovica Griffanti
Sophia I. Thomopoulos
Neda Jahanshad
Paul M. Thompson
Ysbrand D. van der Werf
Chris Vriend
Laura M. Parkes
Hedley C.A. Emsley
Anette Schrag
Hamied A. Haroon

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background

Parkinson’s disease (PD) is the second most common neurodegenerative disorder. PD currently lacks effective disease-modifying treatments, likely due to its diverse clinical features and underlying neuropathology. The vascular role in PD is emerging, with vascular mechanisms increasingly implicated, yet the literature remains conflicted, motivating large-data analyses with greater statistical power. White matter lesions (WML) are an accepted imaging marker of small vessel disease. Accurate automated WML segmentation techniques are crucial for large-scale studies in PD due to the impracticality of manual segmentation for extensive datasets and to ensure consistency. Evaluation of the optimum approach in PD for large-scale analysis is lacking. This study aimed to evaluate various automated WML segmentation algorithms to determine the most accurate and reliable method, among those selected, for assessing WML for multi-site large data analysis in PD.

Methods

We assessed whole-brain volumetric T1-weighted and FLAIR images from 201 PD patients (mean age, 66.6 ± 7.86 years) and 64 healthy controls (HC; mean age, 66.3 ± 8.67) across three datasets: the Parkinson’s Progression Markers Initiative (PPMI), the University of Pennsylvania (UPenn) and the Montreal Neurological Institute Biobank: Clinical Biological Imaging and Genetic Repository (C-BIG). The sample included different scanners, imaging parameters and lesion loads, as would be expected for multi-site data. WML were manually segmented to provide the gold standard, and four freely available automated algorithms were evaluated: FSL’s BIANCA, FreeSurfer, SPM’s LST-LPA and U-Net-pgs using the performance metrics: Dice score, Hausdorff distance, recall, precision, F1 score, log absolute volume difference (LOGAVD) and intraclass correlation coefficient (ICC). Subgroup analyses were performed based on lesion load and lobar regions. The associations of data from these automated approaches with age, and with Fazekas and Wahlund visual rating scales, were assessed through partial correlation analysis.

Results

U-Net-pgs performed best overall, with the highest Dice score (PD: 0.46 ± 0.21; HC: 0.39 ± 0.21), recall (PD: 0.76 ± 0.25; HC: 0.62 ± 0.31), precision (PD: 0.49 ± 0.25; HC: 0.63 ± 0.27), F1 score (PD: 0.54 ± 0.22; HC: 0.56 ± 0.22) and ICC (PD: 0.965; HC: 0.967) and lowest Hausdorff distance (PD: 8.89 ± 3.96; HC: 6.33 ± 2.91). U-Net-pgs achieved the lowest LOGAVD in the PD group (0.31 ± 0.31) whereas BIANCA-LOO with a threshold of 0.9 was lowest in HC (0.27 ± 0.30). U-Net also showed superior performances in all lesion loads for PD and overall across various brain regions in both PD and HC.

Conclusion

Overall, U-Net-pgs emerged as the best performing automated method, of those we evaluated, for WML segmentation in PD and HC within a dataset collected with various scanner and image acquisition parameters. U-Net-pgs consistently outperformed other automated approaches across lesion loads and brain regions, for most metrics. The accuracy and reliability of U-Net-pgs make it a promising tool for large-scale analyses, facilitating future research investigating WML in PD.

Version published to 10.64898/2026.05.27.726795 on bioRxiv
May 30, 2026

Automatic segmentation of choroid plexus using deep learning across neurodegenerative diagnoses in the multi-site COMPASS-ND Study

This article has 6 authors:
1. Manpreet Singh
2. Fanta Dabo
3. Lianne J. Trigiani
4. David Araujo
5. Sridar Narayanan
6. AmanPreet Badhwar
This article has no evaluationsLatest version May 18, 2026
Deep Learning and Machine Learning for Early Detection of Alzheimer’s Disease: A Systematic Review and Meta-Analysis

This article has 1 author:
1. Saketh Machiraju
This article has no evaluationsLatest version May 22, 2026
Multidomain Analysis of Clinical Cognitive Assessments and Imaging Data in Alzheimer's Disease Accurately Predicts Disease Stage and Grade Independent of Amyloid and Tau

This article has 6 authors:
1. Juan Antonio Kim Hoo Chong Chie
2. Scott A. Persohn
3. Olivia R. Simcox
4. Paul Salama
5. Paul R. Territo
6. for the Alzheimer's Disease Neuroimaging Initiative
This article has no evaluationsLatest version Apr 13, 2026

Discuss this preprint

Listed in

Abstract

Background

Methods

Results

Conclusion

Article activity feed

Related articles

Automatic segmentation of choroid plexus using deep learning across neurodegenerative diagnoses in the multi-site COMPASS-ND Study

Deep Learning and Machine Learning for Early Detection of Alzheimer’s Disease: A Systematic Review and Meta-Analysis

Multidomain Analysis of Clinical Cognitive Assessments and Imaging Data in Alzheimer's Disease Accurately Predicts Disease Stage and Grade Independent of Amyloid and Tau