Automatic detection of simulated artifacts on T1w magnetic resonance images: comparing performance of different QC strategies

Janine Hendriks
Michelle G. Jansen
Richard Joules
Óscar Peña-Nogales
Paulo R. Rodrigues
Frederik Barkhof
Anouk Schrantee
Henk J.M.M. Mutsaerts
the Alzheimer’s Disease Neuroimaging Initiative

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The reliability of MRI-derived measures critically depends on image quality. Poor-quality scans can obscure anatomical detail and compromise the accuracy of automated image analysis, underscoring the need for robust quality control (QC) procedures. Automated QC offers scalability for large neuroimaging datasets, yet the comparative performance of different approaches for detecting specific artifact types remains poorly understood.

We systematically compared rule-based (RB), classical machine learning (ML), and deep learning (DL) QC algorithms using 1,000 high-quality T1w scans. Four artifact types, blurring, ghosting, motion, and noise were synthetically introduced across ten severity levels using TorchIO, yielding 40,000 degraded images. Visual QC of a subset confirmed strong inter-rater reliability (Krippendorff’s α=0.82, mean Spearman’s ρ=0.87). RB and ML models used 62 image quality metrics (IQMs) from MRIQC, whereas DL models were trained directly on minimally preprocessed images. Models were trained with participant-level five-fold cross-validation and tested on an independent dataset.

DL models achieved the highest overall performance across artifact types (Youden’s Index=0.83–0.97). RB and ML performed comparably at high artifact severities (YI≥0.75) but showed limited sensitivity to subtle ghosting and noise (YI≤0.15). Feature analysis indicated that RB relied primarily on normative metrics, whereas ML flexibly adapted feature use by artifact type and severity.

These findings highlight DL’s superior generalizability for detecting subtle artifacts and provide practical guidance for selecting QC strategies in large-scale neuroimaging pipelines, where reliable QC is essential for maintaining statistical power and reproducibility.

Version published to 10.1101/2025.10.31.25339144 on medRxiv
Nov 2, 2025

Influence of deep learning image reconstruction and adaptive statistical iterative reconstruction-V on automated Alberta Stroke Program Early CT Score- evaluation

This article has 9 authors:
1. Estelle Akl
2. Daniel Cantré
3. Matthias Lütgens
4. Wiebke Hermann
5. Sönke Langner
6. Marc-André Weber
7. Ann-Christin Klemenz
8. Felix G. Meinel
9. Ebba Beller
This article has no evaluationsLatest version Nov 13, 2025
DeepFLAIR*: Improving Multiple Sclerosis Diagnostic Imaging Workflow Using Deep Learning

This article has 10 authors:
1. Inga Baburyan
2. Bryan Quah
3. Sreekanth Madhusoodhanan Nair
4. Omar Al-Louzi
5. Marcel Maya
6. Marwa Kaisey
7. Nancy Sicotte
8. Jason H Moore
9. Daniel Ontaneda
10. Pascal Sati
This article has no evaluationsLatest version Nov 27, 2025
Automated segmentation of multiple sclerosis lesions using 7 Tesla MRI multi-contrast data

This article has 10 authors:
1. Anna Petrova
2. Assunta Dal-Bianco
3. Rebeka Rumbak
4. Lukas Haider
5. Eva Niess
6. Wolfgang Bogner
7. Günther Grabner
8. Thomas Berger
9. Paulus Rommer
10. Stanislav Motyka
This article has no evaluationsLatest version Oct 1, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Influence of deep learning image reconstruction and adaptive statistical iterative reconstruction-V on automated Alberta Stroke Program Early CT Score- evaluation

DeepFLAIR*: Improving Multiple Sclerosis Diagnostic Imaging Workflow Using Deep Learning

Automated segmentation of multiple sclerosis lesions using 7 Tesla MRI multi-contrast data