Generalizability, Interpretability, and Clinical Readiness of Deep Learning Methods for Alzheimer’s Disease: A Systematic Literature Review

Sohni Malik
Tanya Kumari
Sanjana Bamnawat
Sonali

Abstract

Early and accurate identification of Alzheimer’s disease is essential for prompt intervention and medical care. Recent work applies machine learning and deep learning techniques to neuroimaging, genetic, and clinical data to distinguish patients with Alzheimer’s disease from cognitively normal individuals and to predict progression from mild cognitive impairment. This systematic literature review examines studies published between 2019 and 2025 that used major datasets, including the Alzheimer’s Disease Neuroimaging Initiative, with inputs such as T1-weighted magnetic resonance imaging, electroencephalography, and multimodal neuroimaging data. The reviewed approaches encompass voxel-wise three-dimensional convolutional neural networks, hybrid convolutional neural network–transformer architectures, attention-based multimodal fusion frameworks, and conventional machine learning models such as Random Forest, Extreme Gradient Boosting, and Generalized Linear Models. Common preprocessing techniques include intensity correction, spatial normalization, and skull stripping, together with data augmentation through rotations, flips, and generative adversarial network–based oversampling. The primary evaluation metrics reported are accuracy, sensitivity, specificity, F1-score, and area under the receiver operating characteristic curve. Interpretability techniques such as Grad-CAM, Layer-Wise Relevance Propagation, and saliency maps have been increasingly adopted to visualize discriminative brain regions. Models that integrate hybrid architectures and multimodal information demonstrate enhanced robustness, although external validation remains limited. Persistent challenges include class imbalance, subject-level data leakage, small dataset sizes, and poor cross-cohort generalizability. Future research should emphasize larger, multi-center datasets, standardized evaluation protocols, and interpretable models that are clinically meaningful and translatable.
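The subject-level data leakage named above arises when scans from the same participant land in both the training and the test set, which can inflate reported accuracy. The following is a minimal sketch of one way to avoid it, not a method taken from the reviewed studies: scans are grouped by participant, and whole participants are assigned to one side of the split. The function name `subject_level_split`, the `sub-XX` and `visit-N` identifiers, and the 20% test fraction are illustrative assumptions.

```python
import random
from collections import defaultdict


def subject_level_split(scans, test_fraction=0.2, seed=0):
    """Split (subject_id, scan_id) pairs so that no subject is in both sets.

    Hypothetical helper for illustration; names and defaults are assumptions.
    """
    # Group every scan under the subject it belongs to.
    by_subject = defaultdict(list)
    for subject_id, scan_id in scans:
        by_subject[subject_id].append(scan_id)

    # Shuffle subjects (not scans) with a fixed seed for reproducibility.
    subjects = sorted(by_subject)
    random.Random(seed).shuffle(subjects)

    # Hold out a fraction of subjects, with at least one in the test set.
    n_test = max(1, round(len(subjects) * test_fraction))
    held_out = subjects[:n_test]
    kept = subjects[n_test:]

    train = [(s, scan) for s in kept for scan in by_subject[s]]
    test = [(s, scan) for s in held_out for scan in by_subject[s]]
    return train, test


# Made-up example data: 10 subjects with 3 visits each.
scans = [(f"sub-{i:02d}", f"visit-{v}") for i in range(10) for v in range(3)]
train, test = subject_level_split(scans)
train_subjects = {s for s, _ in train}
test_subjects = {s for s, _ in test}
```

Splitting on scans rather than subjects would let different visits of the same person appear on both sides. Libraries such as scikit-learn provide group-aware splitters (for example `GroupShuffleSplit`) that serve the same purpose.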

Version published to 10.21203/rs.3.rs-8071648/v1 on Research Square
Nov 11, 2025

Related articles

A Comparative Analysis of Deep Learning Models for Early Prediction of Alzheimer’s Disease using structural MRI
Authors: Rohit Kumar, Ankush Jain, Surendra Nagar
This article has no evaluations. Latest version: Jan 6, 2026

Alzheimer’s Classification Using Hybrid Deep Learning Models
Authors: Muhammad Hanzla, Abdul Rehman Shinwari, Muhammad Suleman Hiader, Muhammad OwnRaza, Syed Muhammd Hussain Shah
This article has no evaluations. Latest version: Feb 26, 2026

Integrating Structural Brain MRI and Clinical Phenotypes for Automated ADHD Diagnosis: A Multimodal Deep Learning Approach
Authors: Soumyadip Roy, Ratnakar Dash
This article has no evaluations. Latest version: Jan 8, 2026
