Automated Detection of Early-Stage Dementia Using Large Language Models: A Comparative Study on Narrative Speech

Abstract

The growing global burden of dementia underscores the urgent need for scalable, objective screening tools. While traditional diagnostic methods rely on subjective assessments, advances in natural language processing offer promising alternatives. In this study, we compare two classes of language models, encoder-based pretrained language models (PLMs) and autoregressive large language models (LLMs), for detecting cognitive impairment from narrative speech. Using the DementiaBank Pitt Corpus and the widely used Cookie Theft picture description task, we evaluate BERT as a representative PLM alongside GPT-2, GPT-3.5 Turbo, GPT-4, and LLaMA-2 as LLMs. Although all of these models are pretrained, we distinguish PLMs from LLMs by their architectural differences and training paradigms. Our findings show that BERT outperforms all other models, achieving 86% sensitivity and 95% specificity. LLaMA-2 follows closely, while GPT-4 and GPT-3.5 Turbo underperform on this structured classification task. Interestingly, the LLMs demonstrate complementary strengths in capturing narrative richness and subtler linguistic features. These results suggest that hybrid modeling approaches may offer enhanced performance and interpretability. Our study highlights the potential of language models as digital biomarkers and lays the groundwork for scalable, AI-powered tools to support early dementia screening in clinical practice.
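For readers interpreting the reported 86% sensitivity and 95% specificity, the following minimal sketch shows how these two metrics are computed from binary predictions (1 = dementia, 0 = control). The labels and predictions here are illustrative placeholders, not data from the study:

```python
def sensitivity_specificity(y_true, y_pred):
    """Compute sensitivity (true positive rate) and specificity
    (true negative rate) for binary labels: 1 = dementia, 0 = control."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return sensitivity, specificity

# Illustrative (made-up) ground truth and model predictions:
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 1]
sens, spec = sensitivity_specificity(y_true, y_pred)
# sens = 0.75, spec = 0.75
```

In a screening context, sensitivity reflects how many true dementia cases the model catches, while specificity reflects how many healthy controls it correctly clears, so BERT's 95% specificity implies relatively few false alarms.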
