Uncovering Latent Cognitive Subgroups via MMSE Scores in a Bangladeshi Dementia Cohort: An integration of Data Coresets and Ranked Set Sampling in Gaussian Mixture Modeling

Sharmin Akther
Azizur Rahman
Rumana Rois

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background Alzheimer's disease (AD) has an incidence of 57 million people worldwide, with it being expected to increase further in low and middle-income nations (LMICs), where Bangladesh is reported to have a prevalence of 8.0% of dementia among persons aged ≥ 60 years and older. Although widely used in cognitive screening, the Mini-Mental State Examination (MMSE) exhibits variable sensitivity (23–76% for mild cognitive impairment) and significant age-education effects, which preclude the use of fixed cutoff strategies to define cognitive subgroups. New evidence suggests that AD has heterogeneous subtypes with different cognitive pathways, other than homogeneous disease progression. Methods This article used Gaussian Mixture Models (GMMs) to determine latent cognitive subgroups among a Bangladeshi AD cohort (N = 663) of the National Institute of Neurosciences and Hospital, Dhaka (January 2019-August 2024). The scores with MMSE were divided into severe dementia (MMSE 0–10), moderate dementia (MMSE 11–20), mild cognitive impairment (MMSE 21–25), and normal cognition (MMSE > 25). Two computationally efficient subsampling methods were compared: Ranked Set Sampling (RSS; N = 320) and coreset construction (N = 198), on the overlap minimization and distributional fidelity. Four-component GMMs were estimated using the Expectation-Maximization algorithm on the entire dataset and the two subsamples. The fitted models were evaluated based on log-likelihood values, convergence behavior (ε = 10⁻⁸), and pairwise overlap percentages between components. Results The MMSE Score had significant pair-wise overlap between the neighboring severity elements (46.5% severe-moderate) as an expression of linear cognitive decline. RSS (N = 320) focused on mild impairment (58.28%), with a low degree of overlap (1.2%), but with a serious lack of severe cases (7.32%). The balanced severity representation (severe: 11.61% of original data; 30%, which contains coreset sampling) and moderate overlaps (18.23%) and weighted mean (19.79) were closest to the population mean (19.74). Conclusions Coreset sampling was better than the Ranked Set Sampling because it maintained the severity representation balance in all the levels of cognitive impairment with only 30% of the initial data. The scalable method facilitates effective CD (cognitive disease) subtype identification in resource-limited environments, supporting enhanced clinical trial design and individualized risk assessment.

Version published to 10.21203/rs.3.rs-8996904/v1 on Research Square
Mar 8, 2026

Detection of Cognitive Decline by the Montreal Cognitive Assessment in a Latin America Multi-Cohort Study: Normative scores regarding age and educational level

This article has 17 authors:
1. Alejandra Lázaro-Figueroa
2. Andrés A Morales-de-Arcia
3. Paula Reyes-Pérez
4. Fernanda Bravo-García
5. Alejandra Schäfer
6. Alejandra Medina-Rivera
7. Juan Esquivias
8. Sarael Alcauter
9. Miguel E Rentería
10. Juan Fernandez-Ruiz
11. Maira Rozenfeld Olchik
12. Artur F. S. Schuh
13. Matias López-Razquin
14. Jorge Orozco
15. Ignacio F. Mata
16. Beatriz Muñoz Ospina
17. Alejandra E. Ruiz-Contreras
This article has no evaluationsLatest version Feb 23, 2026
Cognitive Impairment in Stable Schizophrenia: Insights from a Large-Scale Chinese Cohort

This article has 11 authors:
1. Chuan Shi
2. Yuanyuan Zhou
3. Yumei Cai
4. Bing Guo
5. Qiyao Yang
6. Jun Cai
7. Vilhelm Bohr
8. Gong Chen
9. Chi Zhang
10. Zhiquan Li
11. Xin Yu
This article has no evaluationsLatest version Feb 13, 2026
Identifying Mild Cognitive Impairment Using Decision Tree–Based Machine Learning with Physical, Functional, and Psychosocial Measures in Community-Dwelling Older Adults: Evidence from the Northern Japanese ORANGE Registry

This article has 8 authors:
1. Ayuto Kodama
2. Takako Ohnuma
3. Kana Sasaki
4. Kaoru Sugawara
5. Nobuhiro Fujiyama
6. Youko Umetsu
7. Tsuyoshi Ono
8. Hidetaka Ota
This article has no evaluationsLatest version Mar 11, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Detection of Cognitive Decline by the Montreal Cognitive Assessment in a Latin America Multi-Cohort Study: Normative scores regarding age and educational level

Cognitive Impairment in Stable Schizophrenia: Insights from a Large-Scale Chinese Cohort

Identifying Mild Cognitive Impairment Using Decision Tree–Based Machine Learning with Physical, Functional, and Psychosocial Measures in Community-Dwelling Older Adults: Evidence from the Northern Japanese ORANGE Registry