Identifying Mild Cognitive Impairment Using Decision Tree–Based Machine Learning with Physical, Functional, and Psychosocial Measures in Community-Dwelling Older Adults: Evidence from the Northern Japanese ORANGE Registry

Ayuto Kodama
Takako Ohnuma
Kana Sasaki
Kaoru Sugawara
Nobuhiro Fujiyama
Youko Umetsu
Tsuyoshi Ono
Hidetaka Ota


Abstract

Background: Mild cognitive impairment (MCI) is common in later life and represents a key target for early identification and prevention. Scalable, non-imaging approaches using routinely collected community health data may support risk stratification and guide follow-up assessments.

Methods: We analyzed community health check-up data from Akita Prefecture, Japan. The outcome was binary MCI classification (0 = non-MCI; 1 = MCI). Candidate predictors included demographics, medical history, physical function, and psychosocial measures. Data were split into training (70%) and test (30%) sets using stratification. We trained a decision tree, random forest, and gradient-boosted trees with five-fold cross-validation and hyperparameter tuning. Model discrimination and classification metrics were evaluated on the independent test set. Permutation importance was computed for the best-performing model, and a shallow decision tree was derived using the top-ranked predictors for interpretability.

Results: The analytic sample included 2,650 participants (non-MCI: n = 1,893; MCI: n = 757). On the test set, the random forest model achieved the highest ROC AUC (0.719). At a 0.5 threshold, accuracy was 0.753, with sensitivity 0.254 and specificity 0.952. Using the Youden threshold (approximately 0.256) increased sensitivity to 0.794 while reducing specificity to 0.537. Permutation importance ranked GDS-15 score, osteoporosis, social frailty, and living alone among the top predictors.

Conclusions: A random forest model demonstrated moderate discrimination for classifying MCI using routinely collected community health variables. The choice of operating threshold had a substantial impact on the sensitivity–specificity trade-off, underscoring the importance of clearly defining intended use and decision thresholds. External validation and prospective evaluation are required before clinical deployment.
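The threshold effect reported in the abstract can be illustrated with a minimal sketch of Youden's J statistic (J = sensitivity + specificity - 1). This is not the authors' code: the function names `sens_spec` and `youden_threshold` are hypothetical, the toy labels and scores are invented for illustration, and only the four sensitivity and specificity values are taken from the abstract.

```python
from typing import List, Optional, Tuple


def sens_spec(y_true: List[int], scores: List[float], threshold: float) -> Tuple[float, float]:
    """Sensitivity and specificity when score >= threshold is classified as MCI (1).

    Assumes y_true contains at least one positive and one negative label.
    """
    tp = sum(1 for y, s in zip(y_true, scores) if y == 1 and s >= threshold)
    fn = sum(1 for y, s in zip(y_true, scores) if y == 1 and s < threshold)
    tn = sum(1 for y, s in zip(y_true, scores) if y == 0 and s < threshold)
    fp = sum(1 for y, s in zip(y_true, scores) if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)


def youden_threshold(y_true: List[int], scores: List[float]) -> Tuple[Optional[float], float]:
    """Return the observed score that maximizes J = sensitivity + specificity - 1."""
    best_t, best_j = None, -1.0
    for t in sorted(set(scores)):
        sens, spec = sens_spec(y_true, scores, t)
        j = sens + spec - 1.0
        if j > best_j:  # ties keep the lowest threshold
            best_t, best_j = t, j
    return best_t, best_j


# Youden's J recomputed from the two operating points reported in the abstract.
j_default = 0.254 + 0.952 - 1.0  # threshold 0.5
j_youden = 0.794 + 0.537 - 1.0   # threshold of approximately 0.256

# Toy, made-up labels and scores (NOT study data) to exercise the search.
toy_y = [0, 0, 0, 0, 1, 0, 1, 1]
toy_scores = [0.05, 0.10, 0.20, 0.30, 0.35, 0.40, 0.60, 0.80]
toy_t, toy_j = youden_threshold(toy_y, toy_scores)

print(f"J at 0.5: {j_default:.3f}; J at Youden threshold: {j_youden:.3f}")
print(f"Toy data: best threshold {toy_t}, J = {toy_j:.3f}")
```

On the reported figures, J rises from about 0.206 at the 0.5 threshold to about 0.331 at the Youden threshold: the lower cutoff gives up specificity for a larger gain in sensitivity, which may suit screening better than confirmation.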

Version published to 10.21203/rs.3.rs-8729100/v1 on Research Square
Mar 11, 2026

Related articles

Development and Validation of a 7-Year Risk Prediction Model for Progression from Subjective Cognitive Decline to Mild Cognitive Impairment: A Prospective Cohort Study Using CHARLS Data

This article has 8 authors:
1. Tianjiao Li
2. Lingxuan Li
3. Hongyang Xie
4. Yuwei Zhang
5. Xiujuan Bai
6. Gang Yu
7. Xuan Z
8. Bo Sun
This article has no evaluations. Latest version: Feb 17, 2026

RETRACTED: Assessment of Disability in the Elderly Based on Multimodal Health Care Data: A Machine Learning–Driven Approach

This article has 3 authors:
1. guangpeng chen
2. Shunyu Wang
3. li luo
This article has no evaluations. Latest version: Mar 24, 2026

Development and Validation of a Risk Prediction Model for Oral Frailty in Older Adults With Mild Cognitive Impairment: A Cross-Sectional Study

This article has 4 authors:
1. Yangkun Zhang
2. Jingjing Wang
3. Xiaoming Liu
4. Jing Li
This article has no evaluations. Latest version: Mar 28, 2026
