Building and validating a predictive model for stroke risk in Chinese community-dwelling patients with chronic obstructive pulmonary disease using machine learning methods

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Background

The occurrence of stroke in patients with chronic obstructive pulmonary disease (COPD) can have potentially devastating consequences; however, there is still a lack of predictive models that accurately predict the risk of stroke in community-based COPD patients in China. The aim of this study was to construct a novel predictive model that accurately predicts the predictive model for the risk of stroke in community-based COPD patients by applying a machine learning methodology within the Chinese community.

Methods

The clinical data of 809 Community COPD patients were analyzed by using the 2020 China Health and Retirement Longitudinal Study (CHARLS) database. The least absolute shrinkage and selection operator (LASSO) and multivariate logistic regression were used to analyze predictors. Multiple machine learning (ML) classification models are integrated to analyze and identify the optimal model, and Shapley Additive exPlanations (SHAP) interpretation was developed for personalized risk assessment.

Results

The following six variables:Heart_disease,Hyperlipidemia,Hypertension,ADL_score, Cesd_score and Parkinson are predictors of stroke in community-based COPD patients. Logistic classification model was the optimal model, test set area under curve (AUC) (95% confidence interval, CI):0.913 (0.835-0.992), accuracy: 0.823, sensitivity: 0.818, and specificity: 0.823.

Conclusions

The model constructed in this study has relatively reliable predictive performance, which helps clinical doctors identify high-risk populations of community COPD patients prone to stroke at an early stage.

Article activity feed