Development and External Validation of an Interpretable Machine-Learning Model for HFpEF Comorbidity Risk in COPD Patients

Jing Cao
Boyu Kang
Shuangshuang Li
Yan Lei
Dan Liu
Chunmei Li
Wei Guo
Binghua Zhang
Xiaoyan Xie

Abstract

BACKGROUND Chronic Obstructive Pulmonary Disease (COPD) and Heart Failure with preserved Ejection Fraction (HFpEF) frequently coexist, leading to increased hospitalization, mortality, and healthcare burden. Early identification of HFpEF risk in COPD patients is critical for timely intervention.

AIM To develop and validate an interpretable machine learning (ML) model for predicting HFpEF risk in COPD patients and to identify key predictors using explainable artificial intelligence techniques.

METHODS This retrospective study analyzed 1,550 COPD patients, divided into COPD-only and COPD-HFpEF groups. Feature selection was performed using LASSO regression, logistic regression, and Boruta random forest. Ten ML models were developed and evaluated on an internal test set, with the best model further validated on an external cohort (n = 69). Model interpretability was assessed using SHapley Additive exPlanations (SHAP).

RESULTS Nine predictors were consistently selected: NT-proBNP, red blood cell count, fibrinogen, cholesterol, arterial PaO₂, inspiratory capacity (IC), IC% predicted, late diastolic mitral inflow velocity, and the COPD Assessment Test score. The XGBoost model achieved the best performance, with an AUC of 0.898 (95% CI: 0.867–0.929) on the internal test set and 0.851 (95% CI: 0.753–0.948) on external validation. SHAP analysis identified NT-proBNP as the most influential predictor.

CONCLUSION The developed XGBoost model accurately predicts HFpEF risk in COPD patients and offers clinically interpretable insights into key risk factors, supporting early identification and stratified management.
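The abstract reports each AUC with a 95% confidence interval but does not say how the intervals were obtained. As a minimal sketch only, assuming a percentile bootstrap (one common choice, not necessarily the authors' method), the following shows how an AUC and its interval can be computed from predicted risks. The function names `auc` and `bootstrap_auc_ci` and the toy labels and scores are hypothetical and are not study data.

```python
import random


def auc(labels, scores):
    """AUC as the probability that a random positive case scores higher
    than a random negative case (ties count as 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def bootstrap_auc_ci(labels, scores, n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the AUC (assumed method, see lead-in)."""
    rng = random.Random(seed)
    n = len(labels)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if len(set(ys)) < 2:
            continue  # resample contains a single class; AUC is undefined
        stats.append(auc(ys, [scores[i] for i in idx]))
    stats.sort()
    lower = stats[int((alpha / 2) * n_boot)]
    upper = stats[min(n_boot - 1, int((1 - alpha / 2) * n_boot))]
    return lower, upper


# Hypothetical toy data: 1 = COPD with HFpEF, 0 = COPD only.
toy_labels = [0, 0, 0, 0, 1, 1, 1, 1]
toy_scores = [0.10, 0.40, 0.35, 0.80, 0.45, 0.70, 0.90, 0.60]

point_estimate = auc(toy_labels, toy_scores)
ci_lower, ci_upper = bootstrap_auc_ci(toy_labels, toy_scores)
print(f"AUC = {point_estimate:.3f} (95% CI: {ci_lower:.3f}-{ci_upper:.3f})")
```

With a small cohort, such as the external validation set of 69 patients, resampling yields a wider interval than with a larger test set, which is consistent with the wider interval reported for external validation.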

Version published to 10.21203/rs.3.rs-7911218/v1 on Research Square
Nov 10, 2025

Frailty prediction in heart failure patients with acute infections: the potential role of thiazide diuretics?

This article has 8 authors:
1. Tinghui Huang
2. Shuyi Liu
3. Siyu Zhang
4. Xi Song
5. Ming Xu
6. Huiling Wu
7. Jianjun Zou
8. Yuying Shen
This article has no evaluations. Latest version Oct 17, 2025

Interpretable Machine Learning for Early Prediction of Acute Kidney Disease (AKD) in Sepsis-Associated Acute Kidney Injury (SA-AKI): A Multicenter Cohort Study with External Validation

This article has 8 authors:
1. Shuang Chen
2. Guang Li
3. Qingzhan Zeng
4. Xiancheng Xu
5. Chanlin Li
6. Xiaoyue Li
7. Shaohong Li
8. Heng Li
This article has no evaluations. Latest version Sep 22, 2025

Machine Learning Risk Prediction for Prolonged Hospitalization in Frail Older Adults with Multimorbidity

This article has 16 authors:
1. Innocent Tesha
2. Wang Jiasi
3. Zhao Xizhe
4. Nassor Makame
5. Maryam Mbarak
6. Ding Lin
7. Yue Chen
8. Maxwell Ahiafor
9. Sidney Amadi
10. Njoka Irene
11. Jermaine Sikombe
12. Mwila Kafwembe
13. Deogratius Galikano
14. Masoud Mtore
15. Wellington Ngari
16. Liu Xinyu
This article has no evaluations. Latest version Oct 20, 2025
