Development of a Machine Learning-Based Prediction Model for Postoperative Delirium in Frail Elderly Patients Undergoing Non-Cardiac Surgery Under General Anesthesia

Qiufeng Wang
Didi Mu
Xiaofeng wang
Wenmeng Han
Jianpeng Wang
Jun Shen
Cai Ning
guanghong Xu

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background : In frail older adults, the incidence of postoperative delirium is markedly increased, leading to greater morbidity, prolonged length of stay, and higher healthcare costs. An accurate POD prediction model can direct preventive strategies and improve patient outcomes. Employing advanced machine-learning techniques, this study develops a POD prediction model using comprehensive pre-operative and intra-operative data. Methods : We enrolled 2,089 frail patients aged ≥65 years undergoing general anesthesia for non-cardiac surgery at Fuyang People’s Hospital between February 2023 and February 2025. Thirty-eight baseline, anesthetic, and laboratory variables were extracted; missing data were handled by multiple imputation using chained equations (MICE). The dataset was randomly split 7:3 into training and validation sets. After feature selection with Boruta and LASSO, eight machine-learning models—logistic regression, random forest, support-vector classifier, XGBoost, artificial neural network, naïve Bayes, k-nearest neighbors, and decision tree—were trained and compared, with ROC-AUC as the primary metric, accompanied by accuracy, precision, recall, and F1-score. Model interpretability was achieved using SHAP analysis for the best-performing algorithm. Results : Among 2,089 frail elderly patients, the incidence of POD was 16.52%. After Boruta and LASSO identified 15 key predictors, the XGBoost model achieved an AUC of 0.813, outperforming the other seven algorithms. SHAP analysis identified MMSE score, Charlson Comorbidity Index, and age as the strongest predictors. External validation demonstrated high clinical utility on decision-curve analysis, with an ROC-derived sensitivity of 0.813 and specificity of 0.793, confirming robust performance without overfitting. Conclusions : This study presents a robust XGBoost-based model for predicting postoperative delirium in frail elderly patients undergoing non-cardiac surgery, demonstrating the potential of machine learning for clinical risk stratification. With its balanced performance and high accuracy, the model enables clinicians to identify high-risk patients and initiate timely interventions. Future work should focus on integration into clinical workflows and further external validation.

Version published to 10.21203/rs.3.rs-7554250/v1 on Research Square
Oct 13, 2025

Machine Learning Models for Predicting 28-Day Mortality in Gastrointestinal Bleeding with Acute Kidney Injury: A MIMIC-IV-Based Study

This article has 8 authors:
1. Xiangyu Zhang
2. Yanpeng Hu
3. Xingye Zhu
4. Chan Yu
5. Cuicui Liu
6. Jian Xue
7. Yingfeng Su
8. Baoqing Ma
This article has no evaluationsLatest version Oct 1, 2025
Building a Machine Learning Model to Predict the Early Mortality Risk in Pediatric ICU Sepsis Patients

This article has 6 authors:
1. Lin Yang
2. Na Zang
3. Ying Yang
4. KaiBing Pu
5. Cong Liu
6. LiPing Tan
This article has no evaluationsLatest version Nov 3, 2025
An interpretable machine-learning model for predicting postoperative recovery quality after cardiovascular surgery: development, validation, and clinical applicability

This article has 7 authors:
1. Luo Zhang
2. Bei Ma
3. Zhi Xing
4. Yudong Wang
5. Shunping Tian
6. Zhuan Zhang
7. Jianyou Zhang
This article has no evaluationsLatest version Nov 11, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Machine Learning Models for Predicting 28-Day Mortality in Gastrointestinal Bleeding with Acute Kidney Injury: A MIMIC-IV-Based Study

Building a Machine Learning Model to Predict the Early Mortality Risk in Pediatric ICU Sepsis Patients

An interpretable machine-learning model for predicting postoperative recovery quality after cardiovascular surgery: development, validation, and clinical applicability