Explainable Machine Learning Predicts Mortality in Critically Ill Patients with Nonvariceal Upper Gastrointestinal Bleeding: A MIMIC-IV Study with External Validation

Jialin Lu
Chuting Yu
Hangbang Li
Qiuxin Li
Ye Gao
Wei Wang
Luowei Wang

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background: Non-variceal upper gastrointestinal bleeding (NVUGIB) poses significant mortality in critically ill patients, necessitating accurate early prognostication for timely interventions. Recent advances in machine learning have demonstrated potential to significantly improve predictive performance compared to conventional clinical scores. Accordingly, this study aims to establish a machine learning model named NVUPreM to predict 30-day NVUGIB mortality and validate its superiority over traditional scoring systems. Methods: This retrospective study derived data from the Medical Information Mart for Intensive Care IV (n=11,237) and the eICU Collaborative Research Database (n=7,742) databases for model development and external validation. Predictors were selected via least absolute shrinkage and selection operator regression to minimize multicollinearity. Thirty-six machine learning algorithms were evaluated using tenfold cross-validation. The optimal model (NVUPreM) was compared against eight clinical scoring systems (AIMS65, Charlson, GBS, GCS, Admission-Rockall, SAPSII, SOFA) using the area under the receiver operating characteristic curve (AUC), calibration, decision curve analysis, and SHapley Additive exPlanations for interpretability. Results: The NVUPreM model demonstrated superior discrimination (AUC=0.876, [95% CI 0.846-0.907]) and sensitivity (0.86), showing the best predictive performance among all models. In internal validation, the NVUPreM model outperformed all clinical scores according to the results of AUCs (AIMS65: AUC=0.693; Charlson: AUC=0.636; GBS: AUC=0.575; GCS: AUC=0.707; NVUPreM: AUC=0.876; Admission-Rockall: AUC=0.633; SAPSII: AUC=0.777; SOFA: AUC=0.665), decision curve analysis and calibration curve. External validation in eICU confirmed robustness of the NVUPreM model in terms of discrimination (AUC=0.82, [95% CI 0.803-0.837]), calibration, and clinical application. The interpretability analysis revealed directional feature contributions, identifying predictors with significantly positive and negative impacts on the model output. Conclusion: The NVUPreM model significantly outperforms existing clinical scores in predicting 30-day NVUGIB mortality, offering both accuracy and interpretability, which could assist clinicians in early high-risk patient identification and personalized intervention.

Version published to 10.21203/rs.3.rs-7332852/v1 on Research Square
Aug 21, 2025

Development and validation of machine learning models for predicting short- and long-term mortality in gastroparesis patients: a retrospective cohort study using the MIMIC-IV database

This article has 5 authors:
1. Lei Zhu
2. Qi Han
3. Bei Pei
4. Jie Zhang
5. Haolong Qi
This article has no evaluationsLatest version Dec 31, 2025
Development and validation of an Explainable Machine Learning Model for Predicting Multiple Organ Failure in Patients with Acute Pancreatitis: a Multicenter Cohort Study

This article has 7 authors:
1. Yi Hao
2. Peiyi Bai
3. Yunpeng Zhou
4. Yi Wang
5. Qinyang Du
6. Rongshen Guan
7. Gaopeng Li
This article has no evaluationsLatest version Dec 22, 2025
Development and External Validation of a Nurse-Friendly Machine Learning Model for Early Identification of Intradialytic Hypotension in ICU Patients Receiving Renal Replacement Therapy

This article has 8 authors:
1. Zhenyuan Yu
2. Huan Tang
3. Wenjia Ye
4. Zixin Gu
5. Yu Fu
6. Rong Yao
7. Ying Guan
8. Yonghong Shen
This article has no evaluationsLatest version Jan 23, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Development and validation of machine learning models for predicting short- and long-term mortality in gastroparesis patients: a retrospective cohort study using the MIMIC-IV database

Development and validation of an Explainable Machine Learning Model for Predicting Multiple Organ Failure in Patients with Acute Pancreatitis: a Multicenter Cohort Study

Development and External Validation of a Nurse-Friendly Machine Learning Model for Early Identification of Intradialytic Hypotension in ICU Patients Receiving Renal Replacement Therapy