Utilizing machine learning models for predicting outcomes in acute pancreatitis: development and validation in three retrospective cohorts

Kaier Gu
Yang Liu

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background

Acute pancreatitis (AP) is associated with a high readmission rate; however, there is a paucity of models capable of predicting post-discharge outcomes. Furthermore, existing in-hospital prediction models exhibit notable limitations. This study leverages machine learning (ML) technology to develop prognosis prediction models for AP patients, encompassing in-hospital mortality, readmission rates, and post-discharge mortality.

Methods

A retrospective analysis was carried out on the clinical and laboratory data of AP patients from three databases (MIMIC database, eICU database, and Wenzhou Hospital in China), and they were divided into a training set and two validation sets. In the training set, key variables were screened using univariate logistic regression and the LASSO method. Six ML algorithms were employed to construct predictive models. The performance of these models was appraised using receiver operating characteristic curves, decision curve analysis, Shapley additive explanations plots, and other relevant metrics. A comparison was made between the predictive capabilities of the ML models and clinical scores. Subsequently, the performance of the machine learning models was subjected to further validation within two external validation sets.

Results

A total of 2,559 AP patients were included. There were 12–26 variables selected for model training. Among the six ML models under assessment, the Logistic Regression, Random Forest, and eXtreme Gradient Boosting (XGB) models exhibited relatively superior performance in predicting in-hospital mortality, mortality within 180/365 days after discharge. Findings from the decision curve analysis and two external validation sets further indicated that the XGB model exhibited the optimal performance in predicting the in-hospital mortality of AP patients admitted to the intensive care unit. Specifically, the XGB model demonstrated stability in the area under the curve across different centers, achieved a balance between sensitivity and specificity, and effectively prevented overfitting through regularization mechanisms. These features are highly congruent with the core requirements for robustness in the medical context.

Conclusions

By collecting the dynamic variables of patients during their hospitalization and establishing an XGB model, it is conducive to identifying the short-term and long-term prognoses of AP patients and promoting the decision-making of clinicians.

Clinical trial number

Not applicable.

Version published to 10.1186/s12911-025-03103-7
Jul 11, 2025
Version published to 10.21203/rs.3.rs-5881028/v1 on Research Square
Feb 3, 2025

Development and validation of an Explainable Machine Learning Model for Predicting Multiple Organ Failure in Patients with Acute Pancreatitis: a Multicenter Cohort Study

This article has 7 authors:
1. Yi Hao
2. Peiyi Bai
3. Yunpeng Zhou
4. Yi Wang
5. Qinyang Du
6. Rongshen Guan
7. Gaopeng Li
This article has no evaluationsLatest version Dec 22, 2025
Development and validation of machine learning models for predicting short- and long-term mortality in gastroparesis patients: a retrospective cohort study using the MIMIC-IV database

This article has 5 authors:
1. Lei Zhu
2. Qi Han
3. Bei Pei
4. Jie Zhang
5. Haolong Qi
This article has no evaluationsLatest version Dec 31, 2025
Development and Validation of a Machine Learning–Based Model for Predicting Textbook Outcome after Minimally Invasive Pancreaticoduodenectomy

This article has 7 authors:
1. Pengcheng Ma
2. Zhichen Jiang
3. Yuanyu Wang
4. Ze Jin
5. Zhiang Zhang
6. Yiping Mou
7. Weiwei Jin
This article has no evaluationsLatest version Dec 10, 2025

Discuss this preprint

Listed in

Abstract

Background

Methods

Results

Conclusions

Clinical trial number

Article activity feed

Related articles

Development and validation of an Explainable Machine Learning Model for Predicting Multiple Organ Failure in Patients with Acute Pancreatitis: a Multicenter Cohort Study

Development and validation of machine learning models for predicting short- and long-term mortality in gastroparesis patients: a retrospective cohort study using the MIMIC-IV database

Development and Validation of a Machine Learning–Based Model for Predicting Textbook Outcome after Minimally Invasive Pancreaticoduodenectomy