Development and External Validation of a Machine Learning–Based Model for Early Prediction of Multiple Organ Dysfunction Syndrome in Critically Ill Patients with Sepsis

Jinbin Yang
Linying Cai
Xuyang Liu
Kaihuan Zhou
Junyu Lu
Yegui Yang

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background Multiple organ dysfunction syndrome (MODS) is a key determinant of prognosis in sepsis, yet conventional severity scoring systems based on linear assumptions and static variables fail to capture complex nonlinear physiological disturbances and dynamic inter organ interactions. Although machine learning has shown promise in outcome prediction among critically ill patients, studies focusing on MODS while ensuring interpretability and external validation remain limited. Methods This retrospective cohort study used data from the Medical Information Mart for Intensive Care IV and the eICU Collaborative Research Database. Adult patients meeting Sepsis 3 criteria and admitted to the ICU for the first time were included. Feature selection was performed using least absolute shrinkage and selection operator regression. Multiple machine learning models were developed, including logistic regression, random forest, gradient boosting machine, extreme gradient boosting, Light Gradient Boosting Machine, artificial neural networks, and support vector machines. Model performance was evaluated using the area under the receiver operating characteristic curve, calibration curves, and decision curve analysis. Shapley additive explanations were used for model interpretation, and external validation was conducted in an independent eICU cohort. Results Among 23,018 patients with sepsis, 4,931 (21.4%) developed MODS during ICU hospitalisation. All models showed acceptable discrimination, with LightGBM achieving the highest AUC (0.829), followed by GBM (0.824), random forest (0.823), and XGBoost (0.822). Logistic regression and elastic net showed moderate performance (both AUC 0.802), the neural network showed intermediate discrimination (AUC 0.803), whereas support vector machines (0.759) and k nearest neighbours (0.727) performed less well. LightGBM demonstrated stable discrimination, good calibration, and greater clinical net benefit in both internal testing and external validation. SHAP analysis identified the Sequential Organ Failure Assessment score, respiratory rate, lactate, coagulation indices including international normalised ratio, acid base status, and vasoactive agent use as key predictors with pronounced nonlinear effects. Conclusion Among the evaluated models, the gradient boosting based LightGBM showed the most robust performance for predicting MODS risk in sepsis, supporting early risk stratification and individualised ICU management. Prospective multicentre studies are warranted to confirm its clinical impact.

Version published to 10.21203/rs.3.rs-8681490/v1 on Research Square
Mar 8, 2026

Dynamic Landmark-Based Prediction of Sepsis Using Interpretable and Balanced Machine Learning Models in Respiratory-Supported Critically ill Patients

This article has 7 authors:
1. Ayao Sangenis Assogba
2. Jennifer H. Gladius
3. Komi Selassi Gayi
4. Samadou Tchakondo
5. Yendouname Kandjoni
6. Richard Sagacity Tugbeh
7. Rachana Das
This article has no evaluationsLatest version Mar 25, 2026
Explainable Machine Learning Model for Predicting Early Neurological Deterioration in Patients with Acute Ischemic Stroke

This article has 3 authors:
1. Tingting Huang
2. Shoucai Zhao
3. Kai Wang
This article has no evaluationsLatest version Apr 1, 2026
Development and validation of an interpretable machine learning model for predicting in-hospital mortality in patients with ventricular fibrillation

This article has 9 authors:
1. Chengdi Chen
2. Kaixiang Zhang
3. Tongchun Zhong
4. Haochun Li
5. Zibei Feng
6. Zhijian Guo
7. Zhixiong Yang
8. Shian Huang
9. Lingpin Pang
This article has no evaluationsLatest version Mar 26, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Dynamic Landmark-Based Prediction of Sepsis Using Interpretable and Balanced Machine Learning Models in Respiratory-Supported Critically ill Patients

Explainable Machine Learning Model for Predicting Early Neurological Deterioration in Patients with Acute Ischemic Stroke

Development and validation of an interpretable machine learning model for predicting in-hospital mortality in patients with ventricular fibrillation