Dynamic Landmark-Based Prediction of Sepsis Using Interpretable and Balanced Machine Learning Models in Respiratory-Supported Critically ill Patients

Ayao Sangenis Assogba
Jennifer H. Gladius
Komi Selassi Gayi
Samadou Tchakondo
Yendouname Kandjoni
Richard Sagacity Tugbeh
Rachana Das

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background Early recognition of sepsis in critically ill patients remains challenging due to dynamic physiological changes and nonspecific clinical presentation. Most prediction models rely on static or continuously updated data streams without explicitly accounting for evolving risk over clinically meaningful time intervals. The study aimed to develop and evaluate a landmark-based dynamic machine learning framework to predict sepsis within a 6-hour horizon among respiratory-supported intensive care unit (ICU) patients. Methods This is a secondary analysis using data from the MIMIC-IV database. Adult intensive care unit patients receiving respiratory support were evaluated at four landmarks (6, 12, 18, and 24 hours). At each point, sepsis-free patients were used to predict sepsis onset within the next 6 hours. Models included logistic regression, random forest, and XGBoost. The patient–level train–test splitting and group cross-validation prevented information leakage. Performance was assessed using discrimination, classification metrics, and calibration. A balanced ensemble approach addressed class imbalance in sensitivity analysis, and interpretability was examined using permutation importance and regression effect estimates. Results A total of 41,871, 39,912, 36,472, and 31,367 patients were included at the 6, 12, 18, and 24-hour landmarks, respectively. Sepsis incidence declined from 1.48% to 0.37% across time points. Model performance varied, with the 18-hour landmark showing the best balance between discrimination and clinically meaningful operating characteristics. Logistic regression achieved the highest discrimination in the primary analysis (AUROC = 0.78), while random forest performed best in sensitivity analyses (AUROC = 0.77). Both consistently identified the 18-hour landmark as optimal, indicating that temporal risk structure outweighed algorithm choice. Calibration was checked overall but showed overestimation at higher predicted risks. Key predictors reflected respiratory, hemodynamic, neurological, and comorbidity factors. Conclusions Landmark-based dynamic modelling provides a clinically interpretable and temporally informed strategy for early sepsis prediction in respiratory-supported intensive care unit patients. The consistent identification of the 18-hour window as the most informative prediction point suggests that intermediate ICU time frames may offer the best balance between timeliness and predictive stability. Further work should focus on recalibration, threshold optimization, and external validation before clinical implementation.

Version published to 10.21203/rs.3.rs-8737800/v1 on Research Square
Mar 25, 2026

Case-Control Matching Erodes Feature Discriminability for Machine Learning-Based Sepsis Prediction in ICUs: A Retrospective Cohort Study

This article has 6 authors:
1. Sophia Ehlers
2. Youssef Farag
3. Fanny Tranchellini
4. Tim Hahn
5. Catherine Jutzeler
6. Lakmal Meegahapola
This article has no evaluationsLatest version Apr 9, 2026
The Study on the Prognostic Assessment Value of Pan-Immune-Inflammation Value (PIV) in Patients with Sepsis

This article has 6 authors:
1. Jieyu Liu
2. Yuxin Dong
3. Jingyuan Wang
4. Jiaxuan Sun
5. Songtao Shou
6. Linning Cai
This article has no evaluationsLatest version Mar 10, 2026
Application Of Multi-Inflammatory Index To Predict 28-Day Mortality In ICU Patients With Heart Failure: A Retrospective Machine Learning Study Based On The MIMIC-IV Database

This article has 5 authors:
1. Longcha Liu
2. Zhenjie Dai
3. Xueshu Yu
4. Zhi Chen
5. Yanqiu Lin
This article has no evaluationsLatest version Mar 28, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Case-Control Matching Erodes Feature Discriminability for Machine Learning-Based Sepsis Prediction in ICUs: A Retrospective Cohort Study

The Study on the Prognostic Assessment Value of Pan-Immune-Inflammation Value (PIV) in Patients with Sepsis

Application Of Multi-Inflammatory Index To Predict 28-Day Mortality In ICU Patients With Heart Failure: A Retrospective Machine Learning Study Based On The MIMIC-IV Database