An Integrated Random Forest– and LASSO-Derived Nomogram for Predicting Postoperative Nosocomial Infections in Colorectal Cancer Patients

Ranran Lu
Xiujuan Xue
Shuhui Wang
Tongtong Chen
Yanhong Wang

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Objective We sought to delineate the independent risk factors underlying postoperative nosocomial infections in colorectal cancer patients and to construct and validate a nomogram for individualized risk prediction, thereby enabling early clinical identification of high-risk individuals and the implementation of targeted preventive strategies. Methods We retrospectively analyzed 1,146 colorectal cancer patients who underwent surgical resection, stratifying those treated between 2020 and 2021 (n = 762) as the training set and those treated in 2022 (n = 384) as the validation set. Candidate predictors were first evaluated by univariate analysis. We then applied a random forest to quantify variable importance and employed LASSO regression to refine feature selection and mitigate multicollinearity. Independent risk factors emerging from these steps were confirmed via multivariate logistic regression. Based on these determinants, we developed a nomogram for individualized risk estimation. Model performance was rigorously assessed in both cohorts: discrimination was measured by the area under the receiver operating characteristic curve, calibration was examined through calibration plots, and clinical benefit was appraised using decision curve analysis. Results Postoperative nosocomial infections occurred in 9.6% (110/1,146) of patients, most frequently presenting as lower respiratory tract infections (34.6%) and surgical-site infections (30.9%). Multivariate logistic regression identified prolonged operative duration, the presence of postoperative complications, open surgical approach, ASA score ≥ III, a history of coronary artery disease, use of postoperative drainage, and persistent fever lasting ≥ 3 days as independent predictors. The resulting nomogram demonstrated excellent discrimination, with an area under the ROC curve of 0.860 (95% CI, 0.815–0.905) in the training cohort and 0.827 (95% CI, 0.774–0.880) in the validation cohort. Calibration plots showed high concordance between predicted and observed infection rates, and decision curve analysis confirmed the model’s clinical utility across relevant threshold probabilities. Conclusions Our nomogram enables precise stratification of colorectal cancer patients by their postoperative infection risk, highlighting perioperative factors—such as operative duration, surgical approach, and ASA grade—that warrant targeted management. Future prospective, multicentre validation will be essential to refine and generalize the model’s applicability.

Version published to 10.21203/rs.3.rs-7361808/v1 on Research Square
Oct 1, 2025

Machine learning prediction and interpretive analysis of multidrug-resistant microbial infection risk in septicemia patients: A study from the MIMIC-IV database

This article has 5 authors:
1. Qianqian Zhang
2. Nianzhi Zhang
3. Ying Zheng
4. Jing Zhou
5. Ling Liu
This article has no evaluationsLatest version Dec 30, 2025
A Multicenter Machine Learning Model Incorporating Circulating Tumor Cells for Postoperative Recurrence Prediction in Localized Renal Cell Carcinoma

This article has 21 authors:
1. Zihao Li
2. Chunzhi Qi
3. Yue Chong
4. Qiang Wei
5. Shaogang Wang
6. Jianbin Bi
7. Jinkai Shao
8. Xiaoping Zhang
9. Xin Gou
10. Wenhao Shen
11. Weiyang He
12. Xiaoming Cao
13. Wei Xiong
14. Guojun Chen
15. Xiaojian Yang
16. Jianxin Qiu
17. Yingyi Li
18. Jianzhou Liu
19. Yuan Shen
20. Tie Chong
21. Zhenlong Wang
This article has no evaluationsLatest version Jan 23, 2026
Machine Learning-Based Survival Time Prediction in Colorectal Cancer with Peritoneal Metastasis: A Multi-Institutional Registry-Based Study

This article has 32 authors:
1. Yoshiko Bamba
2. Michio Itabashi
3. Hirotoshi Kobayashi
4. Kenjiro Kotake
5. Masayasu Kawasaki
6. Yukihide Kanemitsu
7. Yusuke Kinurgasa
8. Hideki Ueno
9. Kotaro Maeda
10. Takeshi Suto
11. Kimihiko Funahashi
12. Heita Ozawa
13. Fumikazu Koyama
14. Shingo Noura
15. Hideyuki Ishida
16. Masayuki Ohue
17. Tomomichi Kiyomatsu
18. Soichiro Ishihara
19. Keiji Koda
20. Hideo Baba
21. Kenji Kawada
22. Yojiro Hashiguchi
23. Takanori Goi
24. Yuji Toiyama
25. Naohiro Tomita
26. Eiji Sunami
27. Yoshito Akagi
28. Jun Watanabe
29. Kenichi Hakamada
30. Goro Nakayama
31. Kenichi Sugihara
32. Yoichi Ajioka
This article has no evaluationsLatest version Jan 21, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Machine learning prediction and interpretive analysis of multidrug-resistant microbial infection risk in septicemia patients: A study from the MIMIC-IV database​

A Multicenter Machine Learning Model Incorporating Circulating Tumor Cells for Postoperative Recurrence Prediction in Localized Renal Cell Carcinoma

Machine Learning-Based Survival Time Prediction in Colorectal Cancer with Peritoneal Metastasis: A Multi-Institutional Registry-Based Study

Machine learning prediction and interpretive analysis of multidrug-resistant microbial infection risk in septicemia patients: A study from the MIMIC-IV database