Development and Internal Validation of an Explainable Machine-Learning Model to Predict 3-Year overall survival rate After Radical Cystectomy

Yunze Wang
Aikeshanjiang Ailiyaer
Shiming Chen
Wenguang Wang

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background: This study aimed to develop and internally validate an explainable machine-learning model using routinely available clinicopathologic and laboratory variables for predicting 3-year overall survival (OS) after radical cystectomy. Methods: We retrospectively included 300 patients who underwent radical cystectomy between January 2018 and December 2022. Predictors were selected in the training set using LASSO logistic regression followed by random-forest recursive feature elimination. Ten variables were retained. Seven algorithms (logistic regression, KNN, SVM-RBF, random forest, XGBoost, LightGBM, and CatBoost) were trained on a 70% training set and evaluated on a 30% internal validation set. Discrimination, calibration, and clinical utility were assessed, and the final model was interpreted using Shapley additive explanations (SHAP). Results: In internal validation, AUCs ranged from 0.834 to 0.950. CatBoost achieved the best overall classification performance (AUC = 0.931, accuracy = 0.862, sensitivity = 0.647, specificity = 0.951, PPV = 0.846, and NPV = 0.867). SHAP analyses identified tumor stage (T, N, and M stage) as the dominant drivers of predicted risk, with additional contributions from age, BMI, albumin, globulin, lymphocyte count, platelet count, and preoperative creatinine. Conclusions: We developed an internally validated, SHAP-interpretable CatBoost model for predicting 3-year overall survival (OS) after radical cystectomy. External validation and recalibration in independent cohorts are required before clinical use.

Version published to 10.21203/rs.3.rs-8754027/v1 on Research Square
Feb 11, 2026

Interpretable Machine Learning Models for Bladder Cancer Overall Survival Prediction Development and External Validation via SEER Database and Chinese Cohort Analysis

This article has 7 authors:
1. Saimaitikari Abudoubari
2. Abudouresuli Tuersun
3. Sailidan Mutailipu
4. Wenbin chen
5. Qiange Li
6. Mayidili Nijiati
7. Xiaoguang Zou
This article has no evaluationsLatest version Feb 10, 2026
Construction and validation of a nomogram for overall survival prognosis in patients with advanced (stage Ⅲ/Ⅳ) pancreatic cancer

This article has 10 authors:
1. Dongqi Yang
2. Chenjie Wang
3. Ke Su
4. Xin Liu
5. Zunyuan Tan
6. Jianwen Zhang
7. Han Li
8. Zhenjiang Li
9. Kun He
10. Yunwei Han
This article has no evaluationsLatest version Apr 3, 2026
A Predictive Tool Powered by Machine Learning for Evaluating the Status of Surgical Margins After Robot-Assisted Radical Prostatectomy

This article has 4 authors:
1. Gen Fan
2. Yang Li
3. Yushui Chen
4. Tielong Tang
This article has no evaluationsLatest version Feb 11, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Interpretable Machine Learning Models for Bladder Cancer Overall Survival Prediction Development and External Validation via SEER Database and Chinese Cohort Analysis

Construction and validation of a nomogram for overall survival prognosis in patients with advanced (stage Ⅲ/Ⅳ) pancreatic cancer

A Predictive Tool Powered by Machine Learning for Evaluating the Status of Surgical Margins After Robot-Assisted Radical Prostatectomy