Developing a Predictive Model for Significant Prostate Cancer Detection in Prostatic Biopsies from Seven Clinical Variables: Is Machine Learning Superior to Logistic Regression?
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Objective: This study compares machine learning (ML) and logistic regression (LR) algorithms in developing a predictive model for sPCa using the seven predictive variables from the Barcelona (BCN-MRI) predictive model. Method: A cohort of 5005 men suspected of having PCa who underwent MRI and targeted and/or systematic biopsies was used for training, testing, and validation. A feedforward neural network (FNN)-based SimpleNet model (GMV) and a logistic regression-based model (BCN) were developed. The models were evaluated for discrimination ability, precision–recall, net benefit, and clinical utility. Both models demonstrated strong predictive performance. Results: The GMV model achieved an area under the curve of 0.88 in training and 0.85 in test cohorts (95% CI: 0.83–0.90), while the BCN model reached 0.85 and 0.84 (95% CI: 0.82–0.87), respectively (p > 0.05). The GMV model exhibited higher recall, making it more suitable for clinical scenarios prioritizing sensitivity, whereas the BCN model demonstrated higher precision and specificity, optimizing the reduction of unnecessary biopsies. Both models provided similar clinical benefit over biopsying all men, reducing unnecessary procedures by 27.5–29% and 27–27.5% of prostate biopsies at 95% sensitivity, respectively (p > 0.05). Conclusions: Our findings suggest that both ML and LR models offer high accuracy in sPCa detection, with ML exhibiting superior recall and LR optimizing specificity. These results highlight the need for model selection based on clinical priorities.