Exploring the Potential of Machine Learning in Gastric Cancer: Prognostic Biomarkers, Subtyping, and Stratification
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Background Advancements in the management of gastric cancer (GC) and innovative therapeutic approaches highlight the significance of the role of biomarkers in GC prognosis. Machine-learning (ML)-based methods can be applied to identify the most important predictors and unravel their interactions to classify patients, which might guide prioritized treatment decisions. Methods A total of 140 patients with histopathological confirmed GC who underwent surgery between 2011 and 2016 were enrolled in the study. The inspired modification of the partial least squares (SIMPLS)-based model was used to identify the most significant predictors and interactions between variables. Predictive partition analysis was employed to establish the decision tree model to prioritize markers for clinical use. ML models have also been developed to predict TNM stage and different subtypes of GC. Latent class analysis (LCA) and principal component analysis (PCA) were carried out to cluster the GC patients and to find a subgroup of survivors who tended to die. Results The findings revealed that the SIMPLS method was able to predict the mortality of GC patients with high predictabilities (Q 2 = 0.45–0.70). The analysis identified MMP-7, P53, Ki67, and vimentin as the top predictors. Correlation analysis revealed different patterns of prognostic markers in the non-survivor and survivor cohorts and different GC subtypes. The main prediction models were verified via other ML-based analyses, with a high area under the curve (AUC) (0.84–0.99), specificity (0.82–0.99) and sensitivity (0.87–0.99). Patients were classified into three clusters of mortality risk, which highlighted the most significant mortality predictors. Partition analysis prioritizes the most significant predictors P53 ≥ 6, COX-2 > 2, vimentin > 2, Ki67 ≥ 13 in mortality of patients (AUC = 0.85–0.90). Conclusion The present study highlights the importance of considering multiple variables and their interactions to predict the prognosis of mortality and stage in GC patients through ML-based techniques. These findings suggest that the incorporation of molecular biomarkers may enhance patient prognosis compared to relying solely on clinical factors. Furthermore, they demonstrate the potential for personalized medicine in GC treatment by identifying high-risk patients for early intervention and optimizing therapeutic strategies. The partition analysis technique offers a practical tool for identifying cutoffs and prioritizing markers for clinical application. Additionally, providing Clinical Decision Support systems with predictive tools can assist clinicians and pathologists in identifying aggressive cases, thereby improving patient outcomes while minimizing unnecessary treatments. Overall, this study contributes to the ongoing efforts to improve patient outcomes by advancing our comprehension of the intricate nature of GC.