Prostate Cancer Detection in Bi-parametric MRI Using Deep Learning Model

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Prostate cancer is an example of a widespread cancer among men in the world and early and accurate diagnosis plays a vital role in enhancing the likelihood of a more favorable patient outcome and reducing the occurrence of invasive surgeries. In the last several years, computer-aided diagnosis systems with deep learning have also shown significant potential on the analysis of medical images, although the conventional convolutional neural networks have the tendency to fail to recreate the long-range contextual attributes in multi-faceted data of a magnetic resonance imaging (MRI). To address these limitations, the current research work is premised on investigating the effectiveness of transformer-based architecture to identify prostate cancer with a comparative analysis of two architectures, Vision Transformer (ViT) and Swin Transformer. The first step in this research involves processing prostate MRI images by first using a complete preprocessing process that entails image normalization, data augmentation to a clinical relevance that ensures that images are better and that the process also tries to eliminate class imbalance. ViT and Swin Transformer are then pretrained and used to learn prostate tissue discriminative representation by extracting features using their respective self-attention mechanisms. The extracted features are then subjected to the supervised classification, in which the performance of the model is evaluated using the typical metrics of analysis such as the accuracy and precision, recall, F1-score, and area under the receiver operating characteristic curve (AUC). Both transformer-based models can be compared as competitive in the prostate cancer detection task, as Vision Transformer is more effective in capturing the global context, and Swin Transformer is more effective in capturing the hierarchical feature representation. The cross-validation findings are also in favor of the stability of the proposed framework and its capacity to be generalized. Overall, the current paper demonstrates that the transformer-based models can possibly be applied in automated diagnosis of prostate cancer, and that it may be possible to gain a clearer idea of their flaws and strengths to create AI-assisted screening systems that are clinically reliable in the future.

Article activity feed