Prediction of TP53 biomarkers and survival outcomes from whole slide images using a vision transformer-based multi-instance learning framework

Abstract

Background

Accurate molecular profiling and prognostication from routine histopathology slides could transform precision oncology. We developed a Vision Transformer (ViT)-based multi-instance learning (MIL) framework that jointly performs classification across 32 solid tumour types, TP53 biomarker detection, and survival prediction directly from whole slide images (WSIs).

Methods

We curated 11,060 primary tumours from the TCGA Pan-Cancer Atlas with corresponding somatic mutation, RNA-seq, and clinical outcome data. TP53 alterations were classified as pathogenic drivers using COSMIC and hotspot annotations. WSIs underwent tissue masking, quality control, stain normalisation, and patch extraction (518 × 518 pixels) at 6× downsampling. Each patch was encoded by a ViT into a 768-dimensional embedding, and the resulting token sequence was passed to a 6-layer Transformer aggregator with learnable classification and positional embeddings. Seven task heads generated predictions for cancer type, TP53 mutation status, TP53 RNA expression level, overall survival (OS), progression-free interval (PFI), and the corresponding OS and PFI times. Training proceeded in two stages: the model was first trained on tumour tissue patches from WSIs at five magnifications, then fine-tuned on patches from all tissue regions with a content-aware strategy, updating all MIL layers for up to 150 epochs at a learning rate of 1 × 10⁻⁵. Performance was evaluated on an independent validation set of 1,729 slides using classification metrics, including the area under the receiver operating characteristic curve (AUROC), regression metrics, and concordance indices (C-index).
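The aggregation step described above can be sketched in PyTorch. This is a minimal illustration, assuming hypothetical dimensions, head names, and an attention-head count that the abstract does not specify; it is not the authors' implementation.

```python
import torch
import torch.nn as nn

class MILAggregator(nn.Module):
    """Sketch of a 6-layer Transformer aggregator over ViT patch embeddings.

    Each WSI yields a bag of 768-d patch embeddings; a learnable [CLS] token
    is prepended, positional embeddings are added, and seven linear heads read
    the aggregated [CLS] representation. Head names, hidden sizes, and the
    number of attention heads are illustrative assumptions.
    """

    def __init__(self, dim=768, n_layers=6, n_heads=8, max_patches=4096, n_classes=32):
        super().__init__()
        self.cls_token = nn.Parameter(torch.zeros(1, 1, dim))
        self.pos_embed = nn.Parameter(torch.zeros(1, max_patches + 1, dim))
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=n_layers)
        # Seven task heads: cancer type, TP53 mutation, TP53 RNA level,
        # OS event, PFI event, OS time, PFI time.
        self.heads = nn.ModuleDict({
            "cancer_type": nn.Linear(dim, n_classes),
            "tp53_mutation": nn.Linear(dim, 1),
            "tp53_rna": nn.Linear(dim, 1),
            "os_event": nn.Linear(dim, 1),
            "pfi_event": nn.Linear(dim, 1),
            "os_time": nn.Linear(dim, 1),
            "pfi_time": nn.Linear(dim, 1),
        })

    def forward(self, patch_embeddings):
        # patch_embeddings: (batch, n_patches, 768) from a pretrained ViT
        b, n, _ = patch_embeddings.shape
        cls = self.cls_token.expand(b, -1, -1)
        x = torch.cat([cls, patch_embeddings], dim=1) + self.pos_embed[:, : n + 1]
        x = self.encoder(x)
        cls_out = x[:, 0]  # slide-level representation from the [CLS] position
        return {name: head(cls_out) for name, head in self.heads.items()}

model = MILAggregator()
out = model(torch.randn(2, 100, 768))  # 2 slides, 100 patches each
```

Because all seven heads share one aggregated representation, a single forward pass per slide serves every task, which is what makes the combined prediction setting tractable.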

Results

The multi-resolution ViT-based MIL model achieved an AUROC of 0.775 (95% CI: 0.749–0.801) for TP53 mutation detection on the validation set, demonstrating strong overall performance across the classification and survival prediction tasks. The fine-tuned model performed robustly across tasks, with 0.7569 accuracy for cancer classification, 0.745 AUROC for TP53 mutation detection, C-indices of 0.686 and 0.650 for OS and PFI, and a mean squared error of 1.072 for TP53 RNA expression estimation. On the external validation set, the fine-tuned model attained 65.9% accuracy (95% CI: 0.636–0.681) for tumour classification and an AUROC of 0.766 (95% CI: 0.743–0.789) for TP53 mutation detection. With class-specific thresholds selected via the Youden index, every tumour class except ovarian cancer reached an AUROC above 0.88, indicating strong generalisation across the 32 tumour types and reasonable molecular profiling, though prognostic utility in surgical oncology remains limited.
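The concordance index reported for the OS and PFI tasks can be computed in a few lines. Below is a minimal pure-Python sketch of Harrell's C-index with right-censoring, using toy data for illustration (not values from the study):

```python
from itertools import combinations

def concordance_index(times, events, risks):
    """Harrell's C-index for right-censored survival data.

    A pair (i, j) is comparable when the subject with the shorter follow-up
    time had an observed event (event == 1); the pair is concordant when the
    model assigns that subject the higher risk. Ties in risk count as half.
    """
    concordant, comparable = 0.0, 0
    for i, j in combinations(range(len(times)), 2):
        # Order the pair so subject `a` has the shorter follow-up time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        if times[a] == times[b] or not events[a]:
            continue  # tied times or earlier subject censored: not comparable
        comparable += 1
        if risks[a] > risks[b]:
            concordant += 1.0
        elif risks[a] == risks[b]:
            concordant += 0.5
    return concordant / comparable

# Toy example: risk ordering perfectly matches survival ordering.
c = concordance_index(times=[2, 4, 6, 8], events=[1, 1, 0, 1], risks=[0.9, 0.7, 0.4, 0.2])
# c == 1.0 here; a random risk score gives roughly 0.5
```

A C-index of 0.686 for OS, as reported above, therefore means the model ranks about 69% of comparable patient pairs correctly by risk.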

Conclusion

A ViT-based MIL model can simultaneously infer tumour type, TP53 mutation status, and TP53 RNA expression level directly from WSIs, with performance comparable to conventional genomic assays, although prognostic performance remains limited. This integrated, slide-level approach offers a scalable pipeline for computational pathology.
