Tabular Foundation Model for Breast Cancer Prognosis using Gene Expression Data

Tuyen Vu
Ha X. Tran
Xiaomei Li
Lin Liu
Jiuyong Li
Jia Tina Du
Thuc D. Le

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Survival analysis is essential in oncology for modeling time-to-event outcomes such as overall survival and disease recurrence. Traditional approaches, such as the Cox Proportional Hazards (CPH) model, have been widely used due to their interpretability but rely on restrictive assumptions of linearity and proportional hazards, which limit their ability to capture the nonlinear relationships present in high-dimensional genomic data. Recent machine learning methods, including Random Survival Forests (RSF) and deep learning models such as DeepSurv, have improved flexibility and predictive performance but require extensive training data, hyperparameter tuning, and computationally expensive optimization, which hinder their practical use. We propose TabSurv, a novel survival prediction framework that leverages a foundation model for tabular data tasks using in-context learning. TabSurv predicts survival times in a regression setting, allowing rapid adaptation to new datasets with minimal computational cost. The model is trained using only uncensored samples and evaluated with the concordance index (C-index) and stability metrics to assess both accuracy and robustness. We benchmark TabSurv against seven state-of-the-art survival models across 12 breast cancer datasets. The results demonstrate that TabSurv achieves competitive or superior performance, obtaining the best C-index on six datasets and the highest overall stability score. These findings highlight TabSurv as a powerful and efficient tool for breast cancer prognosis using high-dimensional molecular data.

Version published to 10.1101/2025.10.03.25337265 on medRxiv
Oct 5, 2025

Cross-Platform Reproducible Modeling of Breast Cancer Prognosis Using the Core-PAM50 Gene Signature

This article has 2 authors:
1. Rafael de Negreiros Botan
2. Joao Batista de Sousa
This article has no evaluationsLatest version Dec 19, 2025
Multiomics and Machine Learning Identify Prognostic Immune Related Gene Signatures in Ovarian Cancer

This article has 4 authors:
1. Xiulan Wang
2. Xuewang Guo
3. Yanying Xu
4. Shaofang Hua
This article has no evaluationsLatest version Dec 18, 2025
Machine Learning-Based Survival Time Prediction in Colorectal Cancer with Peritoneal Metastasis: A Multi-Institutional Registry-Based Study

This article has 32 authors:
1. Yoshiko Bamba
2. Michio Itabashi
3. Hirotoshi Kobayashi
4. Kenjiro Kotake
5. Masayasu Kawasaki
6. Yukihide Kanemitsu
7. Yusuke Kinurgasa
8. Hideki Ueno
9. Kotaro Maeda
10. Takeshi Suto
11. Kimihiko Funahashi
12. Heita Ozawa
13. Fumikazu Koyama
14. Shingo Noura
15. Hideyuki Ishida
16. Masayuki Ohue
17. Tomomichi Kiyomatsu
18. Soichiro Ishihara
19. Keiji Koda
20. Hideo Baba
21. Kenji Kawada
22. Yojiro Hashiguchi
23. Takanori Goi
24. Yuji Toiyama
25. Naohiro Tomita
26. Eiji Sunami
27. Yoshito Akagi
28. Jun Watanabe
29. Kenichi Hakamada
30. Goro Nakayama
31. Kenichi Sugihara
32. Yoichi Ajioka
This article has no evaluationsLatest version Jan 21, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Cross-Platform Reproducible Modeling of Breast Cancer Prognosis Using the Core-PAM50 Gene Signature

Multiomics and Machine Learning Identify Prognostic Immune Related Gene Signatures in Ovarian Cancer

Machine Learning-Based Survival Time Prediction in Colorectal Cancer with Peritoneal Metastasis: A Multi-Institutional Registry-Based Study