Large Language Models Improve Cancer Survival Prediction Using Real-World Clinical Notes

Niklas Kiermeyer
Tim Lenfers
Amin Dada
Julian Friedrich
Sameh Khattab
Eric Knop
Jan Egger
Markus Pauly
Andreas Jung
Grégoire Montavon
Jens T. Siveke
Marcel Wiesweg
Stefan Kasper
Ulf P. Neumann
Frederick Klauschen
Sylvia Hartmann
Martin Schuler
Philipp Keyl
Jens Kleesiek
Julius Keyl

Abstract

Medical documentation generates vast amounts of unstructured text that remain underutilized in current prognostic models. We investigate the potential of self-hosted large language models (LLMs) to extract clinically meaningful, patient-specific information from routine clinical notes for personalized risk stratification in cancer care.

We collected real-world medical notes from 2,708 non-small cell lung cancer (NSCLC) patients and 814 colon cancer patients, documented before treatment at a large comprehensive cancer center. LLMs extracted key prognostic indicators, including comorbidities, metastatic sites, and qualitative descriptors of patient condition, in a zero-shot manner without prior task-specific training. Integrating these LLM-derived features into machine learning models significantly improved the prediction of overall survival compared to TNM staging alone (C-index: NSCLC, 0.72 vs 0.64; colon cancer, 0.70 vs 0.59), and surpassed models using text embeddings. Based on the LLM-informed risk scores, patients were stratified into four distinct risk groups, enabling reclassification of 61.4% of NSCLC and 68.3% of colon cancer patients. Analysis of model drivers revealed that LLM-derived factors, such as physical condition, substantially modulated the prognostic impact of TNM stage.
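
For readers unfamiliar with the C-index values quoted above, the following is a minimal sketch of how a concordance index can be computed. It assumes Harrell's definition, with tied risk scores counted as half-concordant; the abstract does not state which variant the authors used. The function name `concordance_index` and all toy values are hypothetical and are not taken from the study.

```python
from itertools import combinations


def concordance_index(times, events, risks):
    """Harrell's C-index (illustrative sketch).

    times  -- follow-up time per patient
    events -- 1 if death was observed, 0 if the patient was censored
    risks  -- predicted risk score (higher means worse prognosis)
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Order the pair so that patient `a` has the shorter follow-up time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        # A pair is comparable only if the times differ and the
        # shorter time ended in an observed event (not censoring).
        if times[a] == times[b] or not events[a]:
            continue
        comparable += 1
        if risks[a] > risks[b]:
            concordant += 1.0   # higher risk died earlier: concordant
        elif risks[a] == risks[b]:
            concordant += 0.5   # tied risk scores count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical toy cohort: five patients, two of them censored.
times = [5, 10, 12, 20, 30]
events = [1, 1, 0, 1, 0]

# A coarse, stage-like score with many ties versus a finer-grained score.
coarse_score = [2, 1, 2, 2, 1]
fine_score = [0.9, 0.7, 0.4, 0.5, 0.1]

print("coarse:", concordance_index(times, events, coarse_score))
print("fine:  ", concordance_index(times, events, fine_score))
```

A value of 0.5 corresponds to random ordering and 1.0 to perfect ordering, which is the scale on which the reported improvements (for example 0.64 to 0.72 for NSCLC) should be read.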

These findings highlight the potential of self-hosted LLMs to extract clinically meaningful information from unstructured clinical documentation and to support clinical decision-making.

Version published to 10.1101/2025.08.17.25333835 on medRxiv
Aug 19, 2025

A Multicenter Machine Learning Model Incorporating Circulating Tumor Cells for Postoperative Recurrence Prediction in Localized Renal Cell Carcinoma

This article has 21 authors:
1. Zihao Li
2. Chunzhi Qi
3. Yue Chong
4. Qiang Wei
5. Shaogang Wang
6. Jianbin Bi
7. Jinkai Shao
8. Xiaoping Zhang
9. Xin Gou
10. Wenhao Shen
11. Weiyang He
12. Xiaoming Cao
13. Wei Xiong
14. Guojun Chen
15. Xiaojian Yang
16. Jianxin Qiu
17. Yingyi Li
18. Jianzhou Liu
19. Yuan Shen
20. Tie Chong
21. Zhenlong Wang
This article has no evaluations. Latest version: Jan 23, 2026

Personalized Disease Risk Prediction from Multimodal Health Data Using Large Language Models

This article has 2 authors:
1. Hanieh Arjmand
2. Alexandre Tomberg
This article has no evaluations. Latest version: Jan 25, 2026

Machine learning models for predicting severe clinical events in hospitalized patients with coronary artery disease

This article has 16 authors:
1. Hao Liu
2. Meijun Liu
3. Xinmiao Guan
4. Feng Cao
5. Changhao Liang
6. Zhongwen Qi
7. Jiaqi Hui
8. Junnan Zhao
9. Jingli Xing
10. Jianguo Zhou
11. Dong Zhang
12. Lei Liu
13. Xiaoliang Hao
14. Minjing Luo
15. Fengqin Xu
16. Yutong Fei
This article has no evaluations. Latest version: Jan 12, 2026
