A machine learning framework for supervised treatment response prediction from tumor transcriptomics: A large-scale pan-cancer study

Lipika Ray Pal
Edward Michael Gertz
Nishanth Ulhas Nair
Sumit Mukherjee
Sumeet Patiyal
Thomas Cantore
Emma M. Campagnolo
Tiangen Chang
Saugato Rahman Dhruba
Yewon Kim
Eldad David Shulman
Padma Sheila Rajagopal
Danh-Tai Hoang
Alejandro A Schaffer
Eytan Ruppin

Abstract

Precision oncology aims to guide treatment decisions using biomarkers. While DNA-based panels are increasingly applied, RNA transcriptomics remains underused due to limited datasets and the absence of robust models. We assembled the largest transcriptomic resource for drug response prediction to date, spanning 69 cohorts, 3,729 patients, nine cancer types, and six frontline therapies: anti-PD-1/PD-L1 immune-checkpoint inhibitors, trastuzumab, bevacizumab, BRAF inhibitors, paclitaxel, and FAC/FEC (Fluorouracil-Adriamycin-Cyclophosphamide/Fluorouracil-Epirubicin-Cyclophosphamide) chemotherapy. We developed EXPRESSO (EXpression-Profile-RESponSe-Optimizer), a supervised machine-learning framework that predicts treatment response from pre-treatment transcriptomes by integrating drug targets and context-specific biomarkers. EXPRESSO achieves ROC-AUCs of 0.64–0.73 and odds ratios of 2.4–4.6 across therapies, outperforming 20 published transcriptomic signatures. Robustness analysis reveals that predictive performance plateaued for some therapies as training cohorts were added but continued to improve for others. These findings suggest inherent limits of supervised brute-force learning for certain treatments, although additional data and deeper mechanistic modeling may further enhance transcriptomics-based predictors.
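For readers unfamiliar with the two metrics reported in the abstract, the following is a minimal sketch of how a ROC-AUC and an odds ratio can be computed for a binary response predictor. It is not the EXPRESSO implementation: the function names, the toy scores and labels, and the 0.5 decision threshold are all illustrative assumptions, and the abstract does not state how predictions were dichotomized for the odds ratio.

```python
from typing import Sequence


def roc_auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """ROC-AUC as the fraction of (responder, non-responder) pairs in which
    the responder receives the higher score; ties count as half a win."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one responder and one non-responder")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def odds_ratio(scores: Sequence[float], labels: Sequence[int],
               threshold: float) -> float:
    """Odds of response among predicted responders (score >= threshold)
    divided by the odds among predicted non-responders."""
    tp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 1)
    fp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 0)
    fn = sum(1 for s, y in zip(scores, labels) if s < threshold and y == 1)
    tn = sum(1 for s, y in zip(scores, labels) if s < threshold and y == 0)
    if fp == 0 or fn == 0:
        raise ValueError("odds ratio undefined when a 2x2 cell is zero")
    return (tp * tn) / (fp * fn)


# Hypothetical toy data: model scores and true response (1 = responder).
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]
toy_labels = [1, 1, 0, 1, 0, 1, 0, 0]

# The 0.5 threshold is an assumption chosen for illustration only.
print(roc_auc(toy_scores, toy_labels))
print(odds_ratio(toy_scores, toy_labels, threshold=0.5))
```

A ROC-AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation of responders from non-responders; an odds ratio above 1 means patients predicted to respond have higher odds of responding than those predicted not to.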

Version published to 10.1101/2025.10.24.684491 on bioRxiv
Oct 26, 2025

Related articles

12-Gene Signature for Prediction of Chemotherapy Response in Gastric Cancer

This article has 2 authors:
1. Nhan Tran
2. Minh Nam Nguyen
This article has no evaluations. Latest version Jan 16, 2026

Cross-Platform Reproducible Modeling of Breast Cancer Prognosis Using the Core-PAM50 Gene Signature

This article has 2 authors:
1. Rafael de Negreiros Botan
2. Joao Batista de Sousa
This article has no evaluations. Latest version Dec 19, 2025

Integrative In Silico Transcriptomic and Pharmacogenomic Analysis of CD276 as a Candidate Prognostic Biomarker and Therapeutic Target in Bladder Cancer

This article has 8 authors:
1. Faruk Recep Özalp
2. Halil İbrahim Ellez
3. Ahmet Melih Arslan
4. Erkut Demirciler
5. Savaş Gökçek
6. Oktay Halit Aktepe
7. Hüseyin Salih Semiz
8. Aziz Karaoğlu
This article has no evaluations. Latest version Feb 2, 2026
