Explainable AI for Precision Oncology: A Task-Specific Approach Using Imaging, Multi-omics, and Clinical Data

Yaeseong Park
Sohyun Park
Bae EunJeong

Abstract

Despite continued advances in oncology, cancer remains a leading cause of global mortality, highlighting the need for diagnostic and prognostic tools that are both accurate and interpretable. Unimodal approaches often fail to capture the biological and clinical complexity of tumors. In this study, we present a suite of task-specific AI models that leverage CT imaging, multi-omics profiles, and structured clinical data to address distinct challenges in segmentation, classification, and prognosis.

We developed three independent models across large public datasets. Task 1 applied a 3D U-Net to segment pancreatic tumors from CT scans, achieving a Dice Similarity Coefficient (DSC) of 0.7062. Task 2 employed a hierarchical ensemble of omics-based classifiers to distinguish tumor from normal tissue and classify six major cancer types with 98.67% accuracy. Task 3 benchmarked classical machine learning models on clinical data for prognosis prediction across three cancers (LIHC, KIRC, STAD), achieving strong performance (e.g., C-index of 0.820 in KIRC, AUC of 0.978 in LIHC).
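The two headline metrics above, the Dice Similarity Coefficient and the concordance index (C-index), can be sketched in plain Python. This is an illustrative sketch, not the authors' evaluation code: the function names are hypothetical, masks are assumed to be flattened binary sequences, and the C-index follows Harrell's definition with a simplification that skips pairs with tied survival times.

```python
from itertools import combinations


def dice_coefficient(pred, truth):
    """DSC = 2 * |A and B| / (|A| + |B|) for two flat binary masks."""
    if len(pred) != len(truth):
        raise ValueError("masks must have the same number of voxels")
    intersection = sum(1 for p, t in zip(pred, truth) if p and t)
    total = sum(1 for p in pred if p) + sum(1 for t in truth if t)
    # Assumed convention: two empty masks count as perfect agreement.
    return 1.0 if total == 0 else 2.0 * intersection / total


def concordance_index(times, events, risks):
    """Harrell's C: fraction of comparable pairs ranked correctly by risk.

    times  -- observed follow-up time per subject
    events -- 1 if the event was observed, 0 if censored
    risks  -- predicted risk score (higher means earlier expected event)
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Order the pair so that subject a has the shorter follow-up time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        # Skip pairs that cannot be ordered: tied times (simplification)
        # or an earlier subject who was censored.
        if times[a] == times[b] or not events[a]:
            continue
        comparable += 1
        if risks[a] > risks[b]:
            concordant += 1.0
        elif risks[a] == risks[b]:
            concordant += 0.5  # ties in predicted risk get half credit
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable
```

For example, `dice_coefficient([1, 1, 0, 0], [1, 0, 0, 0])` gives 2/3, since one voxel overlaps out of three foreground voxels in total. A C-index of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which puts the reported 0.820 for KIRC in context.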

Across all tasks, explainable AI methods such as SHAP and attention-based visualization enabled transparent interpretation of model outputs. These results demonstrate the value of tailored, modality-aware models and underscore their clinical potential for precision oncology.

Technical Foundations

Segmentation (Task 1): A custom 3D U-Net was trained on the Task07_Pancreas dataset from the Medical Segmentation Decathlon (MSD). CT images were preprocessed with MONAI-based pipelines, resampled to (64, 96, 96) voxels, and intensity-windowed to a Hounsfield unit (HU) range of -100 to 240.
Classification (Task 2): Multi-omics data from TCGA—including gene expression, methylation, miRNA, CNV, and mutation profiles—were log-transformed and normalized. Five modality-specific LightGBM classifiers generated meta-features for a late-fusion ensemble. Stratified 5-fold cross-validation was used for evaluation.
Prognosis (Task 3): Clinical variables from TCGA were curated and imputed (median/mode), with high-missing-rate columns removed. Survival models (e.g., Cox-PH, Random Forest, XGBoost) were trained with early stopping. No omics or imaging data were used in this task.
Interpretability: SHAP values were computed for all tree-based models, and attention-based overlays were used in imaging tasks to visualize salient regions.
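
Two of the preprocessing steps listed above, HU windowing for CT intensities and median/mode imputation for clinical variables, are simple enough to sketch in plain Python. This is a minimal sketch under stated assumptions rather than the authors' pipeline: the source used MONAI-based transforms whose exact configuration is not given, the rescaling of the clipped window to [0, 1] is an assumption, and both function names are hypothetical.

```python
from statistics import median, mode


def window_hu(voxels, lower=-100.0, upper=240.0):
    """Clip HU values to [lower, upper], then rescale linearly to [0, 1].

    The window bounds come from the text; the [0, 1] rescaling is assumed.
    """
    span = upper - lower
    return [(min(max(v, lower), upper) - lower) / span for v in voxels]


def impute_column(values):
    """Fill None entries with the median (numeric) or mode (categorical)."""
    observed = [v for v in values if v is not None]
    if not observed:
        raise ValueError("column has no observed values")
    numeric = all(
        isinstance(v, (int, float)) and not isinstance(v, bool)
        for v in observed
    )
    fill = median(observed) if numeric else mode(observed)
    return [fill if v is None else v for v in values]


# Air (-500 HU) and dense bone (1000 HU) saturate at the window edges.
print(window_hu([-500, -100, 70, 240, 1000]))
# A missing age takes the median; a missing stage takes the most common value.
print(impute_column([50, None, 70, 60]))
print(impute_column(["II", None, "III", "II"]))
```

Windowing discards contrast outside the soft-tissue range so that the network's input range is spent on the intensities relevant to the pancreas. Median and mode imputation keep every patient in the cohort at the cost of shrinking variance in the imputed columns, which is one reason to drop high-missing-rate columns first, as the text describes.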

Version published to 10.1101/2025.07.12.25331423 on medRxiv
Jul 14, 2025

Related articles

AI-Based Computational Pathology for Precision Lung Cancer Management: A Systematic Review and Meta-Analysis of Diagnostics and Prognostic Algorithms

This article has 6 authors:
1. Ingkar Chegedekova
2. Akim Kapsalyamov
3. Prashant Kumar Jamwal
4. Zaidagul Kystaubayeva
5. Dimitris Parthimos
6. Bharat Jasani
Latest version Aug 6, 2025

Integrated histopathologic modeling of detailed tumor subtypes and actionable biomarkers

This article has 63 authors:
1. Kevin M Boehm
2. Madison Darmofal
3. Arfath Pasha
4. Andrew Aukerman
5. Raymond Lim
6. Evan Seffar
7. Tom Pollard
8. Natasha Rekhtman
9. Jason Chang
10. Armaan Kohli
11. Darin Moore
12. Marta Ligero
13. JianJiong Gao
14. Georgios Asimomitis
15. Anika Begum
16. Fresia Pareja
17. Hikmat Al-Ahmadie
18. Klaus J Busam
19. Nikolaos M Dimitriou
20. Meera Hameed
21. Ahmet Dogan
22. Lora H Ellenson
23. Jie-Fu Chen
24. Daniel Gomez
25. Nancy Lee
26. Eric Sherman
27. Himanshu Nagar
28. Lior Z Braunstein
29. Britta Weigelt
30. James Fagin
31. Susan Fu
32. Jonathan Alarcon
33. Neelraj Patil
34. Areej Alsaafin
35. A. Rose Brannon
36. Kofi Amoah
37. Jordan Eichholz
38. Martin Voss
39. Justin Jee
40. Christopher Fong
41. Michele Waters
42. Luke R.G. Pike
43. Pedram Razavi
44. Paul Romesser
45. Atif Khan
46. Walid K Chatila
47. Aasiya Islam
48. Elana Sverdlik
49. Ino de Bruijn
50. Zain-Ul-Abideen Nasir
51. Karl Pichotta
52. Jinru Shia
53. Cristina R. Antonescu
54. Victor Reuter
55. Jake June-Koo Lee
56. Marc Ladanyi
57. Orly Ardon
58. Kojo Elenitoba-Johnson
59. Michael F Berger
60. David Solit
61. Nikolaus Schultz
62. Sohrab P Shah
63. Francisco Sánchez-Vega
Latest version Aug 16, 2025

Explainable AI and Multiclassifiers for Staging Biomarker Discovery in Lung Squamous Cell Carcinoma

This article has 4 authors:
1. Débora V. C. Lima
2. Patrick Terrematte
3. Beatriz Stransky
4. Adrião D. D. Neto
Latest version Jul 11, 2025
