Analyzing Information Disparities across Modalities in Mortality Prediction
Abstract
Recent advances in deep learning have enabled the integration of heterogeneous data modalities for clinical prediction, allowing models to exploit complex information embedded within electronic health records (EHRs). Among these modalities, chest radiographs (CXRs) provide a rich source of visual information that can enhance outcome prediction for patients in the intensive care unit (ICU). However, the comparative impact of different CXR representations (raw images versus radiology reports) on predictive performance has not been systematically investigated. Such comparisons are essential for identifying the most informative modality and understanding how it complements other data sources. This study compares the predictive utility of raw CXRs versus radiology reports for 30-day post-discharge mortality prediction in ICU patients. We employed a Vision–Language Model (VLM) that combines each CXR representation with patient discharge notes. On a filtered subset of the MIMIC-IV dataset (n = 1,360), augmenting discharge notes with CXRs achieved the best performance (AUROC = 0.843), surpassing both the discharge-note-only (AUROC = 0.816) and radiology-report-augmented (AUROC = 0.804) models. Across experiments, combining raw CXRs with discharge notes consistently outperformed augmentation with radiology reports. A radiologist’s review further revealed that reports often omitted clinically relevant findings visible in the images, indicating that CXRs convey richer prognostic signals for mortality risk. These findings underscore the critical role of modality selection in clinical AI systems and suggest that textual summaries should be used as surrogates for multimodal data with caution, as they may fail to capture critical predictive information.
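To make the experimental comparison concrete, the sketch below scores each patient under the three input configurations described above (discharge notes only, notes plus radiology report text, notes plus the raw CXR image) and evaluates each with AUROC. This is a minimal illustration, not the authors' pipeline: `vlm_mortality_score` and the cohort field names are hypothetical placeholders for whatever VLM interface and data schema the study actually used; only the AUROC computation via scikit-learn reflects a standard, real API.

```python
from typing import Optional
from sklearn.metrics import roc_auc_score


def vlm_mortality_score(note: str,
                        report: Optional[str] = None,
                        cxr_path: Optional[str] = None) -> float:
    """Hypothetical VLM call returning P(death within 30 days of discharge).

    Placeholder for the actual vision-language model used in the study.
    """
    raise NotImplementedError


def compare_configurations(cohort: list[dict], labels: list[int]) -> None:
    """Score the same cohort under each input configuration and report AUROC."""
    configs = {
        "notes only":          lambda p: vlm_mortality_score(p["note"]),
        "notes + radiology report": lambda p: vlm_mortality_score(p["note"], report=p["report"]),
        "notes + raw CXR":     lambda p: vlm_mortality_score(p["note"], cxr_path=p["cxr"]),
    }
    for name, score_fn in configs.items():
        preds = [score_fn(patient) for patient in cohort]
        print(f"{name}: AUROC = {roc_auc_score(labels, preds):.3f}")
```

Under this framing, the abstract's result corresponds to the "notes + raw CXR" configuration yielding the highest AUROC (0.843), ahead of notes only (0.816) and notes plus radiology report (0.804).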