Clinical Evaluation of a PACS-Integrated Deep Learning Tool for Intracranial Hemorrhage Severity Assessment: Comparison with LLM-Based Report Interpretation

Santiago Cepeda
Olga Esteban-Sinovas
Ignacio Arrese
Trinidad Escudero
Jesús Garzón
María Hernández
Teresa Guerra
Pilar Sanz
Hermógenes Calero-Aguilar
Francisco Herrero
Juan José Jiménez González
Diego Hernán Ferradal
Rosario Sarabia

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background : Radiology reports are the standard method for communicating imaging findings in intracranial hemorrhage (ICH), yet it remains unclear whether narrative descriptions accurately reflect overall hemorrhage burden compared with objective quantitative imaging metrics. We aimed to compare the clinical associations of an automated deep learning–based severity index (CEREBLEED) with severity assessments extracted from radiology reports using large language models (LLMs). Methods : This prospective single-center study enrolled consecutive adult patients with ICH on non-contrast CT between June and December 2025. CEREBLEED, integrated into the institutional PACS, automatically segmented hemorrhages and computed a composite Severity Index incorporating volume, location, intraventricular extension, and midline shift. Three LLMs (Claude, GPT-4, Gemini) independently extracted severity categories (mild/moderate/severe) from radiology reports using standardized prompts. Outcomes included Glasgow Coma Scale (GCS) at admission, emergent surgical intervention, and Glasgow Outcome Scale Extended (GOSE) at discharge. Agreement was assessed with Cohen’s κ, correlations with Spearman’s ρ, discriminative performance with AUC and reclassification indices (NRI, IDI), and prognostic value with ordinal logistic regression and likelihood ratio testing. Results : Of 186 patients analyzed (mean age 70.5 years, 52.7% male), 56.5% required ICU admission, 17.2% underwent emergent surgery, and 40.3% had unfavorable outcome (GOSE 1–4). Agreement between CEREBLEED and LLM-derived severity was moderate (κ=0.51–0.52), while inter-LLM agreement was substantial (κ=0.77–0.82), suggesting systematic differences in information content rather than extraction variability. CEREBLEED correlated more strongly with GOSE (ρ=−0.715) than LLM-derived severity (ρ=−0.569 to −0.628). For surgical intervention prediction, CEREBLEED achieved superior discrimination (AUC 0.843 vs 0.733–0.754) with significant reclassification improvement over all LLMs (NRI 0.26–0.35, all p<0.01). In univariable ordinal regression, CEREBLEED showed the best model fit for GOSE prediction (pseudo-R²=0.194 vs 0.123–0.164 for LLMs). In multivariable analysis adjusted for age and GCS, CEREBLEED remained independently prognostic (OR 0.30 per 1-SD, p<0.001). Likelihood ratio testing confirmed significant incremental value of CEREBLEED beyond age, GCS, and hemorrhage volume (χ²=18.97, p<0.001). Conclusions : Automated severity quantification with CEREBLEED showed stronger prognostic performance for clinical outcomes than severity estimates derived from radiology-report interpretation by LLMs. This likely reflects the added value of objective, continuous imaging biomarkers compared with information available in narrative reports. Quantitative imaging tools may therefore complement routine radiological assessment, supporting more consistent severity communication and clinical decision-making in ICH care.

Version published to 10.21203/rs.3.rs-8917238/v1 on Research Square
Feb 23, 2026

Segmenting with Confidence: Uncertainty Quantification for Brain Tumor Imaging

This article has 8 authors:
1. Yassine Guennoun
2. Pierre Nedelec
3. Mark McArthur
4. Evan Bloch
5. Jinchi Wei
6. Leo Sugrue
7. Evan Calabrese
8. Andreas Rauschecker
This article has no evaluationsLatest version Jan 9, 2026
Analysis of Risk Factors and Construction of Nomogram Prediction Model for Hydrocephalus after Intracranial Hemorrhage in Children

This article has 1 author:
1. Jianxun He
This article has no evaluationsLatest version Feb 18, 2026
Detection of hemorrhagic transformation of cerebral infarcts with Ultra-low field MRI

This article has 13 authors:
1. Dimah Hasan
2. Julian Sauer
3. Clara Heller
4. Annika Rieder
5. Konstantin Ueffing
6. Frederic de Beukelaer
7. Jörg Schulz
8. Johannes Schiefer
9. Manuel Dafotakis
10. Arno Reich
11. Martin Wiesmann
12. Florian Holtbernd
13. Charlotte Weyland
This article has no evaluationsLatest version Feb 4, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Segmenting with Confidence: Uncertainty Quantification for Brain Tumor Imaging

Analysis of Risk Factors and Construction of Nomogram Prediction Model for Hydrocephalus after Intracranial Hemorrhage in Children

Detection of hemorrhagic transformation of cerebral infarcts with Ultra-low field MRI