Large Language Models in Stroke Management: A Review of the Literature

Abstract

Stroke care generates vast free-text records that slow chart review and hamper data reuse. Large language models (LLMs) have been trialed as a remedy in tasks ranging from imaging interpretation to outcome prediction. To assess current applications of LLMs in stroke management, we conducted a narrative review by searching PubMed and Google Scholar databases on January 30, 2025, using stroke- and LLM-related terms. This review included fifteen studies demonstrating that LLMs can: (i) extract key variables from thrombectomy reports with up to 94% accuracy, (ii) localize stroke lesions from case-report text with F1 scores of 0.74–0.85, and (iii) forecast functional outcome more accurately than legacy bedside scores in small pilot cohorts.

These results, however, rest on narrow, retrospective datasets, often from single centers or publicly available case reports that the models may have encountered during pre-training. Most evaluations use proprietary systems, limiting reproducibility and obscuring prompt design. None stratify performance by sex, language, or socioeconomic status, and few disclose safeguards against hallucination or data leakage.

We conclude that LLMs are credible research tools for text mining and hypothesis generation in stroke, but evidence for clinical deployment remains preliminary. Rigorous, multisite validation, open benchmarks, bias audits, and human-in-the-loop workflows are prerequisites before LLMs can reliably support time-critical decisions such as thrombolysis or thrombectomy triage.
