Exploring the limits of LLMs in low-resource information extraction: Case study in brain MRI reports for Epilepsy

Abstract

Information extraction (IE) from specialized clinical texts such as brain MRI reports is important in various clinical and population health contexts. However, this topic is under-explored, owing to privacy concerns that limit data availability and to the inherent complexity and domain-specificity of clinical language. Common methods, which rely on substantial amounts of training data, fail in this setting. Recent advances in large language model (LLM) research offer a promising way to bridge the data scarcity gap, with improved ability to adapt to novel tasks with little supervision. We introduce a new, challenging dataset of 100 expert-annotated brain MRI reports, featuring 152 fine-grained entity types and 4 relation types, characterised by low inter-annotator agreement. This task reflects the inherent complexity and real-world ambiguity of medical text.

We evaluate a small, open-weight LLM on span detection, named entity recognition, and relation extraction tasks, comparing few-shot prompting and parameter-efficient fine-tuning against specialized off-the-shelf biomedical IE systems. Our results demonstrate that both few-shot and fine-tuned LLM approaches substantially outperform the off-the-shelf baselines. Although LLMs come out ahead, absolute performance remains modest, particularly for complex relations and fine-grained entities, reflecting the dataset's inherent difficulty and the extreme low-resource setting.
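To make the few-shot prompting setup concrete, here is a minimal, hypothetical sketch of how one might build a few-shot NER prompt for an open-weight LLM and parse its completion back into entity spans. The example sentences, entity types, and output format below are illustrative assumptions, not the paper's actual annotation scheme or pipeline; the model call itself is omitted.

```python
# Hypothetical few-shot NER prompting sketch for radiology-style text.
# Example sentences, entity types, and the "span | type" output format
# are illustrative assumptions, not the paper's actual scheme.

# In-context examples: (sentence, [(entity span, entity type), ...])
FEW_SHOT_EXAMPLES = [
    ("Mild hippocampal atrophy on the left.",
     [("hippocampal atrophy", "Finding"), ("left", "Laterality")]),
    ("No evidence of acute infarct.",
     [("acute infarct", "Finding")]),
]

def build_prompt(report_sentence: str) -> str:
    """Assemble a few-shot prompt: instruction, labelled examples, query."""
    parts = ["Extract entities from the text as 'span | type' lines."]
    for text, entities in FEW_SHOT_EXAMPLES:
        parts.append(f"Text: {text}")
        parts.extend(f"- {span} | {etype}" for span, etype in entities)
    parts.append(f"Text: {report_sentence}")
    return "\n".join(parts)

def parse_entities(model_output: str) -> list[tuple[str, str]]:
    """Parse 'span | type' lines from a model completion."""
    entities = []
    for line in model_output.splitlines():
        line = line.strip().lstrip("- ")
        if "|" in line:
            span, etype = (p.strip() for p in line.split("|", 1))
            entities.append((span, etype))
    return entities
```

Predicted spans parsed this way can then be matched against gold annotations to score span detection (span only) and NER (span plus type).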
