Implementing a Resource-Light and Low-Code Large Language Model System for Information Extraction from Mammography Reports: A Case Study

Fabio Dennstädt
Simon Fauser
Nikola Cihoric
Max Schmerder
Paolo Lombardo
Grazia Maria Cereghetti
Sandro von Däniken
Thomas Minder
Jaro Meyer
Lawrence Chiang
Roberto Gaio
Luc Lerch
Irina Filchenko
Daniel Reichenpfader
Kerstin Denecke
Caslav Vojvodic
Igor Tatalovic
André Sander
Janna Hastings
Daniel M Aebersold
Hendrik von Tengg-Kobligk
Knud Nairz

Abstract

Background

Large Language Models (LLMs) have been successfully used to extract structured data from free-text radiology reports. Most current studies were conducted with private models accessed via an Application Programming Interface (API). We aimed to evaluate the feasibility of using open-source LLMs, deployed on limited local hardware resources, to extract structured information from free-text mammography reports according to a Common Data Elements (CDE)-based framework.

Methods

Seventy-nine CDEs were defined by an interdisciplinary expert panel, reflecting real-world reporting practice. Two independent researchers classified sixty-one reports, assigning 1533 classifications to establish the ground truth. Five different open-source LLMs deployable on a single GPU were used for data extraction using the general-classifier Python package. Extractions were performed with two different prompt approaches, and classification metrics were calculated overall and for subgroups. Additional analyses were conducted using thresholds for the relative probability of classifications.
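The threshold analysis mentioned above can be illustrated with a minimal sketch. It assumes that each classification comes with a score per candidate label, that the relative probability of a label is its score divided by the sum of all scores, and that classifications below the threshold are left unanswered. The function names and example data are hypothetical and are not taken from the general-classifier package.

```python
from typing import Dict, List, Optional, Tuple


def classify_with_threshold(scores: Dict[str, float], threshold: float) -> Optional[str]:
    """Return the top label if its relative probability reaches the threshold, else None."""
    total = sum(scores.values())
    if total <= 0:
        return None
    label, best = max(scores.items(), key=lambda kv: kv[1])
    # Relative probability (assumed definition): top score over the sum of all scores.
    return label if best / total >= threshold else None


def accuracy_and_coverage(
    predictions: List[Optional[str]], truths: List[str]
) -> Tuple[float, float]:
    """Accuracy over answered items, and the fraction of items that were answered."""
    answered = [(p, t) for p, t in zip(predictions, truths) if p is not None]
    coverage = len(answered) / len(truths) if truths else 0.0
    accuracy = sum(p == t for p, t in answered) / len(answered) if answered else 0.0
    return accuracy, coverage


# Hypothetical scores for a binary data element, paired with the ground truth.
examples = [
    ({"yes": 0.9, "no": 0.1}, "yes"),
    ({"yes": 0.55, "no": 0.45}, "no"),
    ({"yes": 0.2, "no": 0.8}, "no"),
    ({"yes": 0.6, "no": 0.4}, "yes"),
]
truths = [truth for _, truth in examples]

for threshold in (0.0, 0.75):
    predictions = [classify_with_threshold(s, threshold) for s, _ in examples]
    accuracy, coverage = accuracy_and_coverage(predictions, truths)
    print(f"threshold={threshold:.2f} accuracy={accuracy:.2f} coverage={coverage:.2f}")
```

Raising the threshold discards low-certainty classifications, so accuracy on the remaining items can rise while coverage falls.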

Results

High inter-rater agreement was observed between manual classifiers (Cohen’s Kappa 0.83). Using default prompts, the LLMs achieved accuracies of 59.23–72.86%. Adapting prompts to better explain classification tasks improved performance for all models, with accuracies of 64.71–85.32%. Setting certainty thresholds further improved accuracies to >90% but reduced the coverage rate to <50%.
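For readers unfamiliar with the agreement statistic reported above, the following is a minimal sketch of Cohen's Kappa for two raters. It assumes both raters labeled the same items in the same order. The abstract does not state whether the statistic was pooled across all classifications or computed per data element, so this pooled form is only an illustration, and the labels are hypothetical.

```python
from collections import Counter
from typing import Hashable, Sequence


def cohens_kappa(rater_a: Sequence[Hashable], rater_b: Sequence[Hashable]) -> float:
    """Cohen's Kappa: agreement between two raters, corrected for chance agreement."""
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("Both raters must label the same non-empty set of items.")
    n = len(rater_a)
    # Observed agreement: fraction of items on which the raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Expected agreement: chance overlap given each rater's label frequencies.
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)
    if expected == 1.0:
        return 1.0  # Both raters used a single identical label throughout.
    return (observed - expected) / (1.0 - expected)


# Hypothetical labels for four items.
rater_a = ["present", "present", "absent", "absent"]
rater_b = ["present", "absent", "absent", "absent"]
print(f"kappa = {cohens_kappa(rater_a, rater_b):.2f}")
```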

Conclusion

Locally deployed open-source LLMs can effectively extract information from mammography reports with good accuracy, addressing data privacy concerns while maintaining compatibility with limited computational resources. Prompt engineering substantially increases performance, highlighting the importance of optimization in clinical applications. Using a CDE-based framework provides clear semantics and structure, facilitating interoperability and consistent data extraction.

Version published to 10.1101/2025.04.08.25325371v1 on medRxiv
Apr 11, 2025

Related articles

A synthetic data generation framework for scalable and resource-efficient medical AI assistants

This article has 10 authors:
1. Abdurrahim Yilmaz
2. Furkan Yuceyalcin
3. Rahmetullah Varol
4. Ece Gokyayla
5. Ozan Erdem
6. Donghee Choi
7. Ali Anil Demircali
8. Gulsum Gencoglan
9. Joram M. Posma
10. Burak Temelkuran
This article has no evaluations. Latest version: May 18, 2025

Privacy-Preserving Information Extraction Framework for Diverse Imaging Reports using Large Language Models

This article has 9 authors:
1. Dabin Min
2. Soyeon Kim
3. Sangheum Hwang
4. Kwang Nam Jin
5. SangHeum Bang
6. Won Gi Jeong
7. Jinwook Choi
8. Jung Min Chang
9. Chang Min Park
This article has no evaluations. Latest version: May 6, 2025

A Standard Framework for Converting Coronary Angiography Reports into Machine-Readable Format Using Large Language Models

This article has 5 authors:
1. Ji Woo Song
2. Ji Yong Jang
3. Hyeongsoo Kim
4. Young-Guk Ko
5. Seng Chan You
This article has no evaluations. Latest version: May 6, 2025
