Medical Abbreviation Disambiguation with Large Language Models: Zero- and Few-Shot Evaluation on the MeDAL Dataset
Abstract
Abbreviation disambiguation is a critical challenge in processing clinical and biomedical text, where ambiguous short forms frequently obscure meaning. In this study, we assess the zero-shot performance of large language models (LLMs) on medical abbreviation disambiguation using the MeDAL dataset, a large-scale resource constructed from PubMed abstracts. Specifically, we evaluate GPT-4 and LLaMA, prompting each model with the surrounding context and asking it to infer the correct long-form expansion of an ambiguous abbreviation without any task-specific fine-tuning. Our results demonstrate that GPT-4 substantially outperforms LLaMA across a range of ambiguous terms, indicating a significant advantage of proprietary models in zero-shot medical language understanding. These findings suggest that LLMs, even without domain-specific training, can serve as effective tools for improving the readability and interpretability of biomedical text in NLP applications.
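To make the evaluation setup concrete, the sketch below illustrates what one such zero-shot query might look like. It is a minimal sketch, not the study's exact protocol: the OpenAI Python client, the model identifier, the MeDAL-style example context, and the candidate expansions are all illustrative assumptions.

```python
# Minimal sketch of a zero-shot abbreviation-disambiguation query.
# The example context, candidate expansions, and prompt wording are
# hypothetical; they do not reproduce the study's exact prompts.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# A MeDAL-style context containing an ambiguous abbreviation (assumed example).
context = (
    "The patient was started on insulin on admission; DM had been "
    "poorly controlled for several years prior to presentation."
)
abbreviation = "DM"
candidates = ["diabetes mellitus", "dermatomyositis", "myotonic dystrophy"]

# Build a zero-shot prompt: context plus candidate long forms, no examples.
prompt = (
    f"Context: {context}\n"
    f"Abbreviation: {abbreviation}\n"
    f"Candidate expansions: {', '.join(candidates)}\n"
    "Answer with the single expansion that best fits the context."
)

response = client.chat.completions.create(
    model="gpt-4",  # assumed model identifier
    messages=[
        {"role": "system", "content": "You disambiguate medical abbreviations."},
        {"role": "user", "content": prompt},
    ],
    temperature=0,  # deterministic output for evaluation
)
print(response.choices[0].message.content)
```

Restricting the answer to a fixed candidate list, as above, makes the model's output directly comparable against the gold expansion; a few-shot variant would simply prepend a handful of solved context-expansion pairs to the same prompt.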