Prior Knowledge Shapes Fine-Tuning Success for Biomedical Term Normalization
Abstract
Large language models (LLMs) often fail to correctly link biomedical terms to their standardized ontology identifiers, posing challenges for downstream applications that depend on accurate, machine-readable codes. These linking failures can compromise the integrity of data used in precision medicine, clinical decision support, and population health. Fine-tuning can partially remedy these issues, but the degree of improvement varies across terms and terminologies. Focusing on the Human Phenotype Ontology (HPO), we show that a model’s prior knowledge of term–identifier pairs, acquired during pre-training, strongly predicts whether fine-tuning will enhance its linking accuracy. We evaluate prior knowledge in three complementary ways: (1) latent probabilistic knowledge, revealed through stochastic prompting, which captures hidden associations not evident in deterministic output; (2) partial subtoken knowledge, reflected in incomplete but non-random generation of identifier components; and (3) term familiarity, inferred from annotation frequencies in the biomedical literature, which serve as a proxy for training exposure. We then assess how these forms of prior knowledge influence deterministic accuracy in identifier linking. Fine-tuning performance varies most for terms in what we call the reactive middle zone of the ontology: terms with intermediate levels of prior knowledge that are neither absent nor fully consolidated. These terms exhibit the largest gains or losses in accuracy during fine-tuning, suggesting that the success of knowledge injection depends critically on the model’s initial familiarity with the term–identifier pair.
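To make the measures concrete, the following is a minimal sketch (not the authors' evaluation code) of how one might score a single term–identifier pair, assuming the model outputs have already been collected: one greedy (temperature 0) generation for deterministic accuracy and several temperature-sampled generations for latent probabilistic knowledge. The positional digit-overlap proxy for partial subtoken knowledge, and the example term and identifier, are illustrative assumptions rather than the paper's exact metrics.

```python
import re
from typing import Iterable, Optional

# HPO identifiers have the form HP: followed by seven digits, e.g. HP:0001250.
HPO_ID_PATTERN = re.compile(r"HP:\d{7}")

def extract_hpo_id(text: str) -> Optional[str]:
    """Pull the first HPO-style identifier out of a model response, if any."""
    match = HPO_ID_PATTERN.search(text)
    return match.group(0) if match else None

def deterministic_accuracy(greedy_response: str, gold_id: str) -> bool:
    """Exact match between the single greedy generation and the gold identifier."""
    return extract_hpo_id(greedy_response) == gold_id

def latent_probabilistic_knowledge(samples: Iterable[str], gold_id: str) -> float:
    """Fraction of stochastic (temperature > 0) samples containing the gold identifier."""
    responses = list(samples)
    if not responses:
        return 0.0
    hits = sum(extract_hpo_id(r) == gold_id for r in responses)
    return hits / len(responses)

def partial_subtoken_knowledge(greedy_response: str, gold_id: str) -> float:
    """Illustrative proxy: positional digit overlap between the generated and gold
    identifier codes (1.0 = identical digits, 0.0 = no identifier generated)."""
    predicted = extract_hpo_id(greedy_response)
    if predicted is None:
        return 0.0
    pred_digits, gold_digits = predicted.split(":")[1], gold_id.split(":")[1]
    matches = sum(p == g for p, g in zip(pred_digits, gold_digits))
    return matches / len(gold_digits)

if __name__ == "__main__":
    # Hypothetical model outputs for the phenotype term "Seizure"; the gold
    # identifier is assumed here to be HP:0001250. In practice these strings
    # would come from LLM API calls, which are omitted from this sketch.
    gold = "HP:0001250"
    greedy = "The HPO identifier for Seizure is HP:0001230."
    samples = [
        "Seizure maps to HP:0001250.",
        "I believe the code is HP:0001250.",
        "Seizure is annotated as HP:0002060.",
    ]
    print("deterministic accuracy:", deterministic_accuracy(greedy, gold))                   # False
    print("latent probabilistic knowledge:", latent_probabilistic_knowledge(samples, gold))  # 2/3
    print("partial subtoken knowledge:", partial_subtoken_knowledge(greedy, gold))           # 6/7 digits match
```

In this toy case the greedy answer is wrong, yet the stochastic samples recover the correct identifier two times out of three and the greedy output differs from the gold code by a single digit: the kind of intermediate prior knowledge the abstract associates with the reactive middle zone, where fine-tuning outcomes vary most.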