Understanding the Inner Workings of Large Language Models in Medicine

Abstract

Background: Large language models (LLMs) are increasingly influencing medical practice, education, and research. Their responsible integration into healthcare requires expertise in medical, ethical, practical, and theoretical domains.

Objectives: This article examines how theoretical knowledge of LLMs and their internal mechanisms enhances the interpretation of model outputs in medical contexts.

Methods: We prompted GPT-o1 to generate examples illustrating how understanding transformer architecture can facilitate output interpretation. Key topics were extracted from its responses, and illustrative cases were validated using Consensus.app, an AI-based web-search tool.

Results: Five key topics were identified: (1) anticipating contextual focus in medical reasoning, (2) explaining “generic” or “textbook” responses, (3) understanding strengths and weaknesses in differential diagnosis, (4) explaining ambiguous or contradictory responses, and (5) identifying hallucinations in unfamiliar scenarios. Case examples highlight both benefits and limitations, including accurate attention to salient clinical details, reliance on generalized patterns, risks of base rate neglect in differential diagnosis, challenges of ambiguous prompts, and hallucinations in rare or underrepresented cases.

Conclusions: A theoretical understanding of LLMs is crucial for responsible clinical integration. Distinguishing between well-represented (short head) and underrepresented (long tail) knowledge, recognizing generic responses, and identifying hallucinations are essential competencies. Coupled with medical and ethical expertise, these skills will enable healthcare professionals to leverage LLMs effectively while mitigating risks.