Procedural Guideline Execution Training Improves LLM Performance in Rule-Based Clinical Tasks

Abstract

Large Language Models (LLMs) offer transformative potential for Clinical Decision Support (CDS) by processing complex medical information and generating actionable insights. However, ensuring their reliability, strict adherence to Clinical Practice Guidelines (CPGs), and interpretability remains a critical challenge for safe clinical deployment. Existing methods, such as prompting strategies and the integration of external CPG structures, provide guidance but do not intrinsically train the LLM to execute procedural guideline logic. To address this gap, we propose Procedural Guideline Execution Training (PGET), a novel fine-tuning approach that trains LLMs to generate step-by-step execution traces explicitly demonstrating the application of CPG rules to a patient scenario. We evaluate PGET using diverse LLMs on a synthetic dataset for COVID-19 outpatient treatment, comparing it against Zero-Shot Prompting and established CPG integration methods such as Binary Decision Tree integration. Our experiments, leveraging both automatic CPG adherence metrics and expert human evaluation, demonstrate that PGET significantly outperforms the comparison methods, achieving higher CPG adherence, clinical accuracy, and interpretability. The generated execution traces provide valuable transparency, fostering trust in the model's recommendations. PGET offers a promising path towards building more reliable, transparent, and guideline-compliant AI systems for clinical decision support.
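
To make the idea of an execution-trace training target concrete, the sketch below constructs a single hypothetical PGET-style fine-tuning example: a toy patient scenario is run through a few illustrative outpatient-treatment rules, and each rule application is recorded as a step in the target text. The `Patient` fields, the rules and thresholds, and the trace format are assumptions for illustration only; they are not the dataset, guideline, or schema used in the paper.

```python
# Illustrative sketch of a PGET-style training example (not the paper's actual data format).
import json
from dataclasses import dataclass


@dataclass
class Patient:
    age: int
    days_since_symptom_onset: int
    requires_supplemental_oxygen: bool
    high_risk_condition: bool


def build_pget_example(patient: Patient) -> dict:
    """Apply a toy outpatient-treatment guideline rule by rule, recording each
    step, and return a (prompt, target) pair for supervised fine-tuning."""
    trace = []

    # Rule 1 (toy): the outpatient pathway applies only if no supplemental oxygen is needed.
    eligible_outpatient = not patient.requires_supplemental_oxygen
    trace.append(
        f"Step 1: Rule 'outpatient eligibility' -> requires_supplemental_oxygen="
        f"{patient.requires_supplemental_oxygen}, so eligible_outpatient={eligible_outpatient}."
    )

    # Rule 2 (toy): antiviral treatment window of 5 days from symptom onset.
    within_window = patient.days_since_symptom_onset <= 5
    trace.append(
        f"Step 2: Rule 'treatment window' -> days_since_symptom_onset="
        f"{patient.days_since_symptom_onset}, so within_window={within_window}."
    )

    # Rule 3 (toy): recommend treatment if high-risk, eligible, and within the window.
    recommend_antiviral = eligible_outpatient and within_window and patient.high_risk_condition
    trace.append(
        f"Step 3: Rule 'high-risk treatment' -> high_risk_condition="
        f"{patient.high_risk_condition}, so recommend_antiviral={recommend_antiviral}."
    )

    recommendation = (
        "Recommend outpatient antiviral treatment."
        if recommend_antiviral
        else "Do not recommend outpatient antiviral treatment; reassess per guideline."
    )

    return {
        "prompt": f"Patient: {patient}. Apply the outpatient treatment guideline step by step.",
        "target": "\n".join(trace) + f"\nConclusion: {recommendation}",
    }


if __name__ == "__main__":
    example = build_pget_example(
        Patient(age=67, days_since_symptom_onset=3,
                requires_supplemental_oxygen=False, high_risk_condition=True)
    )
    print(json.dumps(example, indent=2))
```

Pairs of this shape could then be used for standard supervised fine-tuning, so that the model learns to emit the intermediate rule applications rather than only the final recommendation; the transparency claimed in the abstract comes from that emitted trace.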