Enhancing predictive modeling for respiratory support with LLM-driven guideline adherence

Xiaolei Lu
Michael Miller
Alex K. Pearce
Preeti Gupta
Thaidan T. Pham
James Ford
Atul Malhotra
Shamim Nemati

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background

Optimal respiratory support selection between high-flow nasal cannula (HFNC) and noninvasive ventilation (NIV) for intensive care units (ICU) patients at risk of invasive mechanical ventilation (IMV) remains unclear, particularly in cases not represented in prior clinical trials. We previously developed RepFlow-CFR, a deep counterfactual model estimating individualized treatment effects (ITE) of HFNC versus NIV. However, interpretability and guideline alignment remain challenges for clinical adoption. This study describes the development and integration of a clinical guideline-driven LLM to enhance deep counterfactual model recommendations for NIV versus HFNC in patients at high-risk for invasive mechanical ventilation.

Methods

We enhanced RepFlow-CFR by incorporating a large language model (LLM, Claude 3.5 Sonnet) to enforce clinical guideline adherence and generate explainable treatment recommendations. The LLM was configured in a HIPAA-compliant AWS environment and prompted using structured patient data, clinical notes, and formal guideline criteria. Recommendations from RepFlow-CFR and LLM were compared to actual treatment decisions to assess concordance. We evaluated IMV and mortality/hospice rates across concordant and discordant groups. Additionally, we conducted a structured chart review of 20 cases to assess the clinical validity and safety of LLM-driven recommendations.

Results

Among 1,261 ICU encounters, treatments concordant with LLM-enhanced recommendations were associated with lower IMV rates. For the HFNC recommendation, IMV occurred in 46/188 (24.47%) when care was concordant versus 9/17(52.94%) when discordant, corresponding to a 97.33% relative risk increase when discordant. Concordance was also associated with reduced mortality or hospice discharge (odds ratio 0.670, p = 0.046). In a 20-case chart review, 19/20 (95%) LLM recommendations aligned with clinical guidelines and physicians agreed with 13/20 (65%) final recommendations. Errors were noted in 11/20 cases, most rated low or moderate risk; 2/20 were judged as potentially causing severe harm.

Conclusions

Integrating LLMs for guideline enforcement improves the interpretability and clinical alignment of counterfactual models in respiratory support decision-making. This hybrid framework not only enhances concordance with real-world practice but may also improve patient outcomes. Future work will refine contraindication detection and expand validation to prospective clinical trials.

Version published to 10.1186/s13054-025-05739-3
Nov 14, 2025
Version published to 10.21203/rs.3.rs-7230335/v1 on Research Square
Aug 12, 2025

Esophageal Pressure Monitoring During Mechanical Ventilation: Principles and Practice – A Comprehensive Review

This article has 5 authors:
1. Ramakanth Pata
2. Joanna Kristeva
3. Bhanu Kosuru
4. Deepthi Devagudi
5. Oday Alhafidh
This article has no evaluationsLatest version Dec 15, 2025
Development and External Validation of a Nurse-Friendly Machine Learning Model for Early Identification of Intradialytic Hypotension in ICU Patients Receiving Renal Replacement Therapy

This article has 8 authors:
1. Zhenyuan Yu
2. Huan Tang
3. Wenjia Ye
4. Zixin Gu
5. Yu Fu
6. Rong Yao
7. Ying Guan
8. Yonghong Shen
This article has no evaluationsLatest version Jan 23, 2026
HRRT: A Holistic Renal Replacement Therapy Decision-Making Support System Using Hierarchical Reinforcement Learning

This article has 11 authors:
1. Qianyi Xu
2. Feng Wu
3. Zi Yi Christopher Thong
4. Mark Sen Liang Goh
5. Pengpeng Chen
6. Jie Yang
7. Chen Huang
8. Zhongheng Zhang
9. Yucai Hong
10. Kay Choong See
11. Mengling Feng
This article has no evaluationsLatest version Dec 11, 2025

Discuss this preprint

Listed in

Abstract

Background

Methods

Results

Conclusions

Article activity feed

Related articles

Esophageal Pressure Monitoring During Mechanical Ventilation: Principles and Practice – A Comprehensive Review

Development and External Validation of a Nurse-Friendly Machine Learning Model for Early Identification of Intradialytic Hypotension in ICU Patients Receiving Renal Replacement Therapy

HRRT: A Holistic Renal Replacement Therapy Decision-Making Support System Using Hierarchical Reinforcement Learning