Leveraging Large Language Models to Develop an Interpretable Prediction Model for Postpartum Hemorrhage Prior to the Onset of Labor

Elizabeth G Woo
Israel Zighelboim
Tyler Gifford
Joseph G Bell
Hannah Milthorpe
Emily Alsentzer
Ryan E Longman
Jorge E Tolosa
Brett K Beaulieu-Jones

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Objective

To evaluate whether large language models (LLMs) applied to prenatal clinical notes can predict postpartum hemorrhage (PPH) prior to the onset of labor and to compare model performance across outcome definitions, including a novel intervention-based definition.

Methods

We conducted a retrospective cohort study of 19,992 deliveries within a large regional health network. Two outcome definitions for PPH were used: estimated or quantitative blood loss (EBL/QBL) extracted from clinical notes, and a clinical intervention-based definition (cPPH) incorporating transfusion, uterotonics, Bakri balloon, or hysterectomy. We evaluated three approaches for PPH prediction: (1) supervised machine learning using structured electronic medical record data; (2) direct prediction using a fine-tuned LLM applied to clinical notes; and (3) interpretable models using LLM-extracted features combined with structured data. Model performance was evaluated using area under the receiver operating characteristic curve (AUROC) on a temporally held-out test set.

Results

The LLM-based direct prediction model achieved the highest performance for both PPH definitions (AUROC 0.79–0.80), followed by interpretable models combining LLM-extracted features with structured data (AUROC 0.76–0.78). Models using only structured data performed worse (AUROC 0.65–0.71). The LLM-extracted features approach identified 47 significant predictors, including established risk factors such as multiple gestation and previous cesarean delivery. Demographic differences were observed between PPH definitions: mothers who met only the cPPH definition had lower gestational age and higher rates of cesarean delivery compared to those meeting only the EBL/QBL definition.

Conclusion

These findings highlight the potential of LLM-based approaches for enhancing PPH risk stratification, with the feature extraction method offering a promising balance between predictive performance and clinical utility. Integrating these methods into clinical workflows could improve early detection and guide targeted preventive interventions.

Version published to 10.1101/2025.03.23.25324452v1 on medRxiv
Mar 25, 2025

Construction of a prediction model for adverse perinatal outcomes in fetal growth restriction based on a machine learning algorithm

This article has 6 authors:
1. Xiangli Meng
2. Lei Wang
3. Minghui Wu
4. Na Zhang
5. Xiaofei Li
6. Qingqing Wu
This article has no evaluationsLatest version Apr 16, 2025
Interpretable machine learning model for predicting low birth weight in singleton pregnancies: a retrospective cohort study

This article has 9 authors:
1. Xiaojuan Wu
2. Qingxiang Zhao
3. Yong Gao
4. Yiyu Zhang
5. Linrui Xu
6. Xianzhu Cong
7. Na Sun
8. Fuyan Shi
9. Suzhen Wang
This article has no evaluationsLatest version May 22, 2025
Evaluation of Machine Learning Models for Early Prediction of Gestational Diabetes Using Retrospective Electronic Health Records from Current and Previous Pregnancies

This article has 4 authors:
1. Mark Germaine
2. Amy C O’Higgins
3. Brendan Egan
4. Graham Healy
This article has no evaluationsLatest version May 13, 2025

Listed in

Abstract

Objective

Methods

Results

Conclusion

Article activity feed

Related articles

Construction of a prediction model for adverse perinatal outcomes in fetal growth restriction based on a machine learning algorithm

Interpretable machine learning model for predicting low birth weight in singleton pregnancies: a retrospective cohort study

Evaluation of Machine Learning Models for Early Prediction of Gestational Diabetes Using Retrospective Electronic Health Records from Current and Previous Pregnancies