Machine Learning for Dynamic and Short-term Prediction of Preeclampsia Using Routine Clinical and Laboratory Data

Haoyang Li
Yaxin Li
Chengxi Zang
Weishen Pan
He S. Yang
Tracy B. Grossman
Zhen Zhao
Fei Wang

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Preeclampsia (PE) is a leading cause of maternal and perinatal morbidity and mortality, yet its unpredictable onset and rapid progression hinder timely management. Existing prediction tools often rely on specialized biomarkers, static assessments, or limited study cohorts, impeding clinical utility and generalizability. We conducted a retrospective, multi-site cohort study including 58,839 pregnancies delivered at three NewYork-Presbyterian hospitals. Using routine information captured within the electronic health record (EHR), including blood pressure with other maternal characteristics, and routine laboratory tests, we developed extreme gradient boosting (XGBoost) based models to predict PE onset within 1-, 2-, and 4-week horizons across different gestational ages. Performance was assessed using nested cross-validation at the training site and externally validated through direct transfer, fine-tuning, and retraining strategies. Prediction accuracy increased from 28 to 34 weeks of gestational age, peaked at 34 weeks (AUC 0.863 at training; 0.808–0.834 at validation), declined at 38 weeks, and rebounded near delivery (AUC up to 0.890). Blood pressure was the most consistent predictor, while laboratory features such as albumin, alkaline phosphatase, and hematologic indices added value earlier, and demographic and obstetric factors gaining importance later. Dynamic short-term prediction of PE in late gestation is feasible using routine data. This pragmatic, scalable approach provides opportunities for early intervention and is adaptable across diverse healthcare settings.

Version published to 10.1101/2025.09.29.25336926 on medRxiv
Sep 30, 2025

A Machine Learning Model Based on First-Trimester Lipidomic Signatures for Predicting Metabolic Pregnancy Complications

This article has 7 authors:
1. Alisa Tokareva
2. Natalia A. Frankevich
3. Vitaliy Chagovets
4. Anna Derenko
5. Vadim Lagutin
6. Vladimir Frankevich
7. Gennady Sukhikh
This article has no evaluationsLatest version Oct 27, 2025
MACHINE LEARNING PREDICTIVE MODELS FOR POSTPARTUM HEMORRHAGE: A SYSTEMATIC REVIEW AND META-ANALYSIS

This article has 13 authors:
1. Carlos R. Mustre-Juarez
2. Andrea Godinez-Medina
3. Diana L. Brandt-Perez
4. Mariana M. Carachure-Rendon
5. Clara E. Gutierrez-Simpson
6. Briana M. Rodriguez-Paniagua
7. Olivia Vazquez-Hernandez
8. Paul Bain
9. Sandra Acevedo
10. Jose A. Ramirez-Calvo
11. Mario Rodriguez-Bosch
12. Maria Jose Rodriguez-Sibaja
13. Mario I. Lumbreras-Marquez
This article has no evaluationsLatest version Oct 13, 2025
Machine learning-optimized perinatal depression screening: Maximum impact, minimal burden

This article has 7 authors:
1. Eric Hurwitz
2. Caroline Shell
3. Kritika Chugh
4. Veerle Bergink
5. Rena C. Patel
6. Crystal Schiller
7. Melissa A. Haendel
This article has no evaluationsLatest version Oct 17, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Machine Learning Model Based on First-Trimester Lipidomic Signatures for Predicting Metabolic Pregnancy Complications

MACHINE LEARNING PREDICTIVE MODELS FOR POSTPARTUM HEMORRHAGE: A SYSTEMATIC REVIEW AND META-ANALYSIS

Machine learning-optimized perinatal depression screening: Maximum impact, minimal burden