Large scale analysis and prediction of adverse maternal health events across heterogeneous populations from the All of Us dataset

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Objective: To investigate and predict adverse maternal health events across heterogeneous populations using the NIH's "All of Us Research Program" dataset. Methods: In this work, we develop pipelines for data extraction, cleaning, and pre-processing of electronic health records (EHR) (Conditions, Labs, Measurements) and survey responses from a multi-site pregnancy cohort (n=22,646 participants; 33,294 pregnancy episodes). We assess statistical correlations between lab measures, outcomes, and social determinants of health (SDoH). We then develop machine learning models (Logistic Regression, XGBoost, LSTM) to predict preeclampsia/eclampsia, preterm labor, depression/anxiety, gestational diabetes, miscarriage, and cardiomyopathy two weeks before clinical diagnosis. We conduct model interpretability post-hoc analysis to understand failure points and evaluate implications for socio-economic disparities. Finally, we conduct a clinician review of the top statistically-correlated, and machine learning-based features to determine whether these features have expected associations with outcomes and their use in current clinical practice. Results: Statistical analysis revealed expected and unexpected correlations between EHR features and outcomes, as well as associations between SDoH and adverse events. Machine learning models achieved the best performance for predicting gestational diabetes (AUROC=0.848) and miscarriage (AUROC=0.864) two weeks before diagnosis, with performance being sensitive to temporal resolution. We found that feature importance was correlated with feature availability and statistical significance. Fairness evaluations revealed disparities in model performance across SDoH subgroups, but minimal disparities across age or race subgroups. Clinician assessment identified features that were unexpected and/or not currently used in clinical practice Conclusion: This study demonstrates the potential of the "All of Us" dataset for understanding and predicting adverse maternal health outcomes. We demonstrate that meaningful populational level pattern can be extracted, and high-performing machine learning models can be trained on this diverse, multi-site dataset, enabled by rigorous data preprocessing and problem formulation methods. Most important features identified through correlation analysis align with known clinical risk factors, reinforcing the robustness of the dataset and our preprocessing pipeline. Notably, we also discovered some unexpected associations that warrant further clinical investigation. While machine learning models successfully leverage many of these relevant features, they also rely on some with limited or no apparent correlation to outcomes. These findings, if validated, could inform new strategies for maternal care. However, performance disparities across social determinants of health (SDoH) subgroups underscore the need for continued research and careful ethical consideration prior to clinical deployment.

Article activity feed