Machine learning-optimized perinatal depression screening: Maximum impact, minimal burden

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Introduction

Perinatal depression affects up to 30% of pregnant and postpartum women, which has increased since the COVID-19 pandemic, making rapidly identifying affected women a high clinical priority. While screening tools like the Edinburgh Postnatal Depression Scale (EPDS) are widely used, brevity is important for busy clinical practice to reduce administration time and patient burden. Current methods to shorten assessments rely on traditional psychometric approaches, rather than machine learning (ML) methods that could optimize predictive accuracy.

Methods

We developed an ML framework using National Clinical Cohort Collaborative (N3C) data to predict full 10-item EPDS scores from shortened question subsets (n=22,924). We evaluated all 2-5 item combinations using linear regression, validating performance across multiple cohorts including postpartum women (n=7,750) and external pregnancy populations (n=1,217). For additional validation, we applied our approach to the PHQ-9 (n=398,606) to test generalizability. Binary classification models using clinical thresholds (≥13) determined EPDS screening accuracy. Decision curve analysis was performed to assess the clinical utility of our ML method.

Results

The optimal 2-question EPDS combinations Q4+Q8 (anxiety/sadness) and Q5+Q8 (scared/sadness) both achieved R 2 =0.70. Binary classification demonstrated strong performance (sensitivity=0.68-0.72, specificity=0.98-0.99). The framework generalized across postpartum subsets, external pregnancy cohorts, and PHQ-9 validation (R 2 =0.64-0.73). Adding covariates did not improve performance. Decision curve analysis showed our ML approach had superior clinical benefit (0.01-0.03) versus traditional additive scoring.

Conclusion/Implications

Our ML framework suggests a reduced assessment burden with two EPDS questions maintains predictive accuracy as the full-item EPDS. With ∼3.6 million annual U.S. births, this approach could identify additional positive perinatal depression screens, enhancing screening implementation across clinical settings.

Article activity feed