Machine learning-optimized perinatal depression screening: Maximum impact, minimal burden
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Introduction
Perinatal depression affects up to 30% of pregnant and postpartum women, which has increased since the COVID-19 pandemic, making rapidly identifying affected women a high clinical priority. While screening tools like the Edinburgh Postnatal Depression Scale (EPDS) are widely used, brevity is important for busy clinical practice to reduce administration time and patient burden. Current methods to shorten assessments rely on traditional psychometric approaches, rather than machine learning (ML) methods that could optimize predictive accuracy.
Methods
We developed an ML framework using National Clinical Cohort Collaborative (N3C) data to predict full 10-item EPDS scores from shortened question subsets (n=22,924). We evaluated all 2-5 item combinations using linear regression, validating performance across multiple cohorts including postpartum women (n=7,750) and external pregnancy populations (n=1,217). For additional validation, we applied our approach to the PHQ-9 (n=398,606) to test generalizability. Binary classification models using clinical thresholds (≥13) determined EPDS screening accuracy. Decision curve analysis was performed to assess the clinical utility of our ML method.
Results
The optimal 2-question EPDS combinations Q4+Q8 (anxiety/sadness) and Q5+Q8 (scared/sadness) both achieved R 2 =0.70. Binary classification demonstrated strong performance (sensitivity=0.68-0.72, specificity=0.98-0.99). The framework generalized across postpartum subsets, external pregnancy cohorts, and PHQ-9 validation (R 2 =0.64-0.73). Adding covariates did not improve performance. Decision curve analysis showed our ML approach had superior clinical benefit (0.01-0.03) versus traditional additive scoring.
Conclusion/Implications
Our ML framework suggests a reduced assessment burden with two EPDS questions maintains predictive accuracy as the full-item EPDS. With ∼3.6 million annual U.S. births, this approach could identify additional positive perinatal depression screens, enhancing screening implementation across clinical settings.