Machine learning-optimized perinatal depression screening: Maximum impact, minimal burden

Eric Hurwitz
Caroline Shell
Kritika Chugh
Veerle Bergink
Rena C. Patel
Crystal Schiller
Melissa A. Haendel

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Introduction

Perinatal depression affects up to 30% of pregnant and postpartum women, which has increased since the COVID-19 pandemic, making rapidly identifying affected women a high clinical priority. While screening tools like the Edinburgh Postnatal Depression Scale (EPDS) are widely used, brevity is important for busy clinical practice to reduce administration time and patient burden. Current methods to shorten assessments rely on traditional psychometric approaches, rather than machine learning (ML) methods that could optimize predictive accuracy.

Methods

We developed a ML framework using National Clinical Cohort Collaborative (N3C) data to predict full 10-item EPDS scores from shortened question subsets (n=22,924). We evaluated all 2-5 item combinations using linear regression, validating performance across multiple cohorts including postpartum women (n=7,750) and an external non-N3C pregnancy population (n=1,217). For additional validation, we applied our approach to the PHQ-9 (n=398,606) to test generalizability. Binary classification models using clinical thresholds (≥13) determined EPDS screening accuracy. Decision curve analysis was performed to assess the clinical utility of our ML method.

Results

The optimal 2-question EPDS combinations Q4+Q8 (anxiety/sadness) and Q5+Q8 (scared/sadness) both achieved R ² =0.70. Binary classification demonstrated strong performance (sensitivity=0.68-0.72, specificity=0.98-0.99). The framework generalized across postpartum subsets, external pregnancy cohorts, and PHQ-9 validation (R ² =0.64-0.73). Adding covariates did not improve performance. Decision curve analysis showed our ML approach had superior clinical benefit (0.01-0.03) versus traditional additive scoring.

Conclusion/Implications

Our ML framework suggests a reduced assessment burden with two EPDS questions maintains predictive accuracy as the full-item EPDS. With ∼3.6 million annual U.S. births, this approach could identify additional positive perinatal depression screens, enhancing screening implementation across clinical settings.

Version published to 10.1101/2025.10.13.25337771 on medRxiv
Oct 17, 2025

Comparative Machine Learning Models for Early Prediction of Preterm Birth from Maternal Serum Biomarkers

This article has 7 authors:
1. Kaleem Maqsood
2. Javeria Malik
3. Mahnoor Fatima
4. Sundas Akram
5. Husna Ahmad
6. Nabila Roohi
7. Shahid Bashir
This article has no evaluationsLatest version Dec 16, 2025
Predicting Low Birth Weight in India Using Machine Learning Techniques: Insights from NFHS-5

This article has 2 authors:
1. Vikas Kamble
2. Basil Edolikkandy
This article has no evaluationsLatest version Feb 3, 2026
Considerations for evaluating the practical utility of machine learning in suicide risk estimation: the role of cost and equity

This article has 5 authors:
1. Christopher Kitchen
2. Anas Belouali
3. Paul S Nestadt
4. Holly C Wilcox
5. Hadi Kharrazi
This article has no evaluationsLatest version Dec 30, 2025

Discuss this preprint

Listed in

Abstract

Introduction

Methods

Results

Conclusion/Implications

Article activity feed

Related articles

Comparative Machine Learning Models for Early Prediction of Preterm Birth from Maternal Serum Biomarkers

Predicting Low Birth Weight in India Using Machine Learning Techniques: Insights from NFHS-5

Considerations for evaluating the practical utility of machine learning in suicide risk estimation: the role of cost and equity