Maternal health Aggregated Trends can be Misleading: The power of N-of-1 Level Wearable Data Analysis for Personalized Pregnancy Monitoring

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Background

Personal digital health technologies (DHTs) enable real-time monitoring of physiological metrics and behavioral data, including HRV, supporting early detection of pregnancy-related conditions and personalized care throughout the perinatal period. While recent studies demonstrate the utility of personal DHTs in tracking pregnancy-related symptoms, they often rely on aggregate statistical methods that overlook individual variability.

Objective

To compare aggregate and individual-level analyses of digital health technology (DHT) data for early detection of pregnancy-related conditions, using the comprehensive BUMP dataset to highlight the importance of individual variability and data heterogeneity.

Methods

This BUMP study (Jan 2021 – May 2022) analyzed physiological and behavioral metrics, such as heart rate variability (HRV), sleep, and fatigue, in 256 individuals using Oura rings and self-reported surveys. Individual-level (N-of-1) trajectories were evaluated and compared with aggregate results to uncover personal and collective trends. A statistical method was developed to assess the influence of adverse events and severe symptoms, while case studies explored confounding and modifying factors underlying heterogeneity. Comprehensive statistical analysis included the coefficient of determination, Kolmogorov-Smirnov tests, likelihood ratio tests, and Welch’s t-tests, with inter-individual variability flagged based on high-variability thresholds.

Results

Results revealed significant variability in HRV, sleep, and fatigue throughout pregnancy. For instance, only 4.76% of individuals had HRV inflection points at the aggregate week 33 inflection, with a 14.24% coefficient of variation. Our analysis found no significant p-values for demographic or pregnancy complication-based subgrouping, suggesting these factors alone do not drive the observed variability. Case studies further highlighted both intra- and inter-individual differences, emphasizing the importance of considering external factors like adverse events and severe symptoms.

Conclusions

Our findings show that aggregate wearable data often fails to generalize across populations, oversimplifying pregnancy-related physiological and subjective changes. This simplification can obscure individual trajectories, leading to generalized insights that may not reflect many pregnant women’s experiences. Our results highlight the impact of heterogeneity on pregnancy outcomes, emphasizing the need to move beyond one-size-fits-all models and leverage DHT for personalized care.

Article activity feed