Maternal health Aggregated Trends can be Misleading: The power of N-of-1 Level Wearable Data Analysis for Personalized Pregnancy Monitoring
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Background
Personal digital health technologies (DHTs) enable real-time monitoring of physiological metrics and behavioral data, including HRV, supporting early detection of pregnancy-related conditions and personalized care throughout the perinatal period. While recent studies demonstrate the utility of personal DHTs in tracking pregnancy-related symptoms, they often rely on aggregate statistical methods that overlook individual variability.
Objective
To compare aggregate and individual-level analyses of digital health technology (DHT) data for early detection of pregnancy-related conditions, using the comprehensive BUMP dataset to highlight the importance of individual variability and data heterogeneity.
Methods
This BUMP study (Jan 2021 – May 2022) analyzed physiological and behavioral metrics, such as heart rate variability (HRV), sleep, and fatigue, in 256 individuals using Oura rings and self-reported surveys. Individual-level (N-of-1) trajectories were evaluated and compared with aggregate results to uncover personal and collective trends. A statistical method was developed to assess the influence of adverse events and severe symptoms, while case studies explored confounding and modifying factors underlying heterogeneity. Comprehensive statistical analysis included the coefficient of determination, Kolmogorov-Smirnov tests, likelihood ratio tests, and Welch’s t-tests, with inter-individual variability flagged based on high-variability thresholds.
Results
Results revealed significant variability in HRV, sleep, and fatigue throughout pregnancy. For instance, only 4.76% of individuals had HRV inflection points at the aggregate week 33 inflection, with a 14.24% coefficient of variation. Our analysis found no significant p-values for demographic or pregnancy complication-based subgrouping, suggesting these factors alone do not drive the observed variability. Case studies further highlighted both intra- and inter-individual differences, emphasizing the importance of considering external factors like adverse events and severe symptoms.
Conclusions
Our findings show that aggregate wearable data often fails to generalize across populations, oversimplifying pregnancy-related physiological and subjective changes. This simplification can obscure individual trajectories, leading to generalized insights that may not reflect many pregnant women’s experiences. Our results highlight the impact of heterogeneity on pregnancy outcomes, emphasizing the need to move beyond one-size-fits-all models and leverage DHT for personalized care.
