A spatial EHR and wastewater-informed modeling framework for respiratory virus prediction under sparse and missing data conditions
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Wastewater-based epidemiology has emerged as a powerful complement to clinical surveillance for monitoring infectious disease dynamics. However, most existing approaches either treat wastewater sites in isolation, overlooking spatial dependencies, and often fail to account for variability in data quality, limiting their ability to generate reliable predictions of healthcare demand. Here we present a spatial Bayesian renewal framework that integrates wastewater surveillance with mobility-informed spatial interactions while incorporating reliability-weighted wastewater signals. We apply the framework to three major respiratory pathogens, i.e., SARS-CoV-2, influenza, and respiratory syncytial virus (RSV), using wastewater and hospital data from counties in South Carolina. Across rolling four-week forecasts, the spatial framework consistently outperforms non-spatial approaches and remains robust even in counties lacking direct wastewater or hospitalization observations. Importantly, we show that county-level forecasts can be translated into facility-level predictions, enabling localized assessment of healthcare demand. These forecasts provide actionable early-warning signals to support hospital capacity planning, staffing decisions, and resource allocation. Together, this work establishes a scalable digital surveillance framework that integrates heterogeneous data sources for enabling more reliable infectious disease forecasting and supporting public health decision-making in underserved and data-limited settings.