Balancing Accuracy and Actionability: An Assessment of Minimal- Input Wastewater Models for COVID-19 Prediction

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Wastewater-based epidemiology (WBE) is a valuable tool for surveillance of infectious diseases like COVID-19, yet balancing prediction accuracy with practical deployment remains challenging, particularly in low- resource, low-prevalence, and early warning contexts. We analyzed a two-and-a-half-year dataset from Athens-Clarke County, Georgia (USA) to compare approaches for predicting COVID-19 cases from SARS-CoV-2 wastewater data. We evaluated the effects of extraction replicates and use of quantitative versus detection frequency data on model performance using random forest and linear models. Results show that two extraction replicates generally suffice for reliable prediction, supporting efficient resource use. Combined viral load and detection frequency produced the strongest models, but detection frequency alone predicted new COVID-19 cases more accurately than viral load alone and was less sensitive to RT-qPCR instrumentation changes, making it a practical alternative when quantification is unreliable/infeasible. Linear models predicted new cases more accurately than random forest models, offering a resource-efficient option for monitoring wastewater trends. Following declines in clinical testing in spring 2022, wastewater- based models estimated substantially higher case counts than were reported, underscoring WBE’s role for ongoing surveillance of COVID-19 and other infectious diseases. These findings provide practical guidance for optimizing WBE implementation, particularly where early warning and resource constraints are significant factors.

HIGHLIGHTS

  • Analysis of SARS-CoV-2 in wastewater revealed that presence/absence rates in wastewater predicted new COVID-19 cases better than viral quantity alone

  • Two extraction replicates was sufficient for acceptable prediction accuracy

  • Linear regression models more accurately predicted new cases than random forest

  • After clinical testing efforts diminished, wastewater models predicted higher case counts than were reported

GRAPHICAL ABSTRACT

Article activity feed