A demographic scaling model for estimating the total number of COVID-19 infections

This article has been Reviewed by the following groups

Read the full article See related articles

Abstract

Background

Understanding how widely COVID-19 has spread is critical information for monitoring the pandemic. The actual number of infections potentially exceeds the number of confirmed cases.

Development

We develop a demographic scaling model to estimate COVID-19 infections, based on minimal data requirements: COVID-19-related deaths, infection fatality rates (IFRs), and life tables. As many countries lack IFR estimates, we scale them from a reference country based on remaining lifetime to better match the context in a target population with respect to age structure, health conditions and medical services. We introduce formulas to account for bias in input data and provide a heuristic to assess whether local seroprevalence estimates are representative for the total population.

Application

Across 10 countries with most reported COVID-19 deaths as of 23 July 2020, the number of infections is estimated to be three [95% prediction interval: 2–8] times the number of confirmed cases. Cross-country variation is high. The estimated number of infections is 5.3 million for the USA, 1.8 million for the UK, 1.4 million for France, and 0.4 million for Peru, or more than one, six, seven and more than one times the number of confirmed cases, respectively. Our central prevalence estimates for entire countries are markedly lower than most others based on local seroprevalence studies.

Conclusions

The national infection estimates indicate that the pandemic is far more widespread than the numbers of confirmed cases suggest. Some local seroprevalence estimates largely deviate from their corresponding national mean and are unlikely to be representative for the total population.

Article activity feed

  1. SciScore for 10.1101/2020.04.23.20077719: (What is this?)

    Please note, not all rigor criteria are appropriate for all manuscripts.

    Table 1: Rigor

    Institutional Review Board Statementnot detected.
    Randomizationnot detected.
    Blindingnot detected.
    Power Analysisnot detected.
    Sex as a biological variablenot detected.

    Table 2: Resources

    No key resources detected.


    Results from OddPub: Thank you for sharing your code and data.


    Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.

    Results from TrialIdentifier: No clinical trial numbers were referenced.


    Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


    Results from JetFighter: We did not find any issues relating to colormaps.


    Results from rtransparent:
    • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
    • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
    • No protocol registration statement was detected.

    About SciScore

    SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.