Estimation of the fraction of COVID-19 infected people in U.S. states and countries worldwide

Jungsik Noh
Gaudenz Danuser

This article has been Reviewed by the following groups

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

Evaluated articles (ScreenIT)

Abstract

Since the beginning of the coronavirus disease 2019 (COVID-19) pandemic, daily counts of confirmed cases and deaths have been publicly reported in real-time to control the virus spread. However, substantial undocumented infections have obscured the true size of the currently infected population, which is arguably the most critical number for public health policy decisions. We developed a machine learning framework to estimate time courses of actual new COVID-19 cases and current infections in all 50 U.S. states and the 50 most infected countries from reported test results and deaths. Using published epidemiological parameters, our algorithm optimized slowly varying daily ascertainment rates and a time course of currently infected cases each day. Severe under-ascertainment of COVID-19 cases was found to be universal across U.S. states and countries worldwide. In 25 out of the 50 countries, actual cumulative cases were estimated to be 5–20 times greater than the confirmed cases. Our estimates of cumulative incidence were in line with the existing seroprevalence rates in 46 U.S. states. Our framework projected for countries like Belgium, Brazil, and the U.S. that ~10% of the population has been infected once. In the U.S. states like Louisiana, Georgia, and Florida, more than 4% of the population was estimated to be currently infected, as of September 3, 2020, while in New York this fraction is 0.12%. The estimation of the actual fraction of currently infected people is crucial for any definition of public health policies, which up to this point may have been misguided by the reliance on confirmed cases.

Version published to 10.1371/journal.pone.0246772
Feb 8, 2021
ScreenIT
Sep 28, 2020
SciScore for 10.1101/2020.09.26.20202382: (What is this?)
Please note, not all rigor criteria are appropriate for all manuscripts.
Table 1: Rigor
NIH rigor criteria are not applicable to paper type.
Table 2: Resources
No key resources detected.
Results from OddPub: Thank you for sharing your code.
Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.
Results from TrialIdentifier: No clinical trial numbers were referenced.
Results from Barzooka: We did not find any issues relating to the usage of bar graphs.
Results from JetFighter: We did not find any issues relating to colormaps.
Results from rtransparent:
- Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when …
SciScore for 10.1101/2020.09.26.20202382: (What is this?)
Please note, not all rigor criteria are appropriate for all manuscripts.
Table 1: Rigor
NIH rigor criteria are not applicable to paper type.
Table 2: Resources
No key resources detected.
Results from OddPub: Thank you for sharing your code.
Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.
Results from TrialIdentifier: No clinical trial numbers were referenced.
Results from Barzooka: We did not find any issues relating to the usage of bar graphs.
Results from JetFighter: We did not find any issues relating to colormaps.
Results from rtransparent:
Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
No protocol registration statement was detected.
About SciScore
SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.
Read the original source
Version published to 10.1101/2020.09.26.20202382 on medRxiv
Sep 28, 2020

Incidence and trends for notifiable infectious diseases in Shenyang, China, 2005-2024

This article has 6 authors:
1. Huijie Chen
2. Huiyu Wen
3. Ye Chen
4. Lihai Wen
5. Zhuo Jin
6. Bingzheng Zhou
This article has no evaluationsLatest version Jan 21, 2026
A preliminary molecular snapshot of measles virus genotypes circulating in Rabat, Morocco, during 2024-2025

This article has 19 authors:
1. Othmane Jeddoub
2. El Mehdi Belouad
3. Nadia Touil
4. Fadoua El Amraoui
5. Idrissa Diawara
6. Youssef El Kadiri
7. Omar Nyabi
8. Jean-Luc Gala
9. Salma Labied
10. Safae El Kochri
11. Mohamed Rida Tagajdid
12. Rachid Abi
13. Hicham El Annaz
14. Idriss Lahlou Amine
15. Mouad Merabet
16. Abdelaziz Benjouad
17. Khalid Ennibi
18. Lamiae Belayachi
19. Elmostafa El Fahime
This article has no evaluationsLatest version Dec 22, 2025
Forecasting the incidence of viral hepatitis C in the Kyrgyz Republic using SARIMA models

This article has 5 authors:
1. Dautov Timur Talgatovich
2. Z.Sh. Nurmatov
3. Abzhaparova Aiganysh Ziyaydinovna
4. Mamatkulova Nazgul Mamatkulovna
5. B.T. Abdimomunova
This article has no evaluationsLatest version Dec 22, 2025

This article has been Reviewed by the following groups

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Incidence and trends for notifiable infectious diseases in Shenyang, China, 2005-2024

A preliminary molecular snapshot of measles virus genotypes circulating in Rabat, Morocco, during 2024-2025

Forecasting the incidence of viral hepatitis C in the Kyrgyz Republic using SARIMA models