A comparison of five epidemiological models for transmission of SARS-CoV-2 in India

Soumik Purkayastha
Rupam Bhattacharyya
Ritwik Bhaduri
Ritoban Kundu
Xuelin Gu
Maxwell Salvatore
Debashree Ray
Swapnil Mishra
Bhramar Mukherjee

This article has been Reviewed by the following groups

Read the full article

Listed in

Evaluated articles (ScreenIT)

Abstract

Background

Many popular disease transmission models have helped nations respond to the COVID-19 pandemic by informing decisions about pandemic planning, resource allocation, implementation of social distancing measures, lockdowns, and other non-pharmaceutical interventions. We study how five epidemiological models forecast and assess the course of the pandemic in India: a baseline curve-fitting model, an extended SIR (eSIR) model, two extended SEIR (SAPHIRE and SEIR-fansy) models, and a semi-mechanistic Bayesian hierarchical model (ICM).

Methods

Using COVID-19 case-recovery-death count data reported in India from March 15 to October 15 to train the models, we generate predictions from each of the five models from October 16 to December 31. To compare prediction accuracy with respect to reported cumulative and active case counts and reported cumulative death counts, we compute the symmetric mean absolute prediction error (SMAPE) for each of the five models. For reported cumulative cases and deaths, we compute Pearson’s and Lin’s correlation coefficients to investigate how well the projected and observed reported counts agree. We also present underreporting factors when available, and comment on uncertainty of projections from each model.

Results

For active case counts, SMAPE values are 35.14% (SEIR-fansy) and 37.96% (eSIR). For cumulative case counts, SMAPE values are 6.89% (baseline), 6.59% (eSIR), 2.25% (SAPHIRE) and 2.29% (SEIR-fansy). For cumulative death counts, the SMAPE values are 4.74% (SEIR-fansy), 8.94% (eSIR) and 0.77% (ICM). Three models (SAPHIRE, SEIR-fansy and ICM) return total (sum of reported and unreported) cumulative case counts as well. We compute underreporting factors as of October 31 and note that for cumulative cases, the SEIR-fansy model yields an underreporting factor of 7.25 and ICM model yields 4.54 for the same quantity. For total (sum of reported and unreported) cumulative deaths the SEIR-fansy model reports an underreporting factor of 2.97. On October 31, we observe 8.18 million cumulative reported cases, while the projections (in millions) from the baseline model are 8.71 (95% credible interval: 8.63–8.80), while eSIR yields 8.35 (7.19–9.60), SAPHIRE returns 8.17 (7.90–8.52) and SEIR-fansy projects 8.51 (8.18–8.85) million cases. Cumulative case projections from the eSIR model have the highest uncertainty in terms of width of 95% credible intervals, followed by those from SAPHIRE, the baseline model and finally SEIR-fansy.

Conclusions

In this comparative paper, we describe five different models used to study the transmission dynamics of the SARS-Cov-2 virus in India. While simulation studies are the only gold standard way to compare the accuracy of the models, here we were uniquely poised to compare the projected case-counts against observed data on a test period. The largest variability across models is observed in predicting the “total” number of infections including reported and unreported cases (on which we have no validation data). The degree of under-reporting has been a major concern in India and is characterized in this report. Overall, the SEIR-fansy model appeared to be a good choice with publicly available R-package and desired flexibility plus accuracy.

Version published to 10.1186/s12879-021-06077-9
Jun 7, 2021
ScreenIT
Mar 1, 2021
SciScore for 10.1101/2020.09.19.20198010: (What is this?)
Please note, not all rigor criteria are appropriate for all manuscripts.
Table 1: Rigor
Institutional Review Board Statement not detected.
Randomization not detected.
Blinding not detected.
Power Analysis not detected.
Sex as a biological variable not detected.
Table 2: Resources
No key resources detected.
Results from OddPub: Thank you for sharing your code and data.
Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
There are several limitations to this work. First and foremost, all model estimates are based on a scenario where we assumed no change in either interventions or behavior of people in the forecast period. This is not true as there is tremendous variation in policies across Indian states in the post lock-down phase. We did …
SciScore for 10.1101/2020.09.19.20198010: (What is this?)
Please note, not all rigor criteria are appropriate for all manuscripts.
Table 1: Rigor
Institutional Review Board Statement not detected.
Randomization not detected.
Blinding not detected.
Power Analysis not detected.
Sex as a biological variable not detected.
Table 2: Resources
No key resources detected.
Results from OddPub: Thank you for sharing your code and data.
Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
There are several limitations to this work. First and foremost, all model estimates are based on a scenario where we assumed no change in either interventions or behavior of people in the forecast period. This is not true as there is tremendous variation in policies across Indian states in the post lock-down phase. We did observe regional lockdowns that were enacted in the forecast period. None of our models tried to capture this variability. Second, the five models we compare are a subset of vast amount of work that has been done in this area, particularly models that incorporate age-specific contact network and spatiotemporal variation. Finally, an extensive simulation study would be the best way to assess the models under different scenarios but we have restricted our attention to India. Finally, we only report point estimates and have not compared the uncertainty estimates from each model which also play a key role in our choice.
Results from TrialIdentifier: No clinical trial numbers were referenced.
Results from Barzooka: We did not find any issues relating to the usage of bar graphs.
Results from JetFighter: We did not find any issues relating to colormaps.
Results from rtransparent:
Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
No protocol registration statement was detected.
About SciScore
SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.
Read the original source
Version published to 10.1101/2020.09.19.20198010 on medRxiv
Sep 22, 2020

Evaluating Age-Dependent Transmission and Vaccination Policy in Singapore’s SARS-CoV-2 Epidemic: A Computational Modelling Approach

This article has 7 authors:
1. Jingyan Huang
2. Zhi Ling
3. Mousumi Roy
4. Shihui Jin
5. Jeremy WeiQuan Chan
6. Kelvin Bryan Tan
7. Swapnil Mishra
This article has no evaluationsLatest version Jun 30, 2025
Spatio-temporal modelling of COVID-19 infection and associated risk factors in Dakar, Senegal

This article has 12 authors:
1. Assane Niang Gadiaga
2. Mame Wodji Tine
3. Aminata Niang Diene
4. Catherine Linard
5. Niko Speybroeck
6. Ortis Yankey
7. Somnath Chaudhuri
8. Chibuzor Christopher Nnanatu
9. Eimear Cleary
10. Shengjie Lai
11. Attila Nando Lazar
12. Andrew James Tatem
This article has no evaluationsLatest version Jul 6, 2025
ESTIMATING THE INCIDENCE OF SARS-COV-2 INFECTIONS IN 2020 IN BELGIUM BY JOINTLY MODELLING SEROPREVALENCE, HOSPITALIZATION AND MORTALITY DATA

This article has 9 authors:
1. Toon Braeye
2. Robby De Pauw
3. Laurence Geebelen
4. Steven Abrams
5. Isabelle Desombere
6. Niel Hens
7. Naïma Hammami
8. Mathieu Roelants
9. Sereina A. Herzog
This article has no evaluationsLatest version Jul 3, 2025

Institutional Review Board Statement	not detected.
Randomization	not detected.
Blinding	not detected.
Power Analysis	not detected.
Sex as a biological variable	not detected.

This article has been Reviewed by the following groups

Listed in

Abstract

Background

Methods

Results

Conclusions

Article activity feed

Related articles

Evaluating Age-Dependent Transmission and Vaccination Policy in Singapore’s SARS-CoV-2 Epidemic: A Computational Modelling Approach

Spatio-temporal modelling of COVID-19 infection and associated risk factors in Dakar, Senegal

ESTIMATING THE INCIDENCE OF SARS-COV-2 INFECTIONS IN 2020 IN BELGIUM BY JOINTLY MODELLING SEROPREVALENCE, HOSPITALIZATION AND MORTALITY DATA