Implementation of the COVID-19 Vulnerability Index Across an International Network of Health Care Data Sets: Collaborative External Validation Study

Abstract

SARS-CoV-2 is straining health care systems globally. The burden on hospitals during the pandemic could be reduced by implementing prediction models that can discriminate patients who require hospitalization from those who do not. The COVID-19 vulnerability (C-19) index, a model that predicts which patients will be admitted to hospital for treatment of pneumonia or pneumonia proxies, has been developed and proposed as a valuable tool for decision-making during the pandemic. However, the model is at high risk of bias according to the Prediction model Risk Of Bias ASsessment Tool (PROBAST) criteria, and it has not been externally validated.

Objective

The aim of this study was to externally validate the C-19 index across a range of health care settings to determine how well it broadly predicts hospitalization due to pneumonia in COVID-19 cases.

Methods

We followed the Observational Health Data Sciences and Informatics (OHDSI) framework for external validation to assess the reliability of the C-19 index. We evaluated the model on two different target populations: 41,381 patients who presented with SARS-CoV-2 at an outpatient or emergency department visit, and 9,429,285 patients who presented with influenza or related symptoms during an outpatient or emergency department visit. For both populations, the model was used to predict the risk of hospitalization with pneumonia during the following 0-30 days. In total, we validated the model across a network of 14 databases spanning the United States, Europe, Australia, and Asia.
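
For concreteness, the outcome definition described above (hospitalization with pneumonia within 0-30 days of an outpatient or emergency department index visit) amounts to a simple labeling step. The sketch below is a minimal illustration, not the OHDSI study code; the DataFrame and column names (`index_visits`, `pneumonia_admissions`, `person_id`, `index_date`, `admission_date`) are assumptions made for the example.

```python
# Minimal sketch (assumed data layout, not the study code): label each index
# visit with a binary 0-30 day hospitalization-with-pneumonia outcome.
import pandas as pd

def label_outcome(index_visits: pd.DataFrame,
                  pneumonia_admissions: pd.DataFrame,
                  window_days: int = 30) -> pd.DataFrame:
    """index_visits: person_id, index_date (outpatient/ED visit).
    pneumonia_admissions: person_id, admission_date (inpatient pneumonia).
    Returns index_visits with an added binary `outcome` column."""
    merged = index_visits.merge(pneumonia_admissions, on="person_id", how="left")
    days_to_admission = (merged["admission_date"] - merged["index_date"]).dt.days
    merged["in_window"] = days_to_admission.between(0, window_days)
    outcome = (merged.groupby(["person_id", "index_date"])["in_window"]
                     .any()
                     .rename("outcome")
                     .astype(int)
                     .reset_index())
    return index_visits.merge(outcome, on=["person_id", "index_date"], how="left")
```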

Results

In internal validation, the C-19 index achieved a C statistic of 0.73; calibration was not reported by the developers. When we externally validated the model by transporting it to SARS-CoV-2 data, it obtained C statistics of 0.36, 0.53 (0.473-0.584), and 0.56 (0.488-0.636) on Spanish, US, and South Korean data sets, respectively. Calibration was poor, with the model underestimating risk. When validated on 12 data sets containing influenza patients across the OHDSI network, the C statistics ranged between 0.40 and 0.68.
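
For readers less familiar with these metrics, the sketch below shows one way a C statistic with a bootstrap confidence interval and a basic calibration summary can be computed from externally validated predictions. It is illustrative only, not the evaluation code used in the study; `y_true` (observed outcomes) and `y_pred` (predicted risks) are assumed arrays.

```python
# Illustrative sketch (not the study's evaluation code): discrimination and a
# simple calibration summary for a set of predicted risks.
import numpy as np
from sklearn.metrics import roc_auc_score

def c_statistic_with_ci(y_true, y_pred, n_boot=1000, seed=0):
    """C statistic (ROC AUC) with a percentile bootstrap 95% CI."""
    rng = np.random.default_rng(seed)
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    point = roc_auc_score(y_true, y_pred)
    boots = []
    for _ in range(n_boot):
        idx = rng.integers(0, len(y_true), len(y_true))
        if y_true[idx].min() == y_true[idx].max():
            continue  # resample contained only one class; AUC undefined
        boots.append(roc_auc_score(y_true[idx], y_pred[idx]))
    lower, upper = np.percentile(boots, [2.5, 97.5])
    return point, lower, upper

def calibration_in_the_large(y_true, y_pred):
    """Mean observed vs. mean predicted risk; an observed/expected ratio above 1
    means the model underestimates risk on average."""
    observed, expected = float(np.mean(y_true)), float(np.mean(y_pred))
    return {"observed": observed, "expected": expected,
            "observed_expected_ratio": observed / expected}
```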

Conclusions

Our results show that the discriminative performance of the C-19 index model is low for influenza cohorts and even worse among patients with COVID-19 in the United States, Spain, and South Korea. These results suggest that the C-19 index should not be used to aid decision-making during the COVID-19 pandemic. Our findings highlight the importance of performing external validation across a range of settings, especially when a prediction model is being extrapolated to a different population. In the field of prediction, extensive validation is required to create appropriate trust in a model.

Article activity feed

  1. SciScore for 10.1101/2020.06.15.20130328:

    Please note, not all rigor criteria are appropriate for all manuscripts.

    Table 1: Rigor

    Institutional Review Board Statement
    IRB: Each site obtained institutional review board (IRB) approval for the study or used de-identified data, and therefore the study was determined not to be human subjects research.
    Consent: Informed consent was not necessary at any site.

    Randomization: not detected.

    Blinding: not detected.

    Power Analysis: not detected.

    Sex as a biological variable
    Predictors: The predictors of the logistic regression version of the C-19 index are age in years, male sex, number of inpatient visits during the prior 12 months, and indicator variables for various Clinical Classifications Software Refined (CCSR) categories.
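
Given the predictor list above, the logistic-regression form of the C-19 index is a fixed linear combination of those predictors passed through the logistic function. The sketch below shows that structure only; the coefficient names and values are placeholders for illustration, not the published C-19 weights, and the column names are assumed.

```python
# Sketch of applying a fixed logistic-regression index such as the C-19 model.
# All coefficient values below are placeholders, NOT the published C-19 weights.
import numpy as np
import pandas as pd

PLACEHOLDER_INTERCEPT = 0.0
PLACEHOLDER_COEFFICIENTS = {
    "age_in_years": 0.0,            # placeholder weight
    "male_sex": 0.0,                # 1 = male, 0 = otherwise; placeholder weight
    "prior_inpatient_visits": 0.0,  # count in the prior 12 months; placeholder
    "ccsr_category_flag": 0.0,      # one CCSR indicator variable; placeholder
}

def predicted_risk(cohort: pd.DataFrame,
                   coefficients: dict = PLACEHOLDER_COEFFICIENTS,
                   intercept: float = PLACEHOLDER_INTERCEPT) -> np.ndarray:
    """Apply fixed coefficients (no refitting) to obtain predicted risks."""
    linear_predictor = intercept + sum(
        coef * cohort[name].to_numpy() for name, coef in coefficients.items()
    )
    return 1.0 / (1.0 + np.exp(-linear_predictor))
```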

    Table 2: Resources

    No key resources detected.


    Results from OddPub: Thank you for sharing your code and data.


    Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
    Limitations: A common issue when using observational healthcare data, especially across a network of databases, is the difficulty in developing phenotypes that are valid on all datasets. In this study we used the predictor definitions given by the researchers who developed the model. However, these definitions may not transport across all the datasets and may account for some of the decrease in performance. We were also limited to validating the less complex C-19 model because of the large number of variables and the lack of transparency of the more complex models.
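
One way to explore this kind of transportability issue is to compare how often each predictor definition is actually recorded in each database: a predictor with near-zero prevalence in one database but not in others may signal a definition that does not transport. The sketch below is an illustrative diagnostic, not part of the study; it assumes a dictionary mapping database names to cohort DataFrames with one column per predictor.

```python
# Illustrative diagnostic (not from the study): per-database predictor prevalence.
import pandas as pd

def predictor_prevalence(cohorts_by_database: dict,
                         predictor_columns: list) -> pd.DataFrame:
    """Rows = databases, columns = predictors, values = fraction of patients
    with a non-zero value for that predictor."""
    rows = {
        db_name: {col: float((cohort[col] != 0).mean()) for col in predictor_columns}
        for db_name, cohort in cohorts_by_database.items()
    }
    return pd.DataFrame.from_dict(rows, orient="index")
```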

    Results from TrialIdentifier: No clinical trial numbers were referenced.


    Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


    Results from JetFighter: We did not find any issues relating to colormaps.


    Results from rtransparent:
    • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
    • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
    • No protocol registration statement was detected.

    About SciScore

    SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.