Development and validation of an algorithm to estimate the risk of severe complications of COVID-19: a retrospective cohort study in primary care in the Netherlands

Abstract

To develop an algorithm (sCOVID) to predict the risk of severe complications of COVID-19 in a community-dwelling population to optimise vaccination scenarios.

Design

Population-based cohort study.

Setting

264 Dutch general practices contributing to the NL-COVID database.

Participants

6074 people aged 0–99 diagnosed with COVID-19.

Main outcomes

Severe complications (hospitalisation, institutionalisation, death). The algorithm was developed from a training data set comprising 70% of the patients and validated in the remaining 30%. Potential predictor variables included age, sex, chronic comorbidity score (CCS) based on risk factors for COVID-19 complications, obesity, neighbourhood deprivation score (NDS), first or second COVID-19 wave and confirmation test. Six population vaccination scenarios were explored: (1) random ( naive ), (2) random for persons above 60 years ( 60plus ), (3) oldest patients first in age band of 5 years ( oldest first ), (4) target population of the annual influenza vaccination programme ( influenza ), (5) those 25–65 years of age first ( worker ), and (6) risk based using the prediction algorithm ( sCOVID ).

Results

Severe complications were reported in 243 (4.8%) people with 59 (20.3%) nursing home admissions, 181 (62.2%) hospitalisations and 51 (17.5%) deaths. The algorithm included age, sex, CCS, NDS, wave and confirmation test (c-statistic=0.91, 95% CI 0.88 to 0.94) in the validation set. Applied to different vaccination scenarios, the proportion of people needed to be vaccinated to reach a 50% reduction of severe complications was 67.5%, 50.0%, 26.1%, 16.0%, 10.0% and 8.4% for the worker, naive , influenza, 60plus, oldest first and sCOVID scenarios, respectively.

Conclusion

The sCOVID algorithm performed well to predict the risk of severe complications of COVID-19 in the first and second waves of COVID-19 infections in this Dutch population. The regression estimates can and need to be adjusted for future predictions. The algorithm can be applied to identify persons with highest risks from data in the electronic health records of general practitioners (GPs).

Article activity feed

SciScore for 10.1101/2021.02.05.21251197: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

NIH rigor criteria are not applicable to paper type.

Table 2: Resources

Software and Algorithms
Sentences	Resources
GLMNET package (4.0-2), version was used for statistical analyses and constructing figures.	GLMNET suggested: (glmnet, RRID:SCR_015505)

Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).

Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:

Strengths and limitations: The NL-COVID database also has limitations and strengths. First, we have substantial underreporting of positive cases since our 264 registration practices …

SciScore for 10.1101/2021.02.05.21251197: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

NIH rigor criteria are not applicable to paper type.

Table 2: Resources

Software and Algorithms
Sentences	Resources
GLMNET package (4.0-2), version was used for statistical analyses and constructing figures.	GLMNET suggested: (glmnet, RRID:SCR_015505)

Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).

Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:

Strengths and limitations: The NL-COVID database also has limitations and strengths. First, we have substantial underreporting of positive cases since our 264 registration practices consisting of about 5% of all GP practices only reported 0.7% of the registered cases. This is explained by several factors: first, practices enrolled into the program over time and some practices only joined the program and the end of 2021. Second, COVID-19 testing was done by the regional health authorities (GGD) whereas the administrations of the GGD were not linked with the GP administration. Therefore our registration relies on whether the patient contacted the GP and whether the GP registered the patient. This makes it likely that we have a selection bias towards the more severe disease manifestations of the COVID-19 infection. Also our prediction partly relied on the judgement of the GP whether a patient was COVID 19 positive (in case of lacking test results). It should therefore be stressed that absolute estimates risk estimates to develop severe complications should be interpreted with care only by health care professionals for prioritizing strategies. A weakness of the sCOVID scenario is that we did not perform an external validation. The large number of GP practices that came from all over the country and the good testing characteristics of the validation set, makes it likely that the accuracy of the scenarios is adequate. For the comparison of the different scenarios this has no import...

Results from TrialIdentifier: No clinical trial numbers were referenced.

Results from Barzooka: We did not find any issues relating to the usage of bar graphs.

Results from JetFighter: We did not find any issues relating to colormaps.

Results from scite Reference Check: We found one citation with an erratum. We recommend checking the erratum to confirm that it does not impact the accuracy of your citation.

DOI	Status	Title
10.1161/circoutcomes.113.000152	Has correction	Development and Validation of a Risk Score to Predict QT Int…

Read the original source

Development and validation of an algorithm to estimate the risk of severe complications of COVID-19: a retrospective cohort study in primary care in the Netherlands

This article has been Reviewed by the following groups

Discuss this preprint

Listed in

Abstract

Design

Setting

Participants

Main outcomes

Results

Conclusion

Article activity feed

Development and validation of a nomogram to predict secondary or concurrent bacterial infections in patients with COVID-19

Annual rehospitalization rates due to COVID-19 reinfection in Rio de Janeiro: the role of vaccination and associated factors

Longitudinal trends in the endemic symptomatic burden of COVID-19: Insights from community-based participatory virological surveillance in the Netherlands, November 2020 - April 2025

This article has been Reviewed by the following groups

Discuss this preprint

Listed in

Abstract

Design

Setting

Participants

Main outcomes

Results

Conclusion

Article activity feed

Related articles

Development and validation of a nomogram to predict secondary or concurrent bacterial infections in patients with COVID-19

Annual rehospitalization rates due to COVID-19 reinfection in Rio de Janeiro: the role of vaccination and associated factors

Longitudinal trends in the endemic symptomatic burden of COVID-19: Insights from community-based participatory virological surveillance in the Netherlands, November 2020 - April 2025