COVID-19 trajectories among 57 million adults in England: a cohort study using electronic health records

This article has been Reviewed by the following groups

Read the full article See related articles

Abstract

No abstract available

Article activity feed

  1. SciScore for 10.1101/2021.11.08.21265312: (What is this?)

    Please note, not all rigor criteria are appropriate for all manuscripts.

    Table 1: Rigor

    Ethicsnot detected.
    Sex as a biological variablenot detected.
    Randomizationnot detected.
    Blindingnot detected.
    Power Analysisnot detected.
    Cell Line AuthenticationAuthentication: We assessed 270 previously described comorbidities14, across 16 clinical specialities / organ systems, using validated CALIBER phenotypes and data records from 1st of January 1996 until 31st December 2019 from primary care, hospitalisation and procedure data14 (Supplementary Figure 3).

    Table 2: Resources

    Software and Algorithms
    SentencesResources
    A Reporting of studies Conducted using Observational Routinely-collected Data (RECORD) statement can be found in the supplement.
    RECORD
    suggested: (RECORD, RRID:SCR_009097)
    Data cleaning, exploratory analysis, phenotype creation and cohort assembly was performed using Python (3.7) and Spark SQL (2.4.5) on Databricks Runtime 6.4 for Machine Learning.
    Python
    suggested: (IPython, RRID:SCR_001658)
    Analysis was performed in RStudio (Professional) Version 1.3.1093.1
    RStudio
    suggested: (RStudio, RRID:SCR_000432)
    Figures were constructed using ggplot2 (3.3.3), VennDiagram (1.6.20), igraph (1.2.6), survival (3.2.7) and survminer (0.4.8) packages.
    ggplot2
    suggested: (ggplot2, RRID:SCR_014601)
    VennDiagram
    suggested: (VennDiagram, RRID:SCR_002414)

    Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


    Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
    Strengths and limitations: A key strength of this work using national-scale data is that by definition it is representative of the general population across all age groups, ethnicities, deprivation levels and demographic characteristics. To our knowledge, this is the largest population-wide research study of COVID-19 phenotypes which includes: a) multiple healthcare settings through data linkage at a population level, b) detailed identification of specific ventilatory treatments, c) classification of COVID-19 related deaths, and d) exploration of transitions between COVID-19 events. Using multiple EHR sources spanning different healthcare settings, maximised infection ascertainment and reduced the effects of variable testing and data recording patterns (especially during the first wave). As the focus of this work was to create COVID-19 related phenotypes, and describe the characteristics of individuals experiencing them, we have not conducted multivariable regression analyses to control for confounders. The findings presented are therefore not associative statements and should not be interpreted as causal relationships. However by sharing reproducible phenotype definitions we hope to facilitate further work to address the questions raised in this and other COVID-19 studies exploring national level data, as exemplified by recent research17–19. Whilst our definitions of the pandemic waves differ from others, we believe using non-contiguous dates enabled a balanced comparison ac...

    Results from TrialIdentifier: No clinical trial numbers were referenced.


    Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


    Results from JetFighter: We did not find any issues relating to colormaps.


    Results from rtransparent:
    • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
    • No funding statement was detected.
    • No protocol registration statement was detected.

    Results from scite Reference Check: We found no unreliable references.


    About SciScore

    SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.