COVID Symptoms, Symptom Clusters, and Predictors for Becoming a Long-Hauler Looking for Clarity in the Haze of the Pandemic

This article has been Reviewed by the following groups

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Abstract

Post-acute sequelae of SARS-CoV-2 (PASC) is defined as persistent symptoms after apparent recovery from acute COVID-19 infection, also known as COVID-19 long-haul. We performed a retrospective review of electronic health records (EHR) from the University of California COvid Research Data Set (UC CORDS), a de-identified EHR of PCR-confirmed SARS-CoV-2-positive patients in California. The purposes were to (1) describe the prevalence of PASC, (2) describe COVID-19 symptoms and symptom clusters, and (3) identify risk factors for PASC. Data were subjected to non-negative matrix factorization to identify symptom clusters, and a predictive model of PASC was developed. PASC prevalence was 11% (277/2,153), and of these patients, 66% (183/277) were considered asymptomatic at days 0–30. Five PASC symptom clusters emerged and specific symptoms at days 0–30 were associated with PASC. Women were more likely than men to develop PASC, with all age groups and ethnicities represented. PASC is a public health priority.

Article activity feed

  1. SciScore for 10.1101/2021.03.03.21252086: (What is this?)

    Please note, not all rigor criteria are appropriate for all manuscripts.

    Table 1: Rigor

    Institutional Review Board Statementnot detected.
    Randomizationnot detected.
    Blindingnot detected.
    Power Analysisnot detected.
    Sex as a biological variablenot detected.

    Table 2: Resources

    No key resources detected.


    Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


    Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
    Some of the variability in symptom reporting and symptom association with long-haulers may be due to limitations inherent in rapid screening questionnaires in as much as these questionnaires inquire about symptoms that predominantly impact those with severe disease. Also, questionnaires may fail to inquire about emerging symptoms such as cognitive dysfunction (including “brain fog”), limiting the ability to accurately document such symptoms. Asymptomatic individuals may be less often intensely monitored due to an inherent notion of low risk for severe acute disease; however, this is problematic as asymptomatic individuals account for 32% of the long-haulers observed in this study. The symptom clusters observed among long-haulers vary compared to those at initial presentation. The evolution of these clusters may provide insight into the etiology of long-haulers in which elucidating sites of evolving tissue damage, and alterations in innate and adaptive immune inflammatory pathways might provide clarity in understanding the underlying pathophysiology. In October 2020, the Tony Blair Institute for Global Change identified key characteristics among long-haulers, specifically that women appear to be at greater risk and those who are of working age (mean of age 45) (17). Our data align with these observations. Therefore, to our third key point, we observed that all ethnicities were affected as well as individuals who were initially asymptomatic. However, our use of ethnicity is lim...

    Results from TrialIdentifier: No clinical trial numbers were referenced.


    Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


    Results from JetFighter: We did not find any issues relating to colormaps.


    Results from rtransparent:
    • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
    • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
    • No protocol registration statement was detected.

    About SciScore

    SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.