Persistent symptoms following SARS-CoV-2 infection in a random community sample of 508,707 people

Abstract

Background

Long COVID, describing the long-term sequelae after SARS-CoV-2 infection, remains a poorly defined syndrome. There is uncertainty about its predisposing factors and the extent of the resultant public health burden, with estimates of prevalence and duration varying widely.

Methods

Within rounds 3–5 of the REACT-2 study, 508,707 people in the community in England were asked about a prior history of COVID-19 and the presence and duration of 29 different symptoms. We used uni- and multivariable models to identify predictors of persistence of symptoms (12 weeks or more). We estimated the prevalence of symptom persistence at 12 weeks, and used unsupervised learning to cluster individuals by symptoms experienced.

Findings

Among the 508,707 participants, the weighted prevalence of self-reported COVID-19 was 19.2% (95% CI: 19.1, 19.3). Of 76,155 people with symptomatic COVID-19, 37.7% experienced at least one symptom, and 14.8% experienced three or more symptoms, lasting 12 weeks or more. This gives a weighted population prevalence of persistent symptoms of 5.75% (5.68, 5.81) for at least one symptom and 2.22% (2.18, 2.26) for three or more symptoms. Almost a third of people (8,771/28,713 [30.5%]) with at least one symptom lasting 12 weeks or more reported having had severe COVID-19 symptoms (“significant effect on my daily life”) at the time of their illness, giving a weighted prevalence overall for this group of 1.72% (1.69, 1.76). The prevalence of persistent symptoms was higher in women than men (OR: 1.51 [1.46, 1.55]) and, conditional on reporting symptoms, risk of persistent symptoms increased linearly with age by 3.5 percentage points per decade of life. Obesity, smoking or vaping, hospitalisation, and deprivation were also associated with a higher probability of persistent symptoms, while Asian ethnicity was associated with a lower probability. Two stable clusters were identified based on symptoms that persisted for 12 weeks or more: in the largest cluster, tiredness predominated, while in the second there was a high prevalence of respiratory and related symptoms.

Interpretation

A substantial proportion of people with symptomatic COVID-19 go on to have persistent symptoms for 12 weeks or more, which is age-dependent. Clinicians need to be aware of the differing manifestations of Long COVID which may require tailored therapeutic approaches. Managing the long-term sequelae of SARS-CoV-2 infection in the population will remain a major challenge for health services in the next stage of the pandemic.

Funding

The study was funded by the Department of Health and Social Care in England.

Research in context

Evidence before this study

Recent systematic reviews have documented the wide range of symptoms and reported prevalence of persistent symptoms following COVID-19. A dynamic review of Long COVID studies (NIHR Evidence) in March 2021 summarised the literature on the prevalence of persistent symptoms after acute COVID-19, and reported that most studies (14) were of hospitalised patients, with higher prevalence of persistent symptoms compared with two community-based studies. There was limited evidence from community studies beyond 12 weeks. Another systematic review reported a median of over 70% of people with symptoms lasting at least 60 days. A review of risk factors for Long COVID found consistent evidence for an increased risk amongst women and those with high body mass index (BMI) but inconsistent findings on the role of age and little evidence concerning risks among different socioeconomic or ethnic groups, which are often not well captured in routine healthcare records. Long COVID is increasingly recognised as heterogeneous, likely underpinned by differing biological mechanisms, but there is not yet consensus on defining subtypes of the condition.

Added value of this study

This community-based study of over half a million people was designed to be representative of the adult population of England. A random sample of adults aged 18 years and above registered with a GP was invited irrespective of previous access to services for COVID-19, providing an estimate of population prevalence that was representative of the whole population. The findings show substantial declines in symptom prevalence over the first 12 weeks following COVID-19, which was reported by nearly one fifth of respondents, of whom over a third remained symptomatic at 12 weeks and beyond, with little evidence for decline thereafter.

Risk factors identified for persistent symptoms (12 weeks or more) suggestive of Long COVID confirm some previous findings: an increased risk in women, obese and overweight individuals and those hospitalised for COVID-19, with strong evidence for an increasing risk with age. Additional evidence was found for an increased risk in those with lower income, those who smoke or vape, and healthcare or care home workers. A lower risk was found in those of Asian ethnicity.

Clustering identified two distinct groups of individuals with different symptom profiles at 12 weeks, highlighting the heterogeneity of clinical presentation. The smaller cluster had higher prevalence of respiratory and related symptoms, while for those in the larger cluster tiredness was the dominant symptom, with lower prevalence of organ-specific symptoms.

Implications of available evidence

There is a high prevalence of persistent symptoms beyond 12 weeks after acute COVID-19, with little evidence of decline thereafter. This highlights the need for greater support for patients, both through specialised services and, for those from low-income settings, financial support. The understanding that there are distinct clusters of persistent symptoms, the most common of which is dominated by fatigue, is important for the recognition and clinical management of the condition outside of specialised services.

Article activity feed

  1. Our take

    This study, available as a preprint and thus not yet peer-reviewed, used cross-sectional data from three rounds of a national population-based study in England to estimate the prevalence and correlates of persistent COVID-19 symptoms, and patterns of symptom occurrence. Of the 76,155 participants with self-reported COVID-19 symptom onset 12+ weeks before their survey date, 37.7% reported at least one persistent symptom 12+ weeks from symptom onset. Among all participants, the weighted population prevalence of at least one persistent symptom 12+ weeks from symptom onset was 5.75% (95% CI: 5.68, 5.82) in England. The authors also found that, among those with persistent symptoms, symptoms tended to cluster as “tiredness” (fatigue, muscle aches, and difficulty sleeping) or “respiratory” (shortness of breath, chest tightness, and chest pain). The “respiratory cluster” was more common among those with more severe initial disease. While it’s possible that this persistent symptom prevalence is an overestimate given a low survey response rate and potential for pre-existing symptoms to be attributed to COVID-19, this paper has important implications for clinicians treating individuals with a history of COVID-19 and adds to our collective understanding about COVID-19’s long-term sequelae.

    Study design

    Cross-sectional

    Study population and setting

    This study used cross-sectional data from rounds three to five of the Real-Time Assessment of Community Transmission-2 (REACT-2) study in England between September 2020 and February 2021. The REACT-2 study used a cluster sampling approach to randomly select individuals from the National Health Service patient list within each of the 315 lower-tier local authority areas (LTLA). The study collected self-reported PCR testing history, as well as demographic characteristics, medical comorbidities, and current symptoms that may be related to COVID-19. This analysis included individuals who reported a history of symptomatic COVID-19 with symptom onset 12 weeks or more before the survey date. The authors weighted symptom prevalence estimates by sex, age, ethnicity, LTLA, and an index of multiple deprivation to estimate prevalence across England. They then investigated the relationship between demographic and lifestyle factors and any symptom persistence at 12 weeks or more via logistic regression, gradient boosted tree models, and generalized additive models. Finally, they used CLustering LARge Applications (CLARA) to identify symptom clusters among participants with lingering symptoms 12 or more weeks from their COVID-19 onset.
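
To make the clustering step concrete, here is a minimal sketch of the CLARA idea: draw several small random subsamples, find medoids on each, score each medoid set against the full data, and keep the cheapest. Everything in it is an illustrative assumption rather than the authors' implementation: the data are synthetic, the six symptom columns and their probabilities are invented, Hamming distance is assumed as the dissimilarity, and a simple alternating k-medoids update stands in for the full PAM swap search that CLARA normally runs on each subsample.

```python
import numpy as np

def k_medoids(X, k, rng, n_iter=20):
    """Alternating k-medoids on a small sample (a stand-in for PAM)."""
    # Pairwise Hamming distances between binary symptom vectors.
    D = (X[:, None, :] != X[None, :, :]).sum(axis=2)
    medoids = rng.choice(len(X), size=k, replace=False)
    for _ in range(n_iter):
        labels = D[:, medoids].argmin(axis=1)
        new = medoids.copy()
        for j in range(k):
            members = np.flatnonzero(labels == j)
            if members.size:
                # New medoid: the member with the smallest total distance
                # to the other members of its cluster.
                within = D[np.ix_(members, members)].sum(axis=1)
                new[j] = members[within.argmin()]
        if np.array_equal(new, medoids):
            break
        medoids = new
    return medoids

def clara(X, k, n_draws=5, sample_size=200, seed=0):
    """Cluster subsamples, then keep the medoids that fit the full data best."""
    rng = np.random.default_rng(seed)
    best_cost, best_medoids = np.inf, None
    for _ in range(n_draws):
        idx = rng.choice(len(X), size=min(sample_size, len(X)), replace=False)
        sub = X[idx]
        med = sub[k_medoids(sub, k, rng)]
        # Cost of this medoid set evaluated on the whole dataset.
        d = (X[:, None, :] != med[None, :, :]).sum(axis=2)
        cost = d.min(axis=1).sum()
        if cost < best_cost:
            best_cost, best_medoids = cost, med
    d = (X[:, None, :] != best_medoids[None, :, :]).sum(axis=2)
    return d.argmin(axis=1), best_medoids

# Synthetic data (hypothetical): columns are tiredness, muscle aches,
# difficulty sleeping, shortness of breath, chest tightness, chest pain.
rng = np.random.default_rng(1)
truth = (rng.random(1000) < 0.3).astype(int)  # 1 = respiratory-type profile
probs = np.where(truth[:, None] == 0,
                 [0.9, 0.9, 0.9, 0.1, 0.1, 0.1],
                 [0.1, 0.1, 0.1, 0.9, 0.9, 0.9])
X = (rng.random((1000, 6)) < probs).astype(int)

labels, medoids = clara(X, k=2)
agreement = max((labels == truth).mean(), (labels != truth).mean())
print(medoids, round(float(agreement), 3))
```

Because only the subsamples need a full pairwise distance matrix, the approach scales to samples far larger than exact PAM can handle, which is the usual reason for choosing CLARA on a dataset of this size.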

    Summary of main findings

    Of the 508,707 participants in REACT-2 rounds three to five (26-29% response rate across rounds), 76,155 reported a valid date of COVID-19 symptom onset 12 or more weeks before their survey date. A large percentage (37.7%) reported at least one persistent symptom at 12+ weeks from symptom onset, with 14.8% reporting at least three persistent symptoms. The predominant symptom at 12 weeks was tiredness, followed by shortness of breath, difficulty sleeping, and muscle aches. They calculated a weighted population prevalence in England of 5.75% (95% CI: 5.68, 5.82) for at least one persistent symptom and 2.22% (95% CI: 2.18, 2.26) for three or more persistent symptoms. Overall, female participants reported more persistent symptoms than male participants (age-adjusted OR: 1.51, 95% CI: 1.46, 1.55 for 12+ weeks of symptoms), and symptom persistence increased with age. After adjustment for age and sex, comorbidities, weight, smoking, vaping, living in deprived areas, having a low income, and being a healthcare worker were each associated with increased reports of symptom persistence at 12+ weeks. In the clustering analysis (N=53,309), they identified two clusters of persistent symptoms: a “tiredness cluster” (N=15,799, 30%), which included fatigue, muscle aches, and difficulty sleeping, and a “respiratory cluster” (N=4,441, 9%), which included shortness of breath, chest tightness, and chest pain. The “respiratory cluster” was more common among those with a history of more severe COVID-19 initially.
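
As a rough illustration of how a weighted population prevalence and its interval can be computed from survey responses, the sketch below uses a weighted mean with a normal-approximation interval based on Kish's effective sample size. This is a simplification chosen for illustration; the variance method actually used in the study is not described here, and the outcomes and weights in the example are made up.

```python
import math

def weighted_prevalence(outcomes, weights, z=1.96):
    """Weighted proportion with an approximate confidence interval.

    outcomes: 0/1 indicators (1 = persistent symptoms reported)
    weights:  survey weights, one per respondent
    """
    total_w = sum(weights)
    p = sum(w * y for y, w in zip(outcomes, weights)) / total_w
    # Kish effective sample size: unequal weights shrink the usable n.
    n_eff = total_w ** 2 / sum(w * w for w in weights)
    se = math.sqrt(p * (1 - p) / n_eff)
    return p, (max(0.0, p - z * se), min(1.0, p + z * se))

# Hypothetical respondents: under-represented groups get weights above 1.
outcomes = [1, 0, 0, 1, 0, 0, 0, 0]
weights = [0.5, 1.0, 1.0, 2.0, 1.0, 1.0, 1.5, 1.0]

prev, (ci_low, ci_high) = weighted_prevalence(outcomes, weights)
print(f"weighted prevalence {prev:.3f} (95% CI {ci_low:.3f}, {ci_high:.3f})")
```

In this toy example the unweighted prevalence is 2/8, while the weighted estimate is higher because one of the two positive respondents carries a weight of 2.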

    Study strengths

    This national study included a large number of randomly selected participants across England drawn from NHS patient lists. The study sample therefore included individuals who tested positive for SARS-CoV-2 in the community regardless of their initial disease severity or whether or not they were hospitalized. The study used multivariable analyses to identify factors with independent associations with COVID-19 sequelae.

    Limitations

    This survey had a relatively low response rate (<30%), which raises questions about how representative respondents were of the initial sampling frame or the national population. Without data comparing respondents and non-respondents, it is difficult to estimate the magnitude or direction of bias, but it is plausible that individuals with prolonged COVID-19 symptoms may be more likely to participate than those without sequelae, potentially overestimating the prevalence of long COVID-19 among those with symptomatic acute infections. It is not clear when symptom onset occurred (i.e. whether it was caused by COVID-19 or whether it was prevalent before COVID-19 onset, both of which could be complicated by initial symptomatic COVID-19 duration). By focusing on “any” symptom prevalence at 12+ weeks, they may be overestimating symptom persistence, especially considering many symptoms reported could be influenced by non-COVID-19 conditions and lifestyle factors (such as fatigue and headaches).
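
The direction of the non-response bias described above can be shown with simple arithmetic. The sketch below computes the prevalence that would be observed among respondents when people with persistent symptoms respond at a different rate from those who recovered. All input values are hypothetical and chosen only to illustrate the mechanism; they are not estimates from the study.

```python
def observed_prevalence(true_prev, resp_rate_persistent, resp_rate_recovered):
    """Prevalence among survey respondents under differential response rates."""
    responders_persistent = true_prev * resp_rate_persistent
    responders_recovered = (1 - true_prev) * resp_rate_recovered
    return responders_persistent / (responders_persistent + responders_recovered)

# Hypothetical scenario: 20% truly have persistent symptoms, and they are
# twice as likely to respond (40%) as those who recovered (20%).
biased = observed_prevalence(0.20, 0.40, 0.20)

# With equal response rates the observed value matches the true prevalence.
unbiased = observed_prevalence(0.20, 0.30, 0.30)

print(f"biased: {biased:.3f}, unbiased: {unbiased:.3f}")
```

Survey weights that adjust only for demographic characteristics would not remove this kind of bias, because the response propensity here depends on the outcome itself.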

    Value added

    This population-based study estimated the national prevalence of COVID-19 symptoms that persist 12+ weeks from initial COVID-19 onset, highlighting the long-term implications of SARS-CoV-2 infection. The large sample size and random sampling methodology of this study have likely captured a more representative range of post-COVID-19 experiences than studies relying on samples who were hospitalized with COVID-19 or those who responded to surveys on social media.

  2. SciScore for 10.1101/2021.06.28.21259452

    Please note, not all rigor criteria are appropriate for all manuscripts.

    Table 1: Rigor

    Ethics: not detected.
    Sex as a biological variable: not detected.
    Randomization: not detected.
    Blinding: not detected.
    Power Analysis: not detected.

    Table 2: Resources

    Antibodies
    Sentence: Participants: The REACT-2 programme evaluates community prevalence of SARS-CoV-2 anti-spike protein antibody positivity in England.
    Resource: anti-spike protein (suggested: None)

    Results from OddPub: Thank you for sharing your code and data.


    Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
    Strengths and Limitations: This study uses a large random community sample with a high response rate (26–29% across the three rounds) to describe the persistence of COVID-19 symptoms. It is therefore more likely to be representative of the range of disease severity in the population compared to some others, especially those based on hospitalised cases alone [5]. The focus in our questionnaire on persistence of self-reported COVID-19 symptoms, without specific reference to Long COVID (in contrast to the questions in some other studies [7]) has allowed us to investigate the persistence of a wide range of specific symptoms that have been suggested as relevant to Long COVID [3,5]. However, it is clear that a wide spectrum of symptoms and clinical presentations post-COVID-19 may be involved; our open free-text question identified a number of symptoms not included in our questionnaire including “brain fog”, “palpitations” and “hair loss” [29]. However, as the study was based on self-reported data and because many of the symptoms are common and not specific to COVID-19, we may, as noted, have overestimated the prevalence of persistent symptoms. A further limitation is the retrospective study design, which introduces the possibility of recall bias. Nonetheless, in earlier analyses we have shown that participant reports of date of onset of their symptoms produce an epidemic curve that very closely tracks the epidemic [27,30,31]. Respondents were restricted to reporting a single date of (initial) sym...

    Results from TrialIdentifier: No clinical trial numbers were referenced.


    Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


    Results from JetFighter: We did not find any issues relating to colormaps.


    Results from rtransparent:
    • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
    • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
    • No protocol registration statement was detected.

    Results from scite Reference Check: We found no unreliable references.


    About SciScore

    SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.