Whole-genome demography of COVID-19 virus during its pandemic period and on “panvalent” vaccine design

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

With over 16 million submitted genomic sequences, the SARS-CoV-2 (SC2) virus, the cause of the most recent worldwide pandemic COVID-19 pandemic, has become the most sequenced genome of all known viruses (1), revealing, for example, a vast number of expanding viral lineages. Since the pandemic phase appears to be over, we performed a retrospective re-examination of the demographic grouping pattern and their genomic characteristics during the entire pandemic period up to the peak of the last pandemic wave. For our study, we extracted from the NCBI (2) only unique viral sequences and converted each sequence data to a relational vector, indicating the presence/absence of each variational event compared to a “reference” sequence. Our study revealed several genomic features that are unexpected or different from those of previous studies (3 – 9). For example, approximately 44,000 variants with unique sequences emerged during the pandemic period; they group into only four major demographic groups and each has a set of mostly unique highly-conserved variant-genotypes (HCVGs); and the set from the first group was passed down to the remaining three groups, suggesting that HCVGs in the next group may be predictable from the HCVGs of the current group(s). Such a concept may be potentially important in designing “panvalent” vaccines against the current and future waves of viral infections.

Article activity feed