The Runaway Evolution of SARS-CoV-2 Leading to the Highly Evolved Delta Strain

This article has been Reviewed by the following groups

Read the full article

Abstract

In new epidemics after the host shift, the pathogens may experience accelerated evolution driven by novel selective pressures. When the accelerated evolution enters a positive feedback loop with the expanding epidemics, the pathogen’s runaway evolution may be triggered. To test this possibility in coronavirus disease 2019 (COVID-19), we analyze the extensive databases and identify five major waves of strains, one replacing the previous one in 2020–2021. The mutations differ entirely between waves and the number of mutations continues to increase, from 3-4 to 21-31. The latest wave in the fall of 2021 is the Delta strain which accrues 31 new mutations to become highly prevalent. Interestingly, these new mutations in Delta strain emerge in multiple stages with each stage driven by 6–12 coding mutations that form a fitness group. In short, the evolution of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) from the oldest to the youngest wave, and from the earlier to the later stages of the Delta wave, is a process of acceleration with more and more mutations. The global increase in the viral population size (M(t), at time t) and the mutation accumulation (R(t)) may have indeed triggered the runaway evolution in late 2020, leading to the highly evolved Alpha and then Delta strain. To suppress the pandemic, it is crucial to break the positive feedback loop between M(t) and R(t), neither of which has yet to be effectively dampened by late 2021. New waves after Delta, hence, should not be surprising.

Article activity feed

  1. SciScore for 10.1101/2021.12.30.474592: (What is this?)

    Please note, not all rigor criteria are appropriate for all manuscripts.

    Table 1: Rigor

    Ethicsnot detected.
    Sex as a biological variablenot detected.
    Randomizationnot detected.
    Blindingnot detected.
    Power Analysisnot detected.

    Table 2: Resources

    Software and Algorithms
    SentencesResources
    Sequence alignment, SNP calling and annotation: We aligned these 1,853,355 genome sequences to the reference sequence (Wuhan-Hu-1(Wu, et al. 2020), GenBank: NC_045512, GISAID: EPI_ISL_402125) using MAFFT (--auto -- keeplength)(Katoh and Standley 2013).
    MAFFT
    suggested: (MAFFT, RRID:SCR_011811)

    Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


    Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.

    Results from TrialIdentifier: No clinical trial numbers were referenced.


    Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


    Results from JetFighter: We did not find any issues relating to colormaps.


    Results from rtransparent:
    • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
    • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
    • No protocol registration statement was detected.

    Results from scite Reference Check: We found no unreliable references.


    About SciScore

    SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.