Real-time monitoring epidemic trends and key mutations in SARS-CoV-2 evolution by an automated tool

This article has been Reviewed by the following groups

Read the full article

Abstract

With the global epidemic of SARS-CoV-2, it is important to monitor the variation, haplotype subgroup epidemic trends and key mutations of SARS-CoV-2 over time effectively, which is of great significance to the development of new vaccines, the update of therapeutic drugs, and the improvement of detection reagents. The AutoVEM tool developed in the present study could complete all mutations detections, haplotypes classification, haplotype subgroup epidemic trends and key mutations analysis for 131,576 SARS-CoV-2 genome sequences in 18 hours on a 1 core CPU and 2G internal storage computer. Through haplotype subgroup epidemic trends analysis of 131,576 genome sequences, the great significance of the previous 4 specific sites (C241T, C3037T, C14408T and A23403G) was further revealed, and 6 new mutation sites of highly linked (T445C, C6286T, C22227T, G25563T, C26801G and G29645T) were discovered for the first time that might be related to the infectivity, pathogenicity or host adaptability of SARS-CoV-2. In brief, we proposed an integrative method and developed an efficient automated tool to monitor haplotype subgroup epidemic trends and screen out the key mutations in the evolution of SARS-CoV-2 over time for the first time, and all data could be updated quickly to track the prevalence of previous key mutations and new key mutations because of high efficiency of the tool. In addition, the idea of combinatorial analysis in the present study can also provide a reference for the mutation monitoring of other viruses.

Article activity feed

  1. SciScore for 10.1101/2020.12.24.424271: (What is this?)

    Please note, not all rigor criteria are appropriate for all manuscripts.

    Table 1: Rigor

    Institutional Review Board Statementnot detected.
    Randomizationnot detected.
    Blindingnot detected.
    Power Analysisnot detected.
    Sex as a biological variablenot detected.

    Table 2: Resources

    Software and Algorithms
    SentencesResources
    AutoVEM is written in python language (python 3.8.6) and runs on Linux machines with centos.
    python
    suggested: (IPython, RRID:SCR_001658)
    And bowtie2 (17), samtools (18), bcftools (19), vcftools (20) and haploview (21) were applied in this automated tool.
    samtools
    suggested: (SAMTOOLS, RRID:SCR_002105)
    Workflow of AutoVEM:
    AutoVEM
    suggested: None

    Results from OddPub: Thank you for sharing your code and data.


    Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.

    Results from TrialIdentifier: No clinical trial numbers were referenced.


    Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


    Results from JetFighter: We did not find any issues relating to colormaps.


    Results from rtransparent:
    • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
    • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
    • No protocol registration statement was detected.

    About SciScore

    SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.