The novel Coronavirus enigma: Phylogeny and mutation analyses of SARS-CoV-2 viruses circulating in India during early 2020


Abstract

Background

This is a comprehensive analysis of 46 Indian SARS-CoV-2 genome sequences available from the NCBI and GISAID repositories during early 2020. Evolutionary dynamics, gene-specific phylogeny and the emergence of novel co-evolving mutations in nine structural and non-structural genes among SARS-CoV-2 strains circulating in ten states of India have been assessed.

Materials and methods

Forty-six SARS-CoV-2 nucleotide sequences submitted from India were downloaded from the GISAID (39/46) or NCBI (7/46) databases. The phylogenetic study and mutation analyses were based on nine structural and non-structural genes of the SARS-CoV-2 strains. The secondary structure of the RdRP/NSP12 protein was predicted with respect to the novel A97V mutation.
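
As an illustration of the kind of mutation analysis described above, the following is a minimal sketch of how nucleotide substitutions could be listed from a pairwise-aligned reference and sample sequence. It is not the authors' pipeline: the function name `call_substitutions` and the toy sequences are hypothetical, and the sketch assumes the two sequences have already been aligned (for example with MUSCLE) so that gaps are written as `-`.

```python
def call_substitutions(ref_aligned: str, sample_aligned: str) -> list[str]:
    """List substitutions in 'posREF>ALT' notation (e.g. '11083G>T').

    Positions are 1-based and counted along the ungapped reference.
    Gaps and ambiguous bases (such as N) are skipped, not reported.
    """
    if len(ref_aligned) != len(sample_aligned):
        raise ValueError("aligned sequences must have equal length")

    calls = []
    ref_pos = 0  # coordinate on the ungapped reference
    for ref_base, sample_base in zip(ref_aligned.upper(), sample_aligned.upper()):
        if ref_base == "-":
            # Insertion relative to the reference: no reference coordinate.
            continue
        ref_pos += 1
        both_unambiguous = ref_base in "ACGT" and sample_base in "ACGT"
        if both_unambiguous and ref_base != sample_base:
            calls.append(f"{ref_pos}{ref_base}>{sample_base}")
    return calls


# Toy example (hypothetical sequences, not real SARS-CoV-2 data).
toy_calls = call_substitutions("ACG-TTG", "ACGATTT")
```

Counting positions on the ungapped reference keeps the reported coordinates comparable across samples, which is what allows the same substitution to be recognised as shared between isolates.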

Results

Phylogenetic analyses revealed the evolution of “genome-type clusters” and adaptive selection of “L” type SARS-CoV-2 strains, which are genetically closer to bat SARS-like coronaviruses than to pangolin coronaviruses or MERS-CoVs. With regard to the novel co-evolving mutations, two groups currently circulate in India: the “major group” (52.2%) and the “minor group” (30.4%), harboring four and five co-existing mutations, respectively. The “major group” mutations fall in the A2a clade. All the minor group mutations, except 11083G>T (L37F, NSP6), were unique to the Indian isolates.
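
The notation 11083G>T (L37F, NSP6) pairs a genomic coordinate with an amino-acid change. The sketch below shows how the two can be related under stated assumptions: that NSP6 begins at nucleotide 10973 of the reference genome (the coordinate used in the NC_045512.2 annotation) and that the reference codon at that site is TTG. Neither value is given in the abstract, and the helper names are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def codon_location(genome_pos: int, gene_start: int) -> tuple[int, int]:
    """Return (1-based codon number, 0-based index within that codon)."""
    offset = genome_pos - gene_start
    return offset // 3 + 1, offset % 3


def amino_acid_change(ref_codon: str, index_in_codon: int, alt_base: str) -> tuple[str, str]:
    """Translate a codon before and after a single-base substitution."""
    mutated = ref_codon[:index_in_codon] + alt_base + ref_codon[index_in_codon + 1:]
    return CODON_TABLE[ref_codon], CODON_TABLE[mutated]


NSP6_START = 10973   # assumption: NSP6 start in the reference annotation
REF_CODON = "TTG"    # assumption: reference codon covering position 11083

codon_number, index_in_codon = codon_location(11083, NSP6_START)
ref_aa, alt_aa = amino_acid_change(REF_CODON, index_in_codon, "T")
label = f"{ref_aa}{codon_number}{alt_aa}"
```

Under these assumptions position 11083 is the third base of codon 37, and the G>T change turns a leucine codon into a phenylalanine codon, consistent with the L37F label. This simple offset arithmetic would not apply directly to RdRP/NSP12, whose coding sequence spans a ribosomal frameshift.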

Conclusion

The study highlights the rapid evolution of SARS-CoV-2 and the co-circulation of multiple clades and sub-clades that are driving the pandemic worldwide. This comprehensive study is a potential resource for monitoring novel mutations in the viral genome, tracking changes in viral pathogenesis, and designing vaccines and other therapeutics.

Article activity feed

  1. SciScore for 10.1101/2020.05.25.114199:

    Please note, not all rigor criteria are appropriate for all manuscripts.

    Table 1: Rigor

    NIH rigor criteria are not applicable to paper type.

    Table 2: Resources

    Software and Algorithms
    Sentence: Multiple sequence alignment for all the respective set of gene sequences was done using MUSCLE v3.8.31.
    Resource: MUSCLE
    suggested: (MUSCLE, RRID:SCR_011812)

    Sentence: Phylogenetic dendrograms were constructed by MEGA, version X (Molecular Evolutionary Genetics Analysis), using the maximum-likelihood statistical method (at 1000 bootstrap replicates), using the best fit nucleotide substitution models for each dendrogram.
    Resource: MEGA
    suggested: (Mega BLAST, RRID:SCR_011920)

    Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


    Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.

    Results from TrialIdentifier: No clinical trial numbers were referenced.


    Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


    Results from JetFighter: We did not find any issues relating to colormaps.


    Results from rtransparent:
    • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
    • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
    • No protocol registration statement was detected.

    About SciScore

    SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.