COVID-19 Variants Database: A repository for Human SARS-CoV-2 Polymorphism Data

This article has been Reviewed by the following groups

Read the full article

Abstract

COVID-19 is a newly communicable disease with a catastrophe outbreak that affects all over the world. We retrieved about 8,781 nucleotide fragments and complete genomes of SARS-CoV-2 reported from sixty-four countries. The CoV-2 reference genome was obtained from the National Genomics Data Center (NGDC), GISAID, and NCBI Genbank. All the sequences were aligned against reference genomes using Clustal Omega and variants were called using in-house built Python script. We intend to establish a user-friendly online resource to visualize the variants in the viral genome along with the Primer Infopedia. After analyzing and filtering the data globally, it was made available to the public. The detail of data available to the public includes mutations from 5688 SARS-CoV-2 sequences curated from 91 regions. This database incorporated 39920 mutations over 3990 unique positions. According to the translational impact, these mutations include 11829 synonymous mutations including 681 synonymous frameshifts and 21701 nonsynonymous mutations including 10 nonsynonymous frameshifts. Development of SARS-CoV-2 mutation genome browsers is a fundamental step obliging towards the virus surveillance, viral detection, and development of vaccine and therapeutic drugs. The SARS-COV-2 mutation browser is available at http://covid-19.dnageography.com .

Article activity feed

  1. SciScore for 10.1101/2020.06.10.145292: (What is this?)

    Please note, not all rigor criteria are appropriate for all manuscripts.

    Table 1: Rigor

    Institutional Review Board Statementnot detected.
    Randomizationnot detected.
    Blindingnot detected.
    Power Analysisnot detected.
    Sex as a biological variablenot detected.

    Table 2: Resources

    No key resources detected.


    Results from OddPub: Thank you for sharing your data.


    Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
    A potential limitation of this mutation browser is that since all SARS-CoV-2 genome sequences used in this study were retrieved from different databases and the accuracy of these genome sequences could not be verified although these partial or complete sequences have been published worldwide. In conclusion, the COVID-19 mutation browser is a worthful addition to the currently available coronavirus resources that will allow the researchers to access data expeditiously and intuitively on SARS-CoV-2 mutation. We expect the SARS-CoV-2 mutation browser will assist the process of study design and be a key resource for the COVID-19 community.

    Results from TrialIdentifier: No clinical trial numbers were referenced.


    Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


    Results from JetFighter: We did not find any issues relating to colormaps.


    Results from rtransparent:
    • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
    • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
    • No protocol registration statement was detected.

    About SciScore

    SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.