Mapping genome variation of SARS-CoV-2 worldwide highlights the impact of COVID-19 super-spreaders
This article has been Reviewed by the following groups
Listed in
- Evaluated articles (ScreenIT)
Abstract
The human pathogen severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is responsible for the major pandemic of the twenty-first century. We analyzed more than 4700 SARS-CoV-2 genomes and associated metadata retrieved from public repositories. SARS-CoV-2 sequences have a high sequence identity (>99.9%), which drops to >96% when compared to bat coronavirus genome. We built a mutation-annotated reference SARS-CoV-2 phylogeny with two main macro-haplogroups, A and B, both of Asian origin, and more than 160 sub-branches representing virus strains of variable geographical origins worldwide, revealing a rather uniform mutation occurrence along branches that could have implications for diagnostics and the design of future vaccines. Identification of the root of SARS-CoV-2 genomes is not without problems, owing to conflicting interpretations derived from either using the bat coronavirus genomes as an outgroup or relying on the sampling chronology of the SARS-CoV-2 genomes and TMRCA estimates; however, the overall scenario favors haplogroup A as the ancestral node. Phylogenetic analysis indicates a TMRCA for SARS-CoV-2 genomes dating to November 12, 2019, thus matching epidemiological records. Sub-haplogroup A2 most likely originated in Europe from an Asian ancestor and gave rise to subclade A2a, which represents the major non-Asian outbreak, especially in Africa and Europe. Multiple founder effect episodes, most likely associated with super-spreader hosts, might explain COVID-19 pandemic to a large extent.
Article activity feed
-
-
-
SciScore for 10.1101/2020.05.19.097410: (What is this?)
Please note, not all rigor criteria are appropriate for all manuscripts.
Table 1: Rigor
NIH rigor criteria are not applicable to paper type.Table 2: Resources
Software and Algorithms Sentences Resources Alignment of SARS-CoV-2 genomes against the reference sequence was carried out using MUSCLE v3.8.31 program (Edgar 2004). MUSCLEsuggested: (MUSCLE, RRID:SCR_011812)Interspecific alignment was carried out using MAFFT program (Katoh et al. 2002) with default parameters. MAFFTsuggested: (MAFFT, RRID:SCR_011811)For this, we used the function cmdscale (library stats) from the statistical software R Project for Statistical Computing v. 3.3.1 (https://www.r-project.org/; (Team 2012)). R Project for Statisticalsuggested: (R Project for Statistical Computing, RRID:SCR_001905)The best maximum … SciScore for 10.1101/2020.05.19.097410: (What is this?)
Please note, not all rigor criteria are appropriate for all manuscripts.
Table 1: Rigor
NIH rigor criteria are not applicable to paper type.Table 2: Resources
Software and Algorithms Sentences Resources Alignment of SARS-CoV-2 genomes against the reference sequence was carried out using MUSCLE v3.8.31 program (Edgar 2004). MUSCLEsuggested: (MUSCLE, RRID:SCR_011812)Interspecific alignment was carried out using MAFFT program (Katoh et al. 2002) with default parameters. MAFFTsuggested: (MAFFT, RRID:SCR_011811)For this, we used the function cmdscale (library stats) from the statistical software R Project for Statistical Computing v. 3.3.1 (https://www.r-project.org/; (Team 2012)). R Project for Statisticalsuggested: (R Project for Statistical Computing, RRID:SCR_001905)The best maximum likelihood tree was visualized and edited using FigTree v. FigTreesuggested: (FigTree, RRID:SCR_008515)The demography of SARS-CoV-2 sequences was inferred using the Extended Bayesian Skyline Plot method (EBSP) (Heled and Drummond 2008) implemented in BEAST v2.6.2 (Drummond and Rambaut 2007). BEASTsuggested: (BEAST, RRID:SCR_010228)Results from OddPub: Thank you for sharing your data.
Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.Results from TrialIdentifier: No clinical trial numbers were referenced.
Results from Barzooka: We did not find any issues relating to the usage of bar graphs.
Results from JetFighter: We did not find any issues relating to colormaps.
Results from rtransparent:- Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
- No funding statement was detected.
- No protocol registration statement was detected.
-
-