Genomic variance of the 2019‐nCoV coronavirus

Carmine Ceraolo
Federico M. Giorgi

This article has been Reviewed by the following groups

Read the full article

Listed in

Evaluated articles (ScreenIT)

Abstract

There is a rising global concern for the recently emerged novel coronavirus (2019‐nCoV). Full genomic sequences have been released by the worldwide scientific community in the last few weeks to understand the evolutionary origin and molecular characteristics of this virus. Taking advantage of all the genomic information currently available, we constructed a phylogenetic tree including also representatives of other coronaviridae, such as Bat coronavirus (BCoV) and severe acute respiratory syndrome. We confirm high sequence similarity (>99%) between all sequenced 2019‐nCoVs genomes available, with the closest BCoV sequence sharing 96.2% sequence identity, confirming the notion of a zoonotic origin of 2019‐nCoV. Despite the low heterogeneity of the 2019‐nCoV genomes, we could identify at least two hypervariable genomic hotspots, one of which is responsible for a Serine/Leucine variation in the viral ORF8‐encoded protein. Finally, we perform a full proteomic comparison with other coronaviridae, identifying key aminoacidic differences to be considered for antiviral strategies deriving from previous anti‐coronavirus approaches.

SciScore for 10.1101/2020.02.02.931162: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

NIH rigor criteria are not applicable to paper type.

Table 2: Resources

Software and Algorithms
Sentences	Resources
MSA visualization was generated via Jalview v 2.11.0 [17]	Jalview suggested: (Jalview, RRID:SCR_006459)
Phylogenetic model generation and tree visualization was done using MEGAX v 10.1.7 [18], using the Maximum Likelihood method and Tamura-Nei model [10].	MEGAX suggested: None
Categorical Principal Component Analysis was performed on R version 3.6.1 using the package FactoMineR [11].	FactoMineR suggested: (FactoMineR, RRID:SCR_014602)
Specifically, a MSA FASTA file from MUSCLE is loaded in R and converted into a categorical matrix, with genomes as rows and nucleotide coordinates as columns.	MUSCLE su…

SciScore for 10.1101/2020.02.02.931162: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

NIH rigor criteria are not applicable to paper type.

Table 2: Resources

Software and Algorithms
Sentences	Resources
MSA visualization was generated via Jalview v 2.11.0 [17]	Jalview suggested: (Jalview, RRID:SCR_006459)
Phylogenetic model generation and tree visualization was done using MEGAX v 10.1.7 [18], using the Maximum Likelihood method and Tamura-Nei model [10].	MEGAX suggested: None
Categorical Principal Component Analysis was performed on R version 3.6.1 using the package FactoMineR [11].	FactoMineR suggested: (FactoMineR, RRID:SCR_014602)
Specifically, a MSA FASTA file from MUSCLE is loaded in R and converted into a categorical matrix, with genomes as rows and nucleotide coordinates as columns.	MUSCLE suggested: (MUSCLE, RRID:SCR_011812)
Nucleotide sequence identity and coverage were calculated using BLAST nucleotide v2.6.0 [20].	BLAST suggested: (BLASTX, RRID:SCR_001653)

Results from OddPub: Thank you for sharing your data.

Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.

Results from TrialIdentifier: No clinical trial numbers were referenced.

Results from Barzooka: We did not find any issues relating to the usage of bar graphs.

Results from JetFighter: We did not find any issues relating to colormaps.

Results from rtransparent:

No conflict of interest statement was detected. If there are no conflicts, we encourage authors to explicit state so.
No funding statement was detected.
No protocol registration statement was detected.

Read the original source

SciScore for 10.1101/2020.02.02.931162: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

NIH rigor criteria are not applicable to paper type.

Table 2: Resources

Software and Algorithms
Sentences	Resources
We aligned all the 70 coronavirus sequences using MUSCLE [9] and inferred the evolutionary relationships between these sequences with a Tamura-Nei Maximum Likelihood estimation [10] with 100 bootstraps for model robustness estimation.	MUSCLE suggested: (MUSCLE, SCR_011812)
Phylogenetic model generation and tree visualization was done using MEGAX v 10.1.7 [ 18] , using the Maximum Likelihood method and Tamura-Nei model [ 10] .	MEGAX suggested: None
Categorical Principal Component Analysis was performed on R version 3.6.1 using the package FactoMineR [ 11] .	FactoMineR suggested: (FactoMineR, SCR_014602)
Pairwise protein identity and coverage was calculated using BLAST protein v2.6.0 [ 20 ] with BLOSUM62 matrix and default parameters .	BLAST suggested: (BLASTX, SCR_001653)
MSA visualization was generated via Jalview v 2.11.0 [ 17]	Jalview suggested: (Jalview, SCR_006459)

Results from OddPub: Thank you for sharing your data.

About SciScore

SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore is not a substitute for expert review. SciScore checks for the presence and correctness of RRIDs (research resource identifiers) in the manuscript, and detects sentences that appear to be missing RRIDs. SciScore also checks to make sure that rigor criteria are addressed by authors. It does this by detecting sentences that discuss criteria such as blinding or power analysis. SciScore does not guarantee that the rigor criteria that it detects are appropriate for the particular study. Instead it assists authors, editors, and reviewers by drawing attention to sections of the manuscript that contain or should contain various rigor criteria and key resources. For details on the results shown here, including references cited, please follow this link.

Read the original source

Version published to 10.1002/jmv.25700
Feb 19, 2020
Version published to 10.1101/2020.02.02.931162v2 on bioRxiv
Feb 5, 2020
Version published to 10.1101/2020.02.02.931162v1 on bioRxiv
Feb 4, 2020

First full-genome alignment representative for the genus Pestivirus

This article has 6 authors:
1. Sandra Triebel
2. Tom Eulenfeld
3. Nancy Ontiveros-Palacios
4. Blake Sweeney
5. Norbert Tautz
6. Manja Marz
This article has no evaluationsLatest version May 27, 2025
Rodents as Potential Reservoirs for Toroviruses

This article has 6 authors:
1. Jaime Buigues
2. Adrià Viñals
3. Raquel Martínez-Recio
4. Juan S. Monrós
5. Rafael Sanjuán
6. José M. Cuevas
This article has no evaluationsLatest version Jun 13, 2025
A conserved region T-cell vaccine for Sarbecoviruses

This article has 16 authors:
1. Marta Escarra-Senmarti
2. Jacqueline Brockhurst
3. Amanda R Maxwell
4. Taylor Godwin
5. Tianle Zhang
6. Yuzheng Chen
7. Peter K Um
8. Ludmila Danilova
9. Wen-Chi Hsu
10. Cheng Ting Lin
11. Tomáš Hanke
12. William R Bishai
13. Andrew Pekosz
14. Cory Brayton
15. Bette Korber
16. Maximillian Rosario
This article has no evaluationsLatest version May 22, 2025

Genomic variance of the 2019‐nCoV coronavirus

This article has been Reviewed by the following groups

Listed in

Abstract

Article activity feed

First full-genome alignment representative for the genus Pestivirus

Rodents as Potential Reservoirs for Toroviruses

A conserved region T-cell vaccine for Sarbecoviruses

This article has been Reviewed by the following groups

Listed in

Abstract

Article activity feed

Related articles

First full-genome alignment representative for the genus Pestivirus

Rodents as Potential Reservoirs for Toroviruses

A conserved region T-cell vaccine for Sarbecoviruses