Prediction of the Receptorome for the Human-Infecting Virome
This article has been Reviewed by the following groups
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
- Evaluated articles (ScreenIT)
Abstract
No abstract available
Article activity feed
-
-
SciScore for 10.1101/2020.02.27.967885: (What is this?)
Please note, not all rigor criteria are appropriate for all manuscripts.
Table 1: Rigor
NIH rigor criteria are not applicable to paper type.Table 2: Resources
Software and Algorithms Sentences Resources Human cell membrane proteins and human membrane proteins were obtained from the UniProtKB/Swiss-Prot database on February 21, 2020. UniProtKB/Swiss-Protsuggested: NoneFor those proteins without the annotation of N-glycosylation sites in the UniprotKB/Swiss-Prot database, their N-glycosylation sites were predicted with NetNGlyc 1.0 (available at http://www.cbs.dtu.dk/services/NetNGlyc/) (Gupta et al., 2004). NetNGlycsuggested: (NetNGlyc, RRID:SCR_001570)To calculate the node degree of the human proteins in the human PPI network, firstly, the human PPIs with the combined scores greater than … SciScore for 10.1101/2020.02.27.967885: (What is this?)
Please note, not all rigor criteria are appropriate for all manuscripts.
Table 1: Rigor
NIH rigor criteria are not applicable to paper type.Table 2: Resources
Software and Algorithms Sentences Resources Human cell membrane proteins and human membrane proteins were obtained from the UniProtKB/Swiss-Prot database on February 21, 2020. UniProtKB/Swiss-Protsuggested: NoneFor those proteins without the annotation of N-glycosylation sites in the UniprotKB/Swiss-Prot database, their N-glycosylation sites were predicted with NetNGlyc 1.0 (available at http://www.cbs.dtu.dk/services/NetNGlyc/) (Gupta et al., 2004). NetNGlycsuggested: (NetNGlyc, RRID:SCR_001570)To calculate the node degree of the human proteins in the human PPI network, firstly, the human PPIs with the combined scores greater than 400 were extracted from the STRING database (version 10.5) (Szklarczyk et al., 2015) and were used to form the human PPI network. STRINGsuggested: (STRING, RRID:SCR_005223)Since there were strong correlations between the gene expression level in different tissues, the principal component analysis (PCA) method was used to reduce the correlations with the function of PCA in the package scikit-learn (version 0.21.3) (Pedregosa et al., 2011) in Python (version 3.6.7). scikit-learnsuggested: (scikit-learn, RRID:SCR_002577)Besides, the sequence redundancy in both human virus receptor proteins and human membrane proteins was removed using CD-HIT (version 4.8.1) (Fu et al., 2012) at 70% identity level. CD-HITsuggested: (CD-HIT, RRID:SCR_007105)Five times of five-fold cross-validations were conducted to evaluate the predictive performances of the RF model with the function of StratifiedKFold in the package scikit-learn in Python. Pythonsuggested: (IPython, RRID:SCR_001658)The RBPs of human-infecting viruses were compiled from three sources: the ViralZone database (Masson et al., 2012), the UniprotKB database in which viral proteins were annotated with GO terms “viral entry into host cell” or “virion attachment the host cell”, and the literatures related to viral RBPs. ViralZonesuggested: (ViralZone, RRID:SCR_006563)UniprotKBsuggested: (UniProtKB, RRID:SCR_004426)Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).
Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:There are some limitations to this study. Firstly, the number of human virus receptor proteins was much smaller than that of human membrane proteins in the modeling, which may hinder accurate modeling. Thus, the under-sampling method was used to deal with the imbalance problem. Secondly, the performance of the RF model was modest in discriminating human virus receptor proteins from human membrane proteins. More efforts are still needed to improve the model. Thirdly, although the RF model can be used to predict the receptorome of human-infecting virome, it is not feasible to use the model to identify the receptors for a specific human-infecting virus. The combination of the RF model with the model of PPI predictions such as Lasso’s work can help identify virus-receptor interactions. In conclusion, this study for the first time built a computational model for predicting the receptorome of the human-infecting virome. The results can facilitate the identification of human virus receptors in either computational or experimental studies.
Results from TrialIdentifier: No clinical trial numbers were referenced.
Results from Barzooka: We did not find any issues relating to the usage of bar graphs.
Results from JetFighter: We did not find any issues relating to colormaps.
Results from rtransparent:- Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
- Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
- No protocol registration statement was detected.
-
