Integrated network-based multiple computational analyses for identification of co-expressed candidate genes associated with neurological manifestations of COVID-19

This article has been Reviewed by the following groups

Read the full article

Abstract

‘Tripartite network’ (TN) and ‘combined gene network’ (CGN) were constructed and their hub-bottleneck and driver nodes (44 genes) were evaluated as ‘ target genes ’ (TG) to identify 21 ‘ candidate genes ’ (CG) and their relationship with neurological manifestations of COVID-19. TN was developed using neurological symptoms of COVID-19 found in literature. Under query genes (TG of TN), co-expressed genes were identified using pair-wise mutual information to genes available in RNA-Seq autopsy data of frontal cortex of COVID-19 victims. CGN was constructed with genes selected from TN and co-expressed in COVID-19. TG and their connecting genes of respective networks underwent functional analyses through findings of their enrichment terms and pair-wise ‘semantic similarity scores’ (SSS). A new integrated ‘weighted harmonic mean score’ was formulated assimilating values of SSS and STRING-based ‘combined score’ of the selected TG-pairs, which provided CG-pairs with properties of CGs as co-expressed and ‘indispensable nodes’ in CGN. Finally, six pairs sharing seven ‘ prevalent CGs’ ( ADAM10 , ADAM17 , AKT1 , CTNNB1 , ESR1 , PIK3CA , FGFR1 ) showed linkages with the phenotypes (a) directly under neurodegeneration, neurodevelopmental diseases, tumour/cancer and cellular signalling, and (b) indirectly through other CGs under behavioural/cognitive and motor dysfunctions. The pathophysiology of ‘ prevalent CGs’ has been discussed to interpret neurological phenotypes of COVID-19.

Article activity feed

  1. SciScore for 10.1101/2022.04.10.487761: (What is this?)

    Please note, not all rigor criteria are appropriate for all manuscripts.

    Table 1: Rigor

    NIH rigor criteria are not applicable to paper type.

    Table 2: Resources

    Software and Algorithms
    SentencesResources
    The terms or keywords of neurological symptoms (represented by nodes) related to COVID-19 were found by systematic PubMed bibliographic literature search.
    PubMed
    suggested: (PubMed, RRID:SCR_004846)
    Finding of the co-expressed genes of COVID-19: The important nodes (Hub-bottlenecks and driver nodes) evaluated from TN were considered as co-regulated in tissue and used as query genes to identify a set of co-expressed genes from RNA-Seq data59 (NCBI-GEO accession ID: GSE164332) of brain frontal cortex of COVID-19 patients (n=9) with healthy matched controls (n=7), using geneRecommender algorithm60,61 in R software and environment.
    geneRecommender
    suggested: None
    In silico modeling of STRING-PPI network using selected co-expressed genes and genes of TN: A combined set of 225 genes collected from minet output (co-expressed genes) and all 189 genes of TN, were incorporated as a query in STRING database56 using ‘STRING-PPI combined score’ (SPPICS) 0.60 as threshold, to construct a PPI-based ‘combined gene network’ (CGN).
    STRING
    suggested: (STRING, RRID:SCR_005223)
    Topology analysis of networks to find ‘hub’, ‘bottleneck’ and ‘driver’ nodes: The centrality measurements of networks were analyzed using the CentiScaPe module58 in Cytoscape software57 to find ‘hub’ (high degree) and ‘bottleneck’ (high-betweenness/shortest-path) nodes that having higher scores than cut-off values i.e.,
    Cytoscape
    suggested: (Cytoscape, RRID:SCR_003032)
    The GO terms (BP, CC, MF), KEGG pathway, disease modules (DisGeNET, Jensen Disease) analyses were performed using the Enrichr platform79, 80, an intuitive web-tool for gene over-representation study with an inclusive functional annotation set, by considering each gene-set as query.
    KEGG
    suggested: (KEGG, RRID:SCR_012773)
    Enrichr
    suggested: (Enrichr, RRID:SCR_001575)
    The pairwise SSS-II scores for statistically confident enriched terms were calculated using mclusterSim function in GOSemSim package75 as previously described methods.
    GOSemSim
    suggested: None

    Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


    Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
    The PPI model of STRING-database includes genes from prior knowledge and therefore has its own limitation56. The PPI model developed by semantic similarity scores has GO annotation biasness and unable to include genes not having sufficient annotation information. The integration of networks is supposed to perform better by filtering out the false positive interactions81,90. An approach has been reported to evaluate integrated score by combining semantic similarity scores of anatomy-based gene network and STRING-based PPI network by introducing ‘accuracy values of ROC’ as weightages to the respective scores followed by summation of weighted scores81. The same principal of weightage (‘accuracy values of ROC) was applied in the present study and the integrated scores (WHMS) were evaluated using harmonic mean of weightage scores for those gene-pairs which appeared to fulfil the criteria of having (a) three individual scores (SPPICS, SSS-I, SSS-II) and (b) at least one score with value above respective threshold level (Fig. 5b). Total 21 gene-pairs (solid edges in Fig. 5f) having integrated scores provided excellently strong fitted (AUC>0.9) and most accurate (95%) interactions (Fig. 5b) for prediction of 21 ‘candidate genes’ (Fig. 5c and Fig. 5f) involved in neurological insults (Fig. 5f) in COVID-19. All 21 gene-pairs/PPIs of ‘candidate genes’ showed SSS-I values (Fig. 3d, Fig. 4d, vide Point 4.2 in results section) above the respective threshold value and therefore represented ...

    Results from TrialIdentifier: No clinical trial numbers were referenced.


    Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


    Results from JetFighter: We did not find any issues relating to colormaps.


    Results from rtransparent:
    • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
    • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
    • No protocol registration statement was detected.

    Results from scite Reference Check: We found no unreliable references.


    About SciScore

    SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.