N-glycosylation network construction and analysis to modify glycans on the spike S glycoprotein of SARS-CoV-2

This article has been Reviewed by the following groups

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Abstract

Background

The spike S-protein of SARS-CoV-2 is N-glycosylated. The N-glycan structure and composition of this glycoprotein influence how the virus interacts with host cells.

Objective

To identify a putative N-glycan biosynthesis pathway of SARS-CoV-2 (HEK293 cell recombinant) from previously published mass spectrometric studies, and to identify what effect blocking some enzymes has on the overall glycoprotein profile. Finally, our goal was to provide the biosynthesis network, and glycans in easy-to-use format for further glycoinformatics work.

Methods

We reconstructed the glycosylation network based on previously published empirical data using GNAT, a glycosylation network analysis tool. Our compilation of the network tool had 23 glycosyltransferase and glucosidase enzymes, and could infer the pathway of glycosylation machinery based on glycans identified in the virus spike protein. Once the glycan biosynthesis pathway was generated, we simulated the effect of blocking specific enzymes - Mannosidase-II and alpha-1,6-fucosyltransferase to see how they would affect the biosynthesis network.

Results

Of the 23 enzymes, a total of 12 were involved in glycosylation of SARS-CoV-2 - Man-Ia, MGAT1, MGAT2, MGAT4, MGAT5, B4GalT, B4GalT, Man II, SiaT, ST3GalI, ST3GalVI and FucT8. Blocking enzymes resulted in a substantially modified glycan profile of the protein.

Conclusions

A network analysis of N-glycan biosynthesis of SARS-CoV-2 spike protein shows an elaborate enzymatic pathway with several intermediate glycans, along with the ones identified by mass spectrometric studies. Variations in the final N-glycan profile of the virus, given its site-specific microheterogeneity, could be a factor in the host response to the infection and response to antibodies. Here we provide all the resources generated - the glycans derived from mass spectrometry and intermediate glycans in glycoCT xml format, and the biosynthesis network for future drug and vaccine development work.

Article activity feed

  1. SciScore for 10.1101/2020.06.23.167791: (What is this?)

    Please note, not all rigor criteria are appropriate for all manuscripts.

    Table 1: Rigor

    NIH rigor criteria are not applicable to paper type.

    Table 2: Resources

    Experimental Models: Cell Lines
    SentencesResources
    Data from Zhang et al11 and Shajahan et al13 were both obtained from recombinant viral proteins expressed in HEK293 cells.
    HEK293
    suggested: CLS Cat# 300192/p777_HEK293, RRID:CVCL_0045)

    Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


    Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.

    Results from TrialIdentifier: No clinical trial numbers were referenced.


    Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


    Results from JetFighter: We did not find any issues relating to colormaps.


    Results from rtransparent:
    • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
    • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
    • No protocol registration statement was detected.

    About SciScore

    SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.

  2. SciScore for 10.1101/2020.06.23.167791: (What is this?)

    Please note, not all rigor criteria are appropriate for all manuscripts.

    Table 1: Rigor

    NIH rigor criteria are not applicable to paper type.

    Table 2: Resources

    Experimental Models: Cell Lines
    SentencesResources
    Objective: To identify a putative N-glycan biosynthesis pathway of SARS-CoV-2 (HEK293 cell recombinant) from previously published mass spectrometric studies, and to identify what effect blocking some enzymes has on the overall glycoprotein profile.
    HEK293
    suggested: CLS Cat# 300192/p777_HEK293, CVCL_0045

    Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:

    • This heterogeneity can also affect the pathway that was chosen or the enzyme that was involved in the biosynthesis, which is a limitation in our current approach.
    • Limitations: As mentioned, there are several more approaches to construct N-glycosylation pathways.
    • In addition to that, this work being computational, is preliminary, and requires further computational and basic/pre-clinical/clinical work to identify the effect of simulated outcomes.
    • We did not conduct protein dynamic modeling studies, to determine if the altered glycan affects the protein, and its downstream binding with mammalian receptors.


    Results from OddPub: We did not find a statement about open data. We also did not find a statement about open code. Researchers are encouraged to share open data when possible (see Nature blog).


    About SciScore

    SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore is not a substitute for expert review. SciScore checks for the presence and correctness of RRIDs (research resource identifiers) in the manuscript, and detects sentences that appear to be missing RRIDs. SciScore also checks to make sure that rigor criteria are addressed by authors. It does this by detecting sentences that discuss criteria such as blinding or power analysis. SciScore does not guarantee that the rigor criteria that it detects are appropriate for the particular study. Instead it assists authors, editors, and reviewers by drawing attention to sections of the manuscript that contain or should contain various rigor criteria and key resources. For details on the results shown here, including references cited, please follow this link.