Phylogenetic evidence for asparagine to aspartic acid protein editing of N-glycosylated SARS-CoV-2 viral proteins by NGLY1 deglycosylation/deamidation suggests an unusual vaccination strategy

This article has been Reviewed by the following groups

Read the full article See related articles

Abstract

Many viral proteins, including multiple SARS-CoV-2 proteins, are secreted via the endoplasmic reticulum, and viral particles are assembled and exported in ER-associated replication compartments. Viral coat proteins such as the SARS-CoV-2 Spike protein are N-glycosylated at NxS/T sites as they enter the ER. N-glycosylated sites in many eukaryotic proteins are deglycosylated by the NGLY1/PNG-1 deglycosylation enzyme which also deamidates the N-glycosylated asparagine to aspartic acid, thus editing the target protein sequence. Proteomic analysis of mammalian cell lines has revealed deamidation of many host N-glycosylated asparagines to aspartic acid by NGLY1/PNG-1 on peptides that are presented by mammalian HLA for immune surveillance. The key client protein for NGLY1/PNG-1 deglycosylation and N to D protein editing was revealed by genetic analysis of C. elegans proteasome regulation to be the intact endoplasmic reticulum-transiting SKN-1A transcription factor. Strikingly, an analysis of cancer cell genetic dependencies for growth revealed that the mammalian orthologue of SKN-1A, NRF1 (also called NFE2L1) is required by a highly correlated set of cell lines as NGLY1/PNG-1, supporting that NGLY1/PNG-1 and NRF1 act in the same pathway. NGLY1/PNG-1 edits N-glycosylated asparagines on the intact SKN-1 protein as it is retrieved by ERAD from the ER to in turn activate the transcription of target proteasomal genes. The normal requirement for NGLY1/PNG-1 editing of SKN-1A can be bypassed by a genomic substituion of N to D in four NxS/T N-glycosylation motifs of SKN-1A. Thus NGLY1/PNG-1-mediated N to D protein editing is more than a degradation step for the key client protein for proteasomal homeostasis in C. elegans or tumor growth in particular mammalian cell lines, SKN-1A/NRF1. In addition, such N to D substitutions in NxS/T N-glycosylation motifs occur in evolution: N to D substitutions are observed in phylogenetic comparisons of SKN-1A between nematode species that diverged hundreds of millions of years ago or of the vertebrate NRF1 between disparate vertebrates. Genomic N to D mutations bypass the many steps in N-glycosylation in the ER and deglycosylation-based editing of N to D, perhaps based on differences in the competency of divergent species for various N-glycosylation or deglycosylation steps.

We surveyed the N-glycosylation sites in coronavirus proteins for such phylogenetic evidence for N to D protein editing in viral life cycles, and found evidence for preferential N to D residue substitutions in NxS/T N-glycosylation sites in comparisons of the genome sequences of hundreds of coronaviruses. This suggests that viruses use NGLY1/PNG-1 in some hosts, for example humans, to edit particular N-glycosylated residues to aspartic acid, but that in other hosts, often in bats, an N to D substitution mutation in the virus genome is selected. Single nucleotide mutations in Asp or Asn codons can produce viruses with N to D or D to N substitutions that might be selected in different animal hosts from the population of viral variants produced in any previous host. NGLY1/PNG-1 has been implicated in viral immunity in mammalian cell culture, favoring this hypothesis.

Because of the phylogenetic evidence that the NGLY1/PNG-1 editing of protein sequences has functional importance for SKN-1A/NRF1 and viruses, and because most immunization protocols do not address the probable editing and functional importance of N-glycosylated aspargines to aspartic acid in normal viral infections, we suggest that immunization with viral proteins engineered to substitute D at genomically encoded NxS/T sites of N-glycosylated viral proteins that show a high frequency of N to D substitution in viral phylogeny may enhance immunological response to peptide antigens. Such genomically-edited peptides would not require ER-localization for N-glycosylation or other cell compartment localization for NGLY1/PNG-1 N to D protein editing. In addition, such N to D edited protein vaccines could be produced in bacteria since N-glycosylation and deglycosylation which do not occur in bacteria would no longer be required to immunize with a D-substituted peptide. Bacterially-expressed vaccines would be much lower cost and with fewer failure modes than attenuated viral vaccines or recombinant animal viruses produced in chicken eggs, mammalian tissue culture cells, or delivered by mRNA vectors to the patient directly. Because N to D edited peptides are clearly produced by NGLY1/PNG-1, and may be and presented by mammalian HLA, such peptides may more robustly activate T-cell killing or B-cell maturation to mediate more robust viral immunity.

Article activity feed

  1. SciScore for 10.1101/2021.09.11.459891: (What is this?)

    Please note, not all rigor criteria are appropriate for all manuscripts.

    Table 1: Rigor

    NIH rigor criteria are not applicable to paper type.

    Table 2: Resources

    Software and Algorithms
    SentencesResources
    At Spike protein position 657 in Table 1, there are 20 alignable viral proteins from coronaviruses most closely related to SARS-CoV-2, which are N at this position in 85% of the viruses, including SARS-CoV-2, and D at this position in 15% of the alignable betacoronaviruses.
    SARS-CoV-2
    suggested: (BioLegend Cat# 946101, RRID:AB_2892515)
    The resulting database was used in the BLASTP (PMID 2231712) search for the homologs of C. elegans ER proteins.
    BLASTP
    suggested: (BLASTP, RRID:SCR_001010)

    Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


    Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.

    Results from TrialIdentifier: No clinical trial numbers were referenced.


    Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


    Results from JetFighter: We did not find any issues relating to colormaps.


    Results from rtransparent:
    • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
    • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
    • No protocol registration statement was detected.

    Results from scite Reference Check: We found no unreliable references.


    About SciScore

    SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.