Decoding protein glycosylation by an integrative mass spectrometry-based de novo sequencing strategy

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Glycoproteins, representing over 50% of human proteins and most biopharmaceuticals, are crucial for regulating various biological processes. The complexity of multiple glycosylation sites, often leading to incomplete sequence coverage and ambiguous glycan modification profiles. Here, we developed an integrative mass spectrometry-based approach for decoding unknown glycoproteins, which is featured with the combination of deglycosylation-mediated de novo sequencing with glycosylation site characterization. We utilized enzymatic deglycosylation of N -/ O -glycan to achieve comprehensive sequence coverage. Additionally, EThcD fragmentation enables the identification of high-quality long peptides, facilitating precise protein assembly. We subsequently applied this method to de novo sequencing of the highly glycosylated therapeutic fusion protein Etanercept (Enbrel ® ). We also sequenced three new tumor necrosis factor receptor (TNFR): Fc-fusion biologics with largely unknown sequences, unveiling subtle distinctions in the primary sequences. Furthermore, we characterized N - and O -glycosylation modifications of these proteins at subunit, glycopeptide, and glycan levels. This strategy bridges the gap between the de novo sequencing and glycosylation modification, providing complete information of the primary structure and glycosylation modifications for glycoproteins. Notably, our method could be a robust solution for accurate sequencing of the glycoproteins and has practical value not only in basic research but also in the biopharmaceutical industry.

Article activity feed