A universal pipeline MosaicProt enables large-scale modeling and detection of chimeric protein sequences for studies on programmed ribosomal frameshifting
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Peptides and proteins produced by programmed ribosomal frameshifting (PRF) are well-known in viruses. In non-viral systems, only a few examples of such chimeric sequences have been documented until recently. Two new studies, one in humans and one in plants, showed that chimeric peptides are numerous and diverse. In humans, their discovery was possible due to focusing on sequences with naturally repeated codons. This way, many candidate sequences with mass spectrometry (MS) proteomics-based support for translation have been identified. In the plant study, our group discovered MS-validated chimeric peptides using a unique modeling algorithm, which is described and made available here. Our pipeline enables the identification of chimeric peptides in any organism for which transcript sequences and MS proteomic data are available. By design, our approach does not require prior knowledge about sequence similarity to already characterized PRF sites and can detect forward and backward frameshifts by 1 and 2 nucleotides. Thus, our pipeline opens a path for uncovering previously unknown PRF events across various transcript types, potentially broadening our understanding of proteome diversity.