Codon arrangement modulates MHC-I peptides presentation: implications for a SARS-CoV-2 peptide-based vaccine

This article has been Reviewed by the following groups

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Abstract

Among various vaccination strategies, peptide-based vaccines appear as excellent candidates because they are cheap to produce, are highly stable and harbor low toxicity. However, predicting which MHC-I Associated Peptide (MAP) will ultimately reach cell surface remains challenging, due to high false discovery rates. Previously, we demonstrated that synonymous codon arrangement (usage and placement) is predictive of, and modulates MAP presentation. Here, we apply CAMAP (Codon Arrangement MAP Predictor), the artificial neural network we used to unveil the role of codon arrangement in MAP presentation, to predict SARS-CoV MAPs. We report that experimentally identified SARS-CoV-1 and SARS-CoV-2 MAPs are associated with significantly higher CAMAP scores. Based on CAMAP scores and binding affinity, we identified 48 non-overlapping MAP candidates for a peptide-based vaccine, ensuring coverage for a high proportion of HLA haplotypes in the US population (>78%) and SARS-CoV-2 strains (detected in >98% of SARS-CoV-2 strains present in the GISAID database). Finally, we built an interactive web portal ( https://www.epitopes.world ) where researchers can freely explore CAMAP predictions for SARS-CoV-1/2 viruses. Collectively, we present an analysis framework that can be generalizable to empower the rapid identification of virus-specific MAPs, including in the context of an emergent virus, to help accelerate target identification for peptide-based vaccine designs that could be critical in safely attaining group immunity in the context of a global pandemic.

Article activity feed

  1. SciScore for 10.1101/2021.02.04.429819: (What is this?)

    Please note, not all rigor criteria are appropriate for all manuscripts.

    Table 1: Rigor

    NIH rigor criteria are not applicable to paper type.

    Table 2: Resources

    Software and Algorithms
    SentencesResources
    SARS-CoV-1/2 sequences: Genetic coding sequences of SARS-CoV-1/2 viruses were extracted from NCBI (NC_004718.3 and NC_045512.2), and parsed using the BioPython library31.
    BioPython
    suggested: (Biopython, RRID:SCR_007173)
    These consisted of 42,109 strains that were aligned to the reference SARS-CoV-2 genome from NCBI (NC_045512.2) using MUSCLE v3.8.31 with default parameters and the BioPython library.
    MUSCLE
    suggested: (MUSCLE, RRID:SCR_011812)

    Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


    Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.

    Results from TrialIdentifier: No clinical trial numbers were referenced.


    Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


    Results from JetFighter: We did not find any issues relating to colormaps.


    Results from rtransparent:
    • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
    • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
    • No protocol registration statement was detected.

    About SciScore

    SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.