A Globally Representative Immunopeptidomics Approach to Identify Population-Wide Vaccine Candidates
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Vaccine development has historically relied on preclinical testing using one or two cell lines, typically of Caucasian descent, contributing to vaccines that may fail to provide adequate protection across genetically diverse populations. A major factor in these failures is the lack of consideration for human leukocyte antigen (HLA) diversity, with over 30,000 HLA alleles worldwide exhibiting distinct frequencies across ethnic groups and geographic regions. Here, we present a globally representative immunopeptidomics approach that addresses HLA diversity at the earliest stages of vaccine antigen discovery. We established a panel of 30 cell lines from the 1000 Genomes Project that represent 75% of the world’s most common HLA alleles. Using a combined computational and experimental approach, we identified 18 common binding motifs (7 MHC class I and 11 MHC class II) shared across 24 cell lines from the panel. We developed an accessible bioinformatic tool that predicts globally presentable antigenic regions from any pathogen proteome by identifying peptides matching these common binding motifs. Validation with Salmonella enterica serovar Typhimurium and SARS-CoV-2 Spike protein demonstrated that our method successfully identifies known immunogenic regions, with experimentally detected epitopes showing substantial overlap with predicted regions of interest. Notably, only three cell lines were required to validate highly immunogenic S. enterica targets, including OmpC—a porin with known 100% protective efficacy—detected across all tested lines. Our approach enables researchers to rapidly screen pathogen proteomes using freely available bioinformatic tools, with optional experimental validation requiring minimal resources. By identifying antigens with broad population coverage before clinical trials begin, this method has the potential to increase vaccine success rates while ensuring equitable protection across diverse populations. The cell line panel, common binding motifs, and bioinformatic workflow are publicly available, providing an accessible pathway for developing vaccines that truly serve global populations.