Geometric deep learning improves generalizability of MHC-bound peptide predictions

Dario F. Marzella
Giulia Crocioni
Tadija Radusinović
Daniil Lepikhov
Heleen Severin
Dani L. Bodor
Daniel T. Rademaker
ChiaYu Lin
Sonja Georgievska
Nicolas Renaud
Amy L. Kessler
Pablo Lopez-Tarifa
Sonja I. Buschow
Erik Bekkers
Li C. Xue

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The interaction between peptides and major histocompatibility complex (MHC) molecules is pivotal in autoimmunity, pathogen recognition and tumor immunity. Recent advances in cancer immunotherapies demand for more accurate computational prediction of MHC-bound peptides. We address the generalizability challenge of MHC-bound peptide predictions, revealing limitations in current sequence-based approaches. Our structure-based methods leveraging geometric deep learning (GDL) demonstrate promising improvement in generalizability across unseen MHC alleles. Further, we tackle data efficiency by introducing a self-supervised learning approach on structures (3D-SSL). Without being exposed to any binding affinity data, our 3D-SSL outperforms sequence-based methods trained on ~90 times more data points. Finally, we demonstrate the resilience of structure-based GDL methods to biases in binding data on an Hepatitis B virus vaccine immunopeptidomics case study. This proof-of-concept study highlights structure-based methods’ potential to enhance generalizability and data efficiency, with possible implications for data-intensive fields like T-cell receptor specificity predictions.

Version published to 10.1038/s42003-024-07292-1
Dec 19, 2024
Version published to 10.21203/rs.3.rs-3924124/v1 on Research Square
Mar 26, 2024

Multi-Modal Ensemble Learning for TLR4 Binding Prediction: Addressing Data Scarcity and Leakage in Small Molecule Drug Discovery

This article has 3 authors:
1. Brandon Yee
2. Maximilian Rutkowski
3. Wilson Collins
This article has no evaluationsLatest version Jan 28, 2026
LinkerMind: An Interpretable, Mechanism-Informed Deep Learning Framework for the De Novo Design of Antibody Drug Conjugate Linkers

This article has 1 author:
1. Martins Otun
This article has no evaluationsLatest version Dec 19, 2025
Reinforcement Learning-Augmented ProteinMPNN Improve the Binding Affinity of TNFR1-Targeting Minibinders

This article has 10 authors:
1. Zigong Wei
2. Lin Wei
3. Zhiyong Wu
4. Yang Hu
5. Yihe Fang
6. Miaomiao Geng
7. Banbin Xing
8. Jun Weng
9. Song Liu
10. Ke Ming
This article has no evaluationsLatest version Jan 21, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Multi-Modal Ensemble Learning for TLR4 Binding Prediction: Addressing Data Scarcity and Leakage in Small Molecule Drug Discovery

LinkerMind: An Interpretable, Mechanism-Informed Deep Learning Framework for the De Novo Design of Antibody Drug Conjugate Linkers

Reinforcement Learning-Augmented ProteinMPNN Improve the Binding Affinity of TNFR1-Targeting Minibinders