Language models learn to represent antigenic properties of human influenza A(H3) virus


Abstract

Because influenza vaccine effectiveness depends on a good antigenic match between the vaccine and circulating viruses, the antigenic properties of newly emerging variants must be assessed continuously. With the increasing application of real-time pathogen genomic surveillance, a key question is whether antigenic properties can be reliably predicted from influenza virus genomic information. Trained on validated, linked datasets of influenza virus genomic and wet-lab experimental results, in silico models could learn to predict the immune escape of variants of interest from the protein sequence alone. In this study, we compared several machine-learning methods for reconstructing antigenic map coordinates from HA1 protein sequences of influenza A(H3N2) virus, ranking substitutions responsible for major antigenic changes, and recognizing variants with novel antigenic properties that may warrant future vaccine updates. Methods based on deep-learning language models (BiLSTM and ProtBERT) outperformed more classical approaches that predict solely from genetic distances and physicochemical properties of amino acid sequences, particularly for fine-grained tasks such as detecting antigenic change driven by a single amino acid substitution and ranking, through in silico deep mutational scanning, the substitutions with the largest impact on antigenic properties. Because the best-performing model produces protein embeddings in a pathogen-agnostic way, the presented approach may be applicable to other pathogens.
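To make the embedding-based workflow described in the abstract concrete, the sketch below shows one plausible way to map HA1 protein sequences to antigenic map coordinates with ProtBERT. It is not the authors' code: the HuggingFace checkpoint "Rostlab/prot_bert", the mean-pooling step, the ridge-regression prediction head, and the toy `sequences`/`coords` data are all assumptions for illustration only.

```python
# Minimal sketch (assumed workflow, not the study's implementation):
# embed HA1 sequences with ProtBERT, then regress the mean-pooled embeddings
# onto 2-D antigenic map coordinates and probe a single substitution.
import torch
from transformers import BertModel, BertTokenizer
from sklearn.linear_model import Ridge

tokenizer = BertTokenizer.from_pretrained("Rostlab/prot_bert", do_lower_case=False)
model = BertModel.from_pretrained("Rostlab/prot_bert")
model.eval()

def embed(sequence: str) -> torch.Tensor:
    """Return a mean-pooled ProtBERT embedding for one amino acid sequence."""
    spaced = " ".join(sequence)                      # ProtBERT expects space-separated residues
    inputs = tokenizer(spaced, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state   # shape (1, seq_len, 1024)
    return hidden.mean(dim=1).squeeze(0)             # shape (1024,)

# Hypothetical toy data: HA1 fragments paired with antigenic map (x, y) coordinates.
sequences = ["QKLPGNDNSTATLCLGHHAVPN", "QKIPGNDNSTATLCLGHHAVPN"]
coords = [[0.0, 0.0], [1.8, 0.4]]

X = torch.stack([embed(s) for s in sequences]).numpy()
head = Ridge(alpha=1.0).fit(X, coords)

# Toy "in silico mutational scan": predicted coordinate shift for one substitution.
mutant = sequences[0][:3] + "V" + sequences[0][4:]   # hypothetical single substitution
shift = head.predict(embed(mutant).numpy().reshape(1, -1)) - head.predict(X[:1])
print("Predicted antigenic displacement:", shift)
```

In a full analysis, the same shift computation would be repeated over every position and amino acid to rank substitutions by their predicted antigenic impact, which is the kind of in silico deep mutational scanning the abstract refers to.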
