Hybrid Gated Fusion: A Multimodal Deep Learning Framework for Protein Function Annotation

Zijian Zhou
Daniel WA Buchan

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Protein function annotation requires integrating diverse biological signals, yet existing multimodal methods often struggle with missing inputs and redundant information. We present Hybrid Gated Fusion, a multimodal architecture that combines intrinsic protein features, including sequence and structure, with extrinsic functional context from text and interaction networks. Rather than weighting all modalities equally, the model uses bilinear gating to assess both the informativeness of each modality and its agreement with the others, while auxiliary supervision reduces modality dominance and preserves useful signal in weaker modalities. On the CAFA3 benchmark, a single Hybrid Gated Fusion model achieves state-of-the-art performance in Biological Process ( F _max = 0.601) and Cellular Component ( F _max = 0.706), while remaining competitive in Molecular Function ( F _max = 0.702). Analysis of the learned gates shows that interaction networks and text often provide complementary functional signals, whereas structural features are down-weighted when redundant but remain valuable under sparse-input settings. These results establish Hybrid Gated Fusion as a robust and scalable framework for genome-scale protein function annotation.

Availability and implementation

Source code and reproduction scripts are freely available at https://github.com/psipred/PFP . Pre-computed embeddings, data splits, and model checkpoints are deposited at https://doi.org/10.5281/zenodo.19498341 .

Version published to 10.64898/2026.04.14.718564 on bioRxiv
Apr 17, 2026

BiLSTM-Powered Bilinear Attention for Protein–Ligand Prediction

This article has 4 authors:
1. Chih-Yang Cheng
2. Yi-An Chen
3. Feng-Yin Li
4. Suyong Re
This article has no evaluationsLatest version May 13, 2026
ModCRE-NN: Interpretable Deep Learning Harnesses Structural and Evolutionary Synergy to Predict Transcription Factor Binding Specificity

This article has 8 authors:
1. Victor Méndez-Riosalido
2. Patrick Gohl
3. Patricia M. Bota
4. Eric Kramer
5. Alberto Meseguer
6. Oriol Gallego
7. Narcis Fernandez-Fuentes
8. Baldo Oliva
This article has no evaluationsLatest version May 29, 2026
NeuroCDS: Integrating Local and Global Neural Network Representations via Structural Constrained Viterbi Decoding for Robust CDS Annotation

This article has 4 authors:
1. Ziheng Mei
2. Zhexi Xie
3. Lingrui Wu
4. Chao Wei
This article has no evaluationsLatest version May 2, 2026

Discuss this preprint

Listed in

Abstract

Availability and implementation

Article activity feed

Related articles

BiLSTM-Powered Bilinear Attention for Protein–Ligand Prediction

ModCRE-NN: Interpretable Deep Learning Harnesses Structural and Evolutionary Synergy to Predict Transcription Factor Binding Specificity

NeuroCDS: Integrating Local and Global Neural Network Representations via Structural Constrained Viterbi Decoding for Robust CDS Annotation