BBATProt : A Framework Predicting Biological Function with Enhanced Feature Extraction via Explainable Deep Learning

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Accurately predicting the functions of peptides and proteins from their amino acid sequences is essential for understanding life processes and advancing biomolecule engineering. Due to the time-consuming and resource-intensive nature of experimental procedures, computational approaches, especially those based on machine learning frameworks, have garnered significant interest. However, many existing machine learning tools are limited to specific tasks and lack adaptability across different predictions. Here we propose a versatile framework BBATProt for the prediction of various protein and peptide functions. BBATProt employs transfer learning with a pre-trained Bidirectional Encoder Representations from Transformers (BERT) model, to effectively capture high-dimensional features from amino acid sequences. The whole custom-designed network, integrating Bidirectional Long Short-Term Memory (Bi-LSTM) and Temporal Convolutional Networks (TCN), can align with the spatial characteristics of proteins. It combines local and global feature extraction through attention mechanisms for precise functional prediction. This approach ensures that key features are adaptively extracted and balanced across diverse tasks. Comprehensive evaluations show BBATProt outperforms state-of-the-art models in predicting functions like hydrolytic catalysis, activity of peptides, and post-translational modification sites. Visualizations of feature evolution and refinement via attention mechanisms validate the framework’s interpretability, providing transparency into the evolutional process and offering deeper insights into function prediction.

Article activity feed