Accurate Predictions of Molecular Properties of Proteins via Graph Neural Networks and Transfer Learning

Spencer Wozniak
Giacomo Janson
Michael Feig

Abstract

Machine learning has emerged as a promising approach for predicting molecular properties of proteins, as it addresses limitations of experimental and traditional computational methods. Here, we introduce GSnet, a graph neural network (GNN) trained to predict physicochemical and geometric properties including solvation free energies, diffusion constants, and hydrodynamic radii, based on three-dimensional protein structures. By leveraging transfer learning, pre-trained GSnet embeddings were adapted to predict solvent-accessible surface area (SASA) and residue-specific pKa values, achieving high accuracy and generalizability. Notably, GSnet outperformed existing protein embeddings for SASA prediction, and a locally charge-aware variant, aLCnet, approached the accuracy of simulation-based and empirical methods for pKa prediction. Our GNN framework demonstrated robustness across diverse datasets, including intrinsically disordered peptides, and scalability for high-throughput applications. These results highlight the potential of GNN-based embeddings and transfer learning to advance protein structure analysis, providing a foundation for integrating predictive models into proteome-wide studies and structural biology pipelines.

Version published to 10.1101/2024.12.10.627714v1 on bioRxiv
Dec 12, 2024

Related articles

SPACE: STRING proteins as complementary embeddings

This article has 4 authors:
1. Dewei Hu
2. Damian Szklarczyk
3. Christian von Mering
4. Lars Juhl Jensen
This article has no evaluations
Latest version Nov 26, 2024

Gene Regulatory Network Inference with Joint Representation from Graph Neural Network and Single-Cell Foundation Model

This article has 4 authors:
1. Sindhura Kommu
2. Yizhi Wang
3. Yue Wang
4. Xuan Wang
This article has no evaluations
Latest version Dec 20, 2024

A deep learning method for predicting interactions for intrinsically disordered regions of proteins

This article has 3 authors:
1. Kartik Majila
2. Varun Ullanat
3. Shruthi Viswanath
This article has no evaluations
Latest version Jan 12, 2025
