Learning Protein Representations with Conformational Dynamics
Abstract
Proteins change shape as they work, and these changing states control whether binding sites are exposed, signals are relayed, and catalysis proceeds. Most protein language models pair a sequence with a single structural snapshot, which can miss state-dependent features central to interaction, localization, and enzyme activity. Studies also indicate that many proteins assume multiple, functionally relevant shapes, motivating approaches that learn from this variability. Here we present DynamicsPLM, a protein language model conditioned on ensembles of computationally generated conformations to derive state-aware representations. DynamicsPLM improves predictive performance across protein–protein interaction, subcellular localization, enzyme classification, and metal-ion binding. On a widely used protein–protein interaction benchmark, it achieves a four-point accuracy gain over the strongest baseline; on a curated test set enriched for proteins with multiple conformational states, the margin increases to eleven points. These findings argue for a shift from static to dynamics-aware modeling, in which conformational variability is treated as informative rather than as noise. By elevating conformational state to a central element of machine learning in protein biology, this work advances modeling toward mechanisms that better reflect how proteins operate in cells and provides a route to actionable hypotheses about when and how binding, signaling, and catalysis occur.
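To make the core idea concrete, the sketch below shows one simple way a model could consume an ensemble of conformations rather than a single snapshot: encode each conformation separately, then pool across the ensemble so that both the typical state (mean) and the conformational variability (standard deviation) enter the representation. The `encode_conformation` function is a hypothetical placeholder, not the encoder used by DynamicsPLM, whose architecture is not described in this abstract.

```python
import numpy as np

rng = np.random.default_rng(0)

def encode_conformation(coords):
    # Placeholder structure encoder (an assumption for illustration):
    # summarize each residue by its mean pairwise C-alpha distance.
    # A real model would use a learned structure encoder instead.
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.sqrt((diff ** 2).sum(-1))          # (L, L) distance map
    return dist.mean(axis=1, keepdims=True)      # (L, 1) per-residue feature

def ensemble_representation(conformations):
    # Encode every snapshot, then pool across the ensemble:
    # the mean captures the typical state, the std captures flexibility,
    # so conformational variability is treated as signal, not noise.
    feats = np.stack([encode_conformation(c) for c in conformations])  # (K, L, 1)
    return np.concatenate([feats.mean(0), feats.std(0)], axis=-1)      # (L, 2)

# Toy ensemble: 5 synthetic conformations of a 50-residue chain.
ensemble = [rng.normal(size=(50, 3)) for _ in range(5)]
rep = ensemble_representation(ensemble)
print(rep.shape)  # (50, 2)
```

In practice the pooled ensemble features would be fused with sequence embeddings from the language model; the pooling step is what lets downstream tasks see state-dependent information that a single structure cannot provide.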