ProteinFlux: accurate, rapid and scalable generative prediction of protein dynamics driven by post-translational modifications

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

The function of proteins, the building blocks of life, in health and disease depends not only on their 3D-conformational states but most importantly on the dynamic transition between states controlled by a wide array of post-translational modifications (PTMs). Recent major advances have been made in our ability to predict static 3D structures; however, understanding and predicting the impact of PTMs on protein conformational dynamics remains a major question and challenge in the field. Molecular dynamics (MD) simulation remains the major computational approach for studying protein dynamics. However, the high computational cost, lack of integration of PTMs as conditioning inputs and inefficient generation of continuous protein dynamics largely precludes PTM-regulated conformational dynamics and the study of slow conformational processes. To address this critical bottleneck, we developed ProteinFlux, a flow-matching generative framework that links PTM-conditioned conformational dynamics to evolutionary constraints encoded by PTM sites. Evolutionary information plays a critical role in capturing conformational dynamics beyond sequence identity, and PTM sites inherently encode evolutionary constraints critical to protein functional regulation. We therefore built FluxSite, a dual-modal PTM site predictor that integrates sequence evolutionary information and 3D structural features to generate a continuous conditional signal encoding conservation and functional importance for each predicted site. FluxSite achieves robust generalization across 18 PTM types and 30 disease-associated proteomes. ProteinFlux generates phosphorylation-conditioned, all-atom conformational trajectories across diverse protein fold classes, faithfully reproducing both thermodynamic properties such as free energy landscapes and kinetic features such as conformational transition pathways. It outperforms state-of-the-art predictors while achieving inference speeds several orders of magnitude faster than traditional MD. In addition, we introduce DynaMo-phos, a benchmark dataset of phosphorylated protein MD simulations. Together, ProteinFlux, FluxSite and DynaMo-phos provide a scalable, high-throughput platform for elucidating PTM-driven conformational mechanisms, with potential applications across allosteric drug design, functional annotation of disease-associated modifications and mechanism-guided therapeutic development.

Article activity feed