Can We Extract Physics-like Energies from Generative Protein Diffusion Models?

Sudeep Sarma
Harrison Truscott
Da Xu
Kendall Reid
Lee-Shin Chu
Jacky Chen
Jeffrey J. Gray

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Diffusion models have emerged as the state-of-the-art method in generative artificial intelligence (AI) and have shown great success in image synthesis, video generation, molecular design, and protein structure prediction. For biophysical problems, such as protein folding and association, a fundamental question in diffusion-based methods is how their learned functions correspond to thermodynamics. In this paper, we study diffusion models through the lens of theoretical biophysics, analyzing their underlying formulation of potentials and exploring their applications in scoring protein interactions. We develop simple theories rooted in statistical physics that relate thermodynamic potentials to the negative log of the probability of observing a system in a particular state. We include dimensional analysis of diffusion model equations and a table mapping AI and physics jargon. We then test a diffusion model’s ability to capture learned energies as negative log-likelihood values, − log p ₀ ( x ₀ ), by integrating over the diffusion-generated path or a probability flow path. We test these integrals on a simple 1D Gaussian mixture diffusion model and a protein-docking diffusion model, DFMDock. In the 1D case, we find that integration over both diffusion and flow paths can accurately recover ground truth probabilities. When we extract the learned docking energies for cases where DFMDock succeeds, we observe energy funnels with the minimum energy near the experimental docked structure, like those we observe with Rosetta, an empirically tuned physics-based biomolecular modeling suite. The learned energy performs comparably or outperforms Rosetta interface energy in 6 out of 25 cases at ranking the correctness of docked poses. These data show that we can extract a relevant learned energy function from a diffusion model and compare it to physical energy functions.

Version published to 10.1101/2025.11.28.690021 on bioRxiv
Nov 29, 2025

The Information Dynamics of Generative Diffusion

This article has 2 authors:
1. Dejan Stančević
2. Luca Ambrogioni
This article has no evaluationsLatest version Jan 5, 2026
The Physics of Thought: Reasoning as Thermodynamic Relaxation in Generative Models

This article has 3 authors:
1. Mahule Roy
2. Omar Abudayyeh
3. Subhas Roy
This article has no evaluationsLatest version Dec 24, 2025
Are Energy and Forces Really Enough? Using Structure to Evaluate the Accuracy and Transferability of Machine Learning Potentials of Biomolecules

This article has 3 authors:
1. Lejla S. Biberić
2. Nisarg Joshi
3. Jim Pfaendtner
This article has no evaluationsLatest version Jan 14, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

The Information Dynamics of Generative Diffusion

The Physics of Thought: Reasoning as Thermodynamic Relaxation in Generative Models

Are Energy and Forces Really Enough? Using Structure to Evaluate the Accuracy and Transferability of Machine Learning Potentials of Biomolecules