Teaching Diffusion Models Physics: Reinforcement Learning for Physically Valid Diffusion-Based Docking

J. Henry Broster
Bojana Popovic
Diana Kondinskaia
Charlotte M. Deane
Fergus Imrie

Abstract

Molecular docking aims to predict the binding conformation of a small molecule to its protein target. Recent work has proposed diffusion models for this task, from rigid-body docking that diffuses over ligand degrees of freedom to co-folding approaches that jointly generate protein structure and ligand pose. However, diffusion-based docking models have been shown to frequently produce physically implausible poses and to fail to consistently recover key protein-ligand interactions. To address this, we introduce a reinforcement learning framework for training diffusion-based docking models directly on non-differentiable objectives. Fine-tuning DiffDock-Pocket for physical validity with our approach substantially increases the number of generated poses that are physically valid and interaction-preserving, with no increase in inference-time compute. Importantly, this comes without sacrificing structural accuracy; in fact, our approach increases the proportion of structures with near-native poses. These effects are most pronounced for protein targets that are dissimilar to the training data. Our fine-tuned DiffDock-Pocket model outperforms both classical docking algorithms and machine learning-based approaches on the PoseBusters set. Our results demonstrate that reinforcement learning can teach diffusion-based docking models to better respect physical constraints and recover key interactions, without relying on inference-time corrections.
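
The abstract does not say which reinforcement learning algorithm is used, so the following is not the authors' method. It is a minimal toy sketch, under stated assumptions, of why a non-differentiable pass/fail objective can still drive training: a score-function (REINFORCE) estimator only needs the gradient of the sampler's log-probability, never the gradient of the reward. The one-dimensional Gaussian sampler, the `is_valid` check, the `train` function, and all parameter values are hypothetical stand-ins for a generative model and a physical-validity test.

```python
import random


def is_valid(x, target=2.0, tol=0.5):
    """Hypothetical pass/fail reward: 1.0 inside the accepted window, else 0.0.

    This step function has zero gradient almost everywhere, so it cannot be
    optimized by backpropagating through the reward.
    """
    return 1.0 if abs(x - target) < tol else 0.0


def train(steps=200, batch=256, lr=0.5, sigma=1.0, seed=0):
    """Score-function (REINFORCE) ascent on the mean of a Gaussian sampler."""
    rng = random.Random(seed)
    mu = 0.0  # initial sampler mean, far from the accepted window
    for _ in range(steps):
        samples = [rng.gauss(mu, sigma) for _ in range(batch)]
        rewards = [is_valid(x) for x in samples]
        baseline = sum(rewards) / batch  # batch-mean baseline reduces variance
        # The gradient of log N(x; mu, sigma^2) with respect to mu
        # is (x - mu) / sigma^2; the reward only enters as a weight.
        grad = sum(
            (r - baseline) * (x - mu) / sigma**2
            for x, r in zip(samples, rewards)
        ) / batch
        mu += lr * grad  # gradient ascent on the expected reward
    return mu


if __name__ == "__main__":
    print(train())
```

In this toy setting the sampler mean drifts toward the window that the check accepts, and sampling afterwards costs the same as before training, which mirrors the abstract's point that validity is learned rather than enforced by inference-time corrections.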

Version published to 10.64898/2026.03.25.714128 on bioRxiv
Mar 27, 2026

