Sampling and ranking of protein conformations using machine learning techniques do not improve the quality of rigid protein-protein docking

Roman Kyrylenko
Ihor Koleiev
Illia Savchenko
Taras Voitsitskyi
Roman Stratiichuk
Vladyslav Husak
Semen Yesylevskyy
Serhii Starosyla
Alan Nafiiev

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Rigid docking remains the most popular method of predicting protein-protein interactions in cases when experimental 3D structures of the complexes are not available. The docking often relies on known unbound (Apo) protein structures, which may differ significantly from their bound (Holo) forms. Modern machine learning (ML) based conformational sampling techniques allow generating ensembles of functionally relevant protein structures, which may be closer to their Holo forms and thus could improve the outcomes of the classical rigid protein-protein docking. Here, we sampled conformations of the protein subunits in 30 complexes from the novel PINDER dataset with two state-of-the-art ML-based techniques and evaluated their docking performance using several physics-based, data-based, and ML-based scoring functions. We showed that such conformational sampling rarely produces structures that are closer to the Holo conformations than the corresponding Apo ones. Moreover, even when such conformations are generated, none of the tested scoring functions were able to prioritize and rank them correctly. Our work highlights critical limitations in the current ML-enhanced rigid protein-protein docking workflows and emphasizes the need for new approaches that can better utilize the potential of modern techniques for conformational generation and scoring.

Version published to 10.1101/2025.05.13.652389 on bioRxiv
May 16, 2025

The Evolution of the AlphaFold Architecture

This article has 1 author:
1. Y.C.B.J. Dissanayaka
This article has no evaluationsLatest version Jan 9, 2026
Feature-Optimized Machine Learning Benchmarking for Protein Interface Prediction in Permanent Homodimer Complexes with Distinct Structural Features

This article has 4 authors:
1. Tayyip Topuz
2. Zeki Erdem
3. Halil Bisgin
4. E. Demet Akten
This article has no evaluationsLatest version Feb 2, 2026
Are Energy and Forces Really Enough? Using Structure to Evaluate the Accuracy and Transferability of Machine Learning Potentials of Biomolecules

This article has 3 authors:
1. Lejla S. Biberić
2. Nisarg Joshi
3. Jim Pfaendtner
This article has no evaluationsLatest version Jan 14, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

The Evolution of the AlphaFold Architecture

Feature-Optimized Machine Learning Benchmarking for Protein Interface Prediction in Permanent Homodimer Complexes with Distinct Structural Features

Are Energy and Forces Really Enough? Using Structure to Evaluate the Accuracy and Transferability of Machine Learning Potentials of Biomolecules