DockFormer: Affinity Prediction and Flexible Docking with Pair Transformer
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Protein-small molecule interactions, or receptor-ligand interactions, are essential for understanding biological processes and advancing drug design. Despite advancements, existing prediction models of these interactions still lack the capabilities and accuracy needed to replace traditional screening. In this paper, we introduce DockFormer, a method that leverages multimodal learning to predict both the binding affinity and structure of these interactions. DockFormer employs fully flexible docking, where no part of the receptor remains rigid, by adapting the AlphaFold2 architecture. Instead of relying on protein sequences and multiple sequence alignments, DockFormer uses predicted receptor structures as input. This modification enables the model to concentrate on ligand docking prediction, rather than protein folding, while preserving full receptor flexibility. The streamlined design also reduces the model size to just eight layers, compared to AlphaFold2's 48 layers, greatly accelerating the inference process and making it more efficient for large-scale screening. When evaluated on affinity benchmarks such as CASF-2016, PLINDER, and the recently released CASP16 ligand screening benchmark, DockFormer performs comparably to or better than state-of-the-art methods, which typically rely on templates or bound structures as input. On structural benchmarks such as PoseBusters and PLINDER, DockFormer demonstrated success rates of 20% and 15%, respectively.