Predicting Experimental Success in De Novo Binder Design: A Meta-Analysis of 3,766 Experimentally Characterised Binders

Max D. Overath
Andreas S. H. Rygaard
Christian P. Jacobsen
Valentas Brasas
Oliver Morell
Pietro Sormanni
Timothy P. Jenkins

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Designing high-affinity de novo protein binders has become increasingly tractable, yet in vitro prioritisation continues to depend on heuristics in the absence of systematic analysis. Here, we present a large-scale meta-analysis of 3,766 experimentally tested binders across 15 structurally diverse targets. Using a unified, high-throughput pipeline that re-predicts each binder-target complex with AF2 (initial guess and ColabFold), AF3 and Boltz-1, we extract over 200 structural, energetic and confidence features per design. We show that interface-focused metrics, most notably the AF3-derived interaction prediction Score from Aligned Errors (ipSAE) outperform commonly used scores such as ipAE and ipTM, with a significant 1.4-fold increase in average precision compared to ipAE. We further show that combining these metrics with orthogonal physicochemical interface descriptors, including Rosetta ΔG/ΔSASA and interface shape complementarity, improves predictive performance. While overall per-formance varies by target, simple linear models trained on a small number of AF3-derived features generalize well across datasets. We propose interpretable, target-agnostic filtering strategies, such as combining AF3 ipSAE_min rankings with structural filters, to improve precision in selecting binders for testing. Finally, we release the complete dataset establishing a community resource to benchmark and accelerate de novo binder discovery.

Version published to 10.1101/2025.08.14.670059 on bioRxiv
Aug 14, 2025

Integrating Evolutionary and Compositional Features with ML and DL for Robust and Interpretable Druggable Protein Prediction

This article has 5 authors:
1. Mujeebu Rehman
2. Qinghua Liu
3. Muhammad Javed
4. Ali Ghulam
5. Teerath Kumar
This article has no evaluationsLatest version Dec 11, 2025
Predictive Bioactivity Modeling and Structural Binding Analysis for the Identification of Potential SMYD3 Modulators

This article has 4 authors:
1. Abdullah R. Alzahrani
2. Zia Ur Rehman
3. Talha Jawaid
4. Abida Khan
This article has no evaluationsLatest version Jan 28, 2026
Discovery of β-Sheet Peptide Assembly Codes via an Experimentally Validated Predictive Computational Platform

This article has 4 authors:
1. Wei Han
2. Hang Zheng
3. Ke Huang
4. Chi-Sing Lee
This article has no evaluationsLatest version Jan 14, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Integrating Evolutionary and Compositional Features with ML and DL for Robust and Interpretable Druggable Protein Prediction

Predictive Bioactivity Modeling and Structural Binding Analysis for the Identification of Potential SMYD3 Modulators

Discovery of β-Sheet Peptide Assembly Codes via an Experimentally Validated Predictive Computational Platform