Exploring Log-Likelihood Scores for Ranking Antibody Sequence Designs

Talip Uçar
Cedric Malherbe
Ferran Gonzalez

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Generative models trained on antibody sequences and structures have shown great potential in advancing machine learning-assisted antibody engineering and drug discovery. Current state-of-the-art models are primarily evaluated using two categories of in silico metrics: sequence-based metrics, such as amino acid recovery (AAR), and structure-based metrics, including root-mean-square deviation (RMSD), predicted alignment error (pAE), and interface predicted template modeling (ipTM). While metrics such as pAE and ipTM have been shown to be useful filters for experimental success, there is no evidence that they are suitable for ranking, particularly for antibody sequence designs. Furthermore, no reliable sequence-based metric for ranking has been established. In this work, using real-world experimental data from fourteen diverse datasets, we extensively benchmark a range of generative models, including LLM-style, diffusion-based, and graph-based models. We show that log-likelihood scores from these generative models have promising correlation with experimentally measured binding affinities, suggesting that log-likelihood can potentially serve as a reliable metric for ranking antibody sequence designs. Additionally, we scale up one of the diffusion-based models by training it on a large and diverse synthetic dataset, significantly enhancing its ability to rank antibodies based on their binding affinities. We also evaluate non–log-likelihood-based metrics on ten datasets and find that, while they are less consistent for ranking, they provide complementary information. Structure-, energy-, and sequence-based scores appear to be orthogonal and may be used together to increase the likelihood of experimental success. Our implementation is available at: https://github.com/AstraZeneca/DiffAbXL

Version published to 10.1101/2024.10.07.617023 on bioRxiv
Oct 11, 2024

Integrating Evolutionary and Compositional Features with ML and DL for Robust and Interpretable Druggable Protein Prediction

This article has 5 authors:
1. Mujeebu Rehman
2. Qinghua Liu
3. Muhammad Javed
4. Ali Ghulam
5. Teerath Kumar
This article has no evaluationsLatest version Dec 11, 2025
Rebuilding the Antibiotic Pipeline with Guided Generative Models

This article has 3 authors:
1. Shriya Bhat
2. Rishab Jain
3. Wesley Greenblatt
This article has no evaluationsLatest version Jan 16, 2026
Drug discovery guided by maximum drug likeness

This article has 3 authors:
1. Hao-Yu Zhu
2. Lu Xu
3. Wei Shi
This article has no evaluationsLatest version Dec 31, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Integrating Evolutionary and Compositional Features with ML and DL for Robust and Interpretable Druggable Protein Prediction

Rebuilding the Antibiotic Pipeline with Guided Generative Models

Drug discovery guided by maximum drug likeness