Simple baselines rival protein language models in mutation-dense design of function tasks

Itay Talpir
Sarel J. Fleishman

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Computational protein design demands generally applicable models that reliably predict or generate unmeasured variants with superior functional properties. Although protein language models (pLMs) have been used in zero-shot and transfer-learning design studies, they have generally not been assessed in benchmarks that explicitly test combinatorial extrapolation from lower- to higher-order variants. Here we benchmark widely used pLMs against conventional baseline methods in recently described dense, experimentally validated multi-mutant landscapes. We find that regardless of architecture and parameter count, pLMs are statistically similar to one another, and none consistently outperforms conventional baseline methods. Furthermore, their ability to distinguish functional from non-functional variants in zero-shot prediction is comparable to that of conventional homology-based methods. We suggest that to contribute significantly to the design of protein function, pLMs may need to encode biophysical and structural priors or be combined with structure-based approaches.

Version published to 10.64898/2026.05.01.722313 on bioRxiv
May 6, 2026

Unified sampling framework and experimental benchmarking of sequence- and structure-based protein models

This article has 8 authors:
1. Aviv Spinner
2. Pascal Notin
3. Samuel Berry
4. Dana Cortade
5. Zach Sisson
6. Svetlana Ikonomova
7. David Ross
8. Debora Marks
This article has no evaluationsLatest version May 12, 2026
Cross-Attention Over RNA And Protein Sequences Enables Generalizable Interaction Prediction

This article has 7 authors:
1. Mario Catalano
2. Gerardo Pepe
3. Gabriele Ausiello
4. Claire McWhite
5. Giorgio Gambosi
6. Manuela Helmer Citterich
7. Pier Federico Gherardini
This article has no evaluationsLatest version Apr 23, 2026
CombinGym: a benchmark platform for machine learning-assisted design of combinatorial protein variants

This article has 8 authors:
1. Yongcan Chen
2. Lihao Fu
3. Xuchao Lu
4. Wenzhuo Li
5. Yuan Gao
6. Yibo Wang
7. Zhicheng Ruan
8. Tong Si
This article has no evaluationsLatest version Mar 25, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Unified sampling framework and experimental benchmarking of sequence- and structure-based protein models

Cross-Attention Over RNA And Protein Sequences Enables Generalizable Interaction Prediction

CombinGym: a benchmark platform for machine learning-assisted design of combinatorial protein variants