Accelerating Virtual Directed Evolution of Proteins via Reinforcement Learning

Tianyu Mi
Yuxiang Wang
Jingyu Zhao
Wanze Wang
Yunhao Shen
Nan Xiao
Ligong Chen
Guo-Qiang Chen
Shuyi Zhang
Wen-Bin Zhang
Haipeng Gong

Abstract

With the advancement of machine learning methods, the protein fitness landscape can be predicted, providing reliable guidance in the selection of advantageous mutations for the directed evolution of proteins. However, the potential multiple mutational variants derived from the simple combination of a limited number of advantageous single mutations may not represent superior choices. Moreover, the exploration and selection of the astronomical number of multiple mutational variants remain a highly challenging task. In this study, we introduce a virtual directed evolution pipeline, RelaVDEP, for the rapid identification of mutational variants with explicit enhancement in the desired property of the target protein. By adapting and fine-tuning a pre-trained fitness predictor to improve sequence-based protein functional prediction and by designing a model-based reinforcement learning framework to efficiently explore the vast combinatorial space of protein mutations, this pipeline is capable of effectively accelerating the directed evolution process for a broad spectrum of proteins with versatile functional profiles. According to a series of experimental validations, the diversified mutational variants identified by our method exhibit notable improvements in desirable protein functional properties. In particular, by integrating RelaVDEP with active learning, we successfully optimized the sequence of a PETase, enhancing its catalytic activity through previously unknown mutations.
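
To give a sense of why the number of multiple mutational variants is described as astronomical, the count can be worked out directly: a protein of length L has C(L, k) * 19^k variants that differ from the wild type at exactly k positions, assuming the 20 standard amino acids. The short Python sketch below is only an illustration of that arithmetic and is not part of RelaVDEP; the function name `count_variants` and the sequence length of 300 are hypothetical choices made for the example.

```python
from math import comb


def count_variants(length: int, k: int, alphabet: int = 20) -> int:
    """Count sequences that differ from the wild type at exactly k positions.

    Choose which k positions mutate (comb(length, k)), then pick one of the
    (alphabet - 1) non-wild-type residues at each of those positions.
    """
    return comb(length, k) * (alphabet - 1) ** k


if __name__ == "__main__":
    # Hypothetical protein length, chosen only for illustration.
    length = 300
    for k in range(1, 6):
        print(f"{k}-point variants: {count_variants(length, k):,}")
```

Under these assumptions the count grows by several orders of magnitude with each additional mutation, which is why exhaustive scoring of higher-order variants quickly becomes impractical and a guided search is attractive.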

Version published to 10.1101/2025.06.25.661516 on bioRxiv
Jun 27, 2025

