Reinforcement Learning for Antibody Sequence Infilling
Abstract
We introduce a flexible framework for antibody sequence design that combines an infilling language model with reinforcement learning to optimize functional properties. Our approach leverages a pretrained infilling language model to generate specific antibody regions within full sequences, guided by reinforcement learning to improve desired biophysical characteristics. We implement a range of online learning strategies, exploring both vanilla REINFORCE and Proximal Policy Optimization with Kullback-Leibler (KL) regularization, and demonstrate that KL regularization is essential for maintaining a balance between score optimization and sequence plausibility. We also adapt Direct Reward Optimization to the protein domain by adding a value head to the infilling model, allowing it to learn directly from static (prompt, response, feedback) datasets using a mean-squared error objective. This formulation is particularly useful when only single-trajectory data is available, as is commonly the case for historically collected experimental assays. We evaluate both the online and offline methods across multiple antibody design tasks, including binding affinity, immunogenicity, and expression, and show that our framework improves alignment with measured biophysical properties while outperforming likelihood-only baselines. This integrated online/offline approach enables functionally driven antibody design and provides a scalable toolkit for therapeutic sequence engineering. Code and data are available at https://github.com/LLNL/protein_tune_rl.
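For the online setting, KL-regularized policy optimization of the kind described above is commonly written as reward maximization penalized by divergence from the pretrained infilling model; a generic form (with task score $r$, reference model $\pi_{\mathrm{ref}}$, and an assumed regularization weight $\beta$, not necessarily the paper's exact formulation) is

$$\max_{\theta}\; \mathbb{E}_{y \sim \pi_\theta(\cdot \mid x)}\big[\, r(x, y) \,\big] \;-\; \beta\, \mathrm{KL}\big( \pi_\theta(\cdot \mid x) \,\|\, \pi_{\mathrm{ref}}(\cdot \mid x) \big).$$

For the offline setting, the sketch below illustrates what a DRO-style mean-squared-error objective over static (prompt, response, feedback) triples could look like. The function name, the $\beta$-scaled log-ratio, and the exact weighting are illustrative assumptions, not the repository's implementation.

```python
import torch

def dro_style_loss(logp_policy, logp_ref, value, reward, beta=0.1):
    """Hedged sketch of a DRO-style MSE objective on logged single-trajectory data.

    logp_policy : summed log-prob of the infilled region under the tuned model
    logp_ref    : same quantity under the frozen pretrained infilling model
    value       : scalar output of the added value head for each prompt
    reward      : measured biophysical score for each (prompt, response) pair
    beta        : assumed KL-regularization weight
    """
    # Regress the regularized policy improvement toward the observed reward:
    # reward ≈ value + beta * (logp_policy - logp_ref)
    residual = reward - value - beta * (logp_policy - logp_ref)
    return 0.5 * (residual ** 2).mean()

# Example with a small batch of logged (prompt, response, feedback) records.
logp_policy = torch.tensor([-42.3, -57.1])
logp_ref    = torch.tensor([-40.8, -55.9])
value       = torch.tensor([0.2, 0.5])   # value-head outputs for the prompts
reward      = torch.tensor([0.9, 0.1])   # measured biophysical scores
loss = dro_style_loss(logp_policy, logp_ref, value, reward)
```

In this sketch, the value head absorbs prompt-level baselines of the reward, so the policy log-ratio only needs to account for response-specific differences in the measured score, which is what makes a plain squared-error objective usable when only one response per prompt is available.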