A deep reinforcement learning platform for antibiotic discovery

Hanqun Cao
Marcelo D. T. Torres
Jingjie Zhang
Zijun Gao
Fang Wu
Chunbin Gu
Jure Leskovec
Yejin Choi
Cesar de la Fuente-Nunez
Guangyong Chen
Pheng-Ann Heng

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Antimicrobial resistance (AMR) is projected to cause up to 10 million deaths annually by 2050, underscoring the urgent need for new antibiotics. Here we present ApexAmphion, a deep-learning framework for de novo design of antibiotics that couples a 6.4-billion-parameter protein language model with reinforcement learning. The model is first fine-tuned on curated peptide data to capture antimicrobial sequence regularities, then optimised with proximal policy optimization against a composite reward that combines predictions from a learned minimum inhibitory concentration (MIC) classifier with differentiable physicochemical objectives. In vitro evaluation of 100 designed peptides showed low MIC values (nanomolar range in some cases) for all candidates (100% hit rate). Moreover, 99 our of 100 compounds exhibited broad-spectrum antimicrobial activity against at least two clinically relevant bacteria. The lead molecules killed bacteria primarily by potently targeting the cytoplasmic membrane. By unifying generation, scoring and multi-objective optimization with deep reinforcement learning in a single pipeline, our approach rapidly produces diverse, potent candidates, offering a scalable route to peptide antibiotics and a platform for iterative steering toward potency and developability within hours.

Version published to 10.1101/2025.09.23.678086 on bioRxiv
Sep 23, 2025

Reinforcement Learning-Based Generation of EGFR-Targeted Anticancer Small Molecules

This article has 2 authors:
1. Yuran Chai
2. Xiao Huang
This article has no evaluationsLatest version Oct 7, 2025
Practical Machine Learning Framework for Designing and Predicting C-Amidated Antimicrobial Peptides

This article has 5 authors:
1. Tu Le
2. Dang-Huy Le
3. Wenyi Li
4. Andrew Hung
5. Shadi Houshyar
This article has no evaluationsLatest version Oct 9, 2025
Moremi Bio Agent: Leveraging Agentic Large Language Model for the Discovery of Broad-Spectrum Antibiotics for Enterobacteriaceae

This article has 7 authors:
1. Gertrude Hattoh
2. Jeremiah Ayensu
3. Nyarko Prince Ofori
4. Solomon Eshun
5. Joshua Ntow Opare-Boateng
6. Osman Tanko
7. Darlington Akogo
This article has no evaluationsLatest version Aug 24, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Reinforcement Learning-Based Generation of EGFR-Targeted Anticancer Small Molecules

Practical Machine Learning Framework for Designing and Predicting C-Amidated Antimicrobial Peptides

Moremi Bio Agent: Leveraging Agentic Large Language Model for the Discovery of Broad-Spectrum Antibiotics for Enterobacteriaceae