Generative models for antimicrobial peptide design: auto-encoders and beyond

Lukas Beierle
Julian Hahnfeld
Alexander Goesmann
Reihaneh Mostolizadeh
Franz Cemič

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background

Since the number of multi-resistant pathogens is growing rapidly, new strategies to accelerate the development of antimicrobial drugs are urgently needed. A promising candidate class for new antibiotics are antimicrobial peptides, showing lower tendency to induce antibiotic resistance. High-throughput in silico strategies for candidate mining, such as generative deep learning algorithms, have become popular over the last few years and offer novel ways for peptide discovery.

Methods

This study presents a comparative analysis of contemporary deep learning models’ generative performance for generating novel antimicrobial peptides. The models examined include Variational Auto-Encoders, a Wasserstein Auto-Encoder, a Recurrent Neural Network and a Language Model. The primary focus of this study is the systematic comparison and evaluation of various methods and sampling options to identify the most suitable model and sampling strategy combination for different use cases.

Results

The findings demonstrate the models’ capacity to generate peptide sequences exhibiting analogous properties to those of naturally occurring active peptides, which are utilized for model training while featuring an appropriate degree of sequence diversity. Auto-encoder-based models, particularly the Wasserstein auto-encoder, have generated novel and remarkably diverse sequences compared to recurrent neural networks and language models. This model category exhibits a propensity to prioritize the frequencies of individual amino acids during the learning process, in contrast to variational auto-encoders. Furthermore, latent space models have been shown to possess the capacity to utilize diverse methodologies for generating novel peptides. However, it is imperative to note that these sampling strategies are not universally advantageous or disadvantageous; their optimal selection is contingent on the specificities of each individual use case.

Conclusion

The present study investigates the strengths and weaknesses of various generative models for antimicrobial peptides and suggests which model and sampling strategy combination should be favoured for specific individual applications.

Version published to 10.1101/2025.10.29.685317 on bioRxiv
Oct 30, 2025

Rebuilding the Antibiotic Pipeline with Guided Generative Models

This article has 3 authors:
1. Shriya Bhat
2. Rishab Jain
3. Wesley Greenblatt
This article has no evaluationsLatest version Jan 16, 2026
Unlocking the genomic landscape for antimicrobial domain discovery with a two-stage progressive residue-level annotation model

This article has 13 authors:
1. Peilin Xie
2. Xingchen Liu
3. Lantian Yao
4. Zhihao Zhao
5. Anming Yang
6. Jiahui Guan
7. Zijun Jiao
8. Zhihong Liu
9. Junwen Wang
10. Tzong-Yi Lee
11. Zigang Li
12. Bingyu Cui
13. Ying-Chih Chiang
This article has no evaluationsLatest version Dec 11, 2025
Drug discovery guided by maximum drug likeness

This article has 3 authors:
1. Hao-Yu Zhu
2. Lu Xu
3. Wei Shi
This article has no evaluationsLatest version Dec 31, 2025

Discuss this preprint

Listed in

Abstract

Background

Methods

Results

Conclusion

Article activity feed

Related articles

Rebuilding the Antibiotic Pipeline with Guided Generative Models

Unlocking the genomic landscape for antimicrobial domain discovery with a two-stage progressive residue-level annotation model

Drug discovery guided by maximum drug likeness