Sequence-based Drug-Target Complex Pre-training Enhances Protein-Ligand Binding Process Predictions Tackling Crypticity

Shuo Zhang
Li Xie
Daniel Tiourine
Lei Xie

Abstract

Predicting protein-ligand binding processes, such as affinity and kinetics, is critical for accelerating drug discovery. However, many existing computational methods face key limitations, including insufficient integration of comprehensive databases, inadequate representation of protein structural dynamics, and incomplete modeling of microscale protein-ligand interactions. To address these challenges, we introduce ProMoNet, a sequence-based pre-training and fine-tuning framework for protein-ligand binding process prediction. ProMoNet connects protein and molecular foundation models to expand data coverage and diversity, and it integrates large-scale binding site pre-training with efficient fine-tuning for affinity and kinetics prediction. During pre-training, it models microscale protein-ligand interactions and captures the dynamic nature of proteins, including binding site crypticity, without relying on three-dimensional structural inputs. Notably, ProMoNet’s pre-training module matches or surpasses state-of-the-art structure-based methods in identifying exposed and cryptic binding sites. In the fine-tuning stage, it transfers the pre-trained knowledge, achieving superior performance in affinity and kinetics prediction tasks with high computational efficiency. The combination of ProMoNet’s modeling capabilities and its demonstrated success across multiple tasks highlights its potential for broad applications in drug discovery.
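The two-stage data flow described in the abstract can be illustrated with a toy sketch. This is not ProMoNet's implementation: the abstract gives no architectural details, so every name below (`embed`, `binding_site_probs`, `affinity_score`, `DIM`) is hypothetical, the "foundation models" are replaced by fixed pseudo-random vectors per token, and no weights are trained. The sketch only shows how a sequence-only model might score each residue against a ligand (the pre-training task) and then reuse those scores for an affinity-style output (the fine-tuning task).

```python
import math
import random

DIM = 8  # hypothetical embedding width, chosen only for this sketch


def embed(token, namespace):
    """Stand-in for a frozen foundation model (assumption): a fixed
    pseudo-random vector per token, seeded by the token itself."""
    rng = random.Random(f"{namespace}:{token}")
    return [rng.gauss(0.0, 1.0) for _ in range(DIM)]


def mean_pool(vectors):
    """Average a list of DIM-sized vectors into one vector."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(DIM)]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def binding_site_probs(protein_seq, ligand_smiles):
    """Pre-training-style task: one binding probability per residue,
    computed from sequence and ligand string only (no 3D structure)."""
    residue_vecs = [embed(aa, "protein") for aa in protein_seq]
    ligand_vec = mean_pool([embed(ch, "ligand") for ch in ligand_smiles])
    scale = math.sqrt(DIM)
    return [sigmoid(dot(vec, ligand_vec) / scale) for vec in residue_vecs]


def affinity_score(protein_seq, ligand_smiles):
    """Fine-tuning-style task: pool residue vectors weighted by their
    predicted binding probability, then compare with the ligand vector.
    The result is an untrained, unitless score, not a real affinity."""
    probs = binding_site_probs(protein_seq, ligand_smiles)
    residue_vecs = [embed(aa, "protein") for aa in protein_seq]
    ligand_vec = mean_pool([embed(ch, "ligand") for ch in ligand_smiles])
    total = sum(probs)
    pocket_vec = [
        sum(p * vec[i] for p, vec in zip(probs, residue_vecs)) / total
        for i in range(DIM)
    ]
    return dot(pocket_vec, ligand_vec)


if __name__ == "__main__":
    seq, smiles = "MKTAYIAK", "CCO"  # made-up inputs for illustration
    for aa, p in zip(seq, binding_site_probs(seq, smiles)):
        print(aa, round(p, 3))
    print("score:", round(affinity_score(seq, smiles), 3))
```

Because the stand-in embeddings are context-free, identical residues always receive identical scores. A real protein language model produces context-dependent residue representations, which is what would be needed to capture effects such as binding site crypticity.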

Version published to 10.1101/2025.01.14.633076 on bioRxiv
Jan 19, 2025

Related articles

Integrating Computational Biology in Modern Drug Discovery: A Synergistic Approach of Structure-Based, Ligand-Based, and Network Pharmacology Strategies

This article has 4 authors:
1. Cromwel Tepap Zemnou
2. Gabriel Tchuente Kamsu
3. Ramelle Ngakam
4. Etienne Junior Tcheumeni
This article has no evaluations. Latest version: Jan 29, 2026

Parameter-Efficient Adaptation of Large Language Models for Drug-Target Affinity Modeling in Drug Discovery

This article has 1 author:
1. Virendra Singh Kaira
This article has no evaluations. Latest version: Jan 29, 2026

Rapid Assessment of Chemical Complementarity of Ligands for Protein Design

This article has 9 authors:
1. Derek Woolfson
2. Rokas Petrenas
3. Katarzyna Ożga
4. Joel Chubb
5. Andrey Romanyuk
6. Jennifer McManus
7. Graham Leggett
8. Nigel Scrutton
9. Tom Oliver
This article has no evaluations. Latest version: Dec 10, 2025
