Activity Cliff-Informed Contrastive Learning for Molecular Property Prediction

Chao Cui
Xiaorui Su
Zaixi Zhang
Alejandro Velez Arce
Jianming Wang
Xiangcheng Shi
Yanbing Zhang
Jie Wu
Yu Zong Chen
Marinka Zitnik
Wanxiang Shen

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Accurately predicting molecular activity is hindered by activity cliffs, which are sharp potency changes between highly similar compounds that distort the smoothness assumed by modern QSAR and graph neural networks (GNNs). Here we introduce activity cliff awareness (AC-awareness), an inductive bias that reshapes GNN latent spaces to account for these discontinuities. Implemented through an Activity Cliff Awareness (ACA) loss combining regression with soft-margin triplet contrastive learning, the method dynamically mines high-value cliff triplets during training and corrects inconsistent neighbourhoods in latent space. This yields progressively fewer cliff violations, more coherent activity gradients, and substantially reduced label incoherence across diverse chemical spaces. Evaluated on 52 datasets spanning low-sample narrow-scaffold series, large mixed-scaffold benchmarks, matched-pair cliff classification, and ADMET delta property prediction, AC-awareness consistently improves predictive accuracy and outperforms strong ECFP- and GNN-based baselines. The approach generalizes across multiple GNN backbones and remains effective under fixed hyperparameters, demonstrating that cliff-aware contrastive geometry provides a robust, architecture-independent mechanism for mitigating structure–activity discontinuities. These results establish AC-awareness as a principled strategy for enhancing molecular property prediction by aligning latent representations with the nonadditive behaviour underlying activity cliffs.

Version published to 10.21203/rs.3.rs-2988283/v3 on Research Square
Mar 24, 2026
Version published to 10.21203/rs.3.rs-2988283/v2 on Research Square
Dec 4, 2024
Version published to 10.21203/rs.3.rs-2988283/v1 on Research Square
Jun 28, 2023

Anchor Transfer Learning for Cross-Dataset Drug-Target Affinity Prediction

This article has 1 author:
1. Başar Temiz
This article has no evaluationsLatest version Apr 14, 2026
DMPKformer: An Interpretable Multimodal Deep Learning Framework for Reliable ADMET Property Prediction

This article has 6 authors:
1. A.S. Ben Geoffrey
2. Abhishek Singh
3. Sowmya Kanchan
4. Samir Anapat
5. Kishan Gurram
6. Nagaraj M Kulkarni
This article has no evaluationsLatest version May 29, 2026
BiLSTM-Powered Bilinear Attention for Protein–Ligand Prediction

This article has 4 authors:
1. Chih-Yang Cheng
2. Yi-An Chen
3. Feng-Yin Li
4. Suyong Re
This article has no evaluationsLatest version May 13, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Anchor Transfer Learning for Cross-Dataset Drug-Target Affinity Prediction

DMPKformer: An Interpretable Multimodal Deep Learning Framework for Reliable ADMET Property Prediction

BiLSTM-Powered Bilinear Attention for Protein–Ligand Prediction