iScore: A ML-Based Scoring Function for de novo Drug Discovery

Sayyed Jalil Mahdizadeh
Leif A. Eriksson

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

In the quest for accelerating de novo drug discovery, the development of efficient and accurate scoring functions represents a fundamental challenge. This study introduces iScore, a novel machine learning (ML)-based scoring function designed to predict the binding affinity of protein-ligand complexes with remarkable speed and precision. Uniquely, iScore circumvents the conventional reliance on explicit knowledge of protein-ligand interactions and full picture of atomic contacts, instead leveraging a set of ligand and binding pocket descriptors to evaluate binding affinity. This approach avoids the inefficient and slow conformational sampling stage, thereby enabling the rapid screening of ultra-huge molecular libraries, a crucial advancement given the practically infinite dimensions of chemical space. iScore was rigorously trained and validated using the PDBbind 2020 refined set, CASF 2016, and CSAR NRC-HiQ Set1/2, employing three distinct ML methodologies: Deep Neural Network (iScore-DNN), Random Forest (iScore-RF), and eXtreme Gradient Boosting (iScore-XGB). A hybrid model, iScore-Hybrid, was subsequently developed to incorporate the strengths of these individual base learners. The hybrid model demonstrated a Pearson correlation coefficient ( R ) of 0.78 and a root mean square error (RMSE) of 1.23 in cross-validation, outperforming the individual base learners and establishing new benchmarks for scoring power ( R = 0.814, RMSE=1.34), ranking power ( ρ = 0.705), and screening power (success rate at top 10% = 73.7%).

Version published to 10.1101/2024.04.02.587723 on bioRxiv
Apr 3, 2024

Drug discovery guided by maximum drug likeness

This article has 3 authors:
1. Hao-Yu Zhu
2. Lu Xu
3. Wei Shi
This article has no evaluationsLatest version Dec 31, 2025
Integrating Evolutionary and Compositional Features with ML and DL for Robust and Interpretable Druggable Protein Prediction

This article has 5 authors:
1. Mujeebu Rehman
2. Qinghua Liu
3. Muhammad Javed
4. Ali Ghulam
5. Teerath Kumar
This article has no evaluationsLatest version Dec 11, 2025
AutoFilter: A Low-Cost Biocomputational Framework for High-Throughput Screening of Chemical Databases and Identification of Novel Malaria Inhibitors Targeting Plasmodium Falciparum

This article has 2 authors:
1. Kavin Ramadoss
2. Kamlendra Singh
This article has no evaluationsLatest version Jan 3, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Drug discovery guided by maximum drug likeness

Integrating Evolutionary and Compositional Features with ML and DL for Robust and Interpretable Druggable Protein Prediction

AutoFilter: A Low-Cost Biocomputational Framework for High-Throughput Screening of Chemical Databases and Identification of Novel Malaria Inhibitors Targeting Plasmodium Falciparum