Computational Prioritization of Multi-Target Inhibitors: Explainable QSAR and Docking- Based Discovery of Dual AChE/BACE1 Chemotypes

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

The discovery of dual acetylcholinesterase (AChE) and β-secretase (BACE1) inhibitors remains a promising strategy against multifactorial Alzheimer’s disease. Here, rigorously curated ChEMBL-derived data were used to develop explainable QSAR models for dual-inhibition prioritization. Molecules were standardized, near-duplicates were removed using a Tanimoto similarity threshold (≥ 0.80), and physicochemical outliers were filtered prior to modeling. Multiple classifiers (including LightGBM, XGBoost, Random Forest, SVM, kNN, MLP, and AdaBoost) and fingerprints (e.g., RDKit-FP, ECFP6) were benchmarked under scaffold-based nested cross-validation to prevent data leakage. Class imbalance was handled with SMOTETomek applied strictly within training folds. Model selection relied on a weighted composite score combining F1, PR-AUC, MCC, and Recall, and performance was accompanied by bootstrap confidence intervals, calibration curves, and Y-randomization controls. The top model (LightGBM + RDKit-FP) achieved strong generalization (Accuracy ≈ 0.92, Recall ≈ 1.00, PR-AUC ≈ 0.86, ROC-AUC ≈ 0.96). SHAP analysis highlighted aromatic and hydrogen-bonding substructures as key positive contributors. Prospective candidates (e.g., CHEMBL5082250, CHEMBL1651126, CHEMBL1651127) were evaluated by active-site-focused docking against AChE (PDB: 4EY7) and BACE1 (PDB: 2G94) with essential waters retained; docking scores (ΔG, kcal·mol⁻¹) were used for relative ranking only and were further consensus-rescored. SwissADME/pkCSM profiling suggested CNS-relevant properties (e.g., MPO, logBB, P-gp liability) and acceptable oral drug-likeness. Collectively, the workflow provides a reproducible and transparent pipeline for prioritizing dual AChE/BACE1 chemotypes and nominates testable scaffolds for experimental validation.

Article activity feed