An Interpretable and Robust Multi-Parameter Prioritization Framework for BACE1 Inhibitors Integrating Meta-Ensemble QSAR, Protein Language Model–Guided Residue Weighting, and Sensitivity-Validated Ranking

Tangilal Dihan Chowdhury
Md Ushama Shafoyat
Nayamul Hasan Hemel
Daiyan Nizam
Jayem Hasan Sajib
Tobibul Islam Toha
Tanvir Ahmed Nyeem
Maisha Farzana
Syed Rashedul Haque
Maruf Hasan
Kazy Noor e Alam Siddiquee
Kaiissar Mannoor

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Alzheimer’s disease remains a major therapeutic challenge, and no β-secretase (BACE1) inhibitor has achieved clinical approval. A key limitation of prior discovery efforts is reliance on single-parameter optimization, often resulting in candidates with limited translational potential. In this study, we developed a biology-informed computational framework integrating meta-ensemble QSAR modeling, molecular docking, Protein Language Model (ESM-1b)-guided residue interaction weighting, and ADMET profiling within a normalized multi-parameter ranking scheme. Model performance was validated using cross-validation, external validation, and Y-randomization (n = 100; p = 0.009), while applicability domain analysis based on Tanimoto similarity highlighted reduced reliability for extrapolative predictions. Sensitivity analysis showed high ranking stability under moderate perturbations (Spearman ρ = 0.998 for ±10%; 0.963 for ±25%), with reduced agreement under randomized weighting (ρ = 0.821), indicating that prioritization is robust but influenced by weight selection. Screening of 16,196 compounds identified 153 predicted actives (accuracy = 0.852; ROC–AUC = 0.920), which were refined to 111 candidates and seven prioritized leads. Molecular dynamics simulations (200 ns) indicated stable binding and persistent catalytic interactions, with Mol-2 showing favorable dynamic stability and ADMET characteristics. Overall, this study presents an interpretable and quantitatively evaluated framework for multi-parameter compound prioritization, supporting more reliable virtual screening in early-stage CNS drug discovery.

Version published to 10.64898/2026.04.07.716920 on bioRxiv
Apr 10, 2026

Methods for Continuous-Valued Training Data Generation from Genome-Scale Metabolic Models: Partial-Inhibition FBA with Mixed Essentiality Sampling, Applied to ESKAPE Drug Target Curation

This article has 1 author:
1. Byeongsoo Kang
This article has no evaluationsLatest version Apr 13, 2026
ATMeQ: A Machine Learning-Based Framework for Amyotrophic Lateral Sclerosis Disease using RNA-seq Meta-Analysis

This article has 3 authors:
1. Ahmed Saif
2. Md Tarikul Islam
3. Md Aktaruzzaman
This article has no evaluationsLatest version Apr 17, 2026
A Unified Agent-Enabled Platform for Drug Repurposing across Molecular, Phenotypic, and Clinical Scales

This article has 8 authors:
1. Cheng Wang
2. Mohamed El Moussaoui
3. Dongdong Zhang
4. Prathiksha Prabhakaraalva
5. Serge Merzliakov
6. Nabila Zaman
7. Goutam Chakraborty
8. Kuan-lin Huang
This article has no evaluationsLatest version Apr 22, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Methods for Continuous-Valued Training Data Generation from Genome-Scale Metabolic Models: Partial-Inhibition FBA with Mixed Essentiality Sampling, Applied to ESKAPE Drug Target Curation

ATMeQ: A Machine Learning-Based Framework for Amyotrophic Lateral Sclerosis Disease using RNA-seq Meta-Analysis

A Unified Agent-Enabled Platform for Drug Repurposing across Molecular, Phenotypic, and Clinical Scales