FASA: Feature-Agnostic Stacked Autoencoders for Accurate Adverse Drug Reaction Prediction

Martin Gustavo Perez Bonany Torrealva
Edward Jorge Yuri Cayllahua Cahuina
Rensso Victor Hugo Mora Colque

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Purpose: Adverse drug reactions (ADRs) remain a major obstacle to drug safety, yet many computational predictors depend on molecular or biological features that are often unavailable for newly designed compounds. Several published models also report inflated performance due to biased preprocessing or unsuitable evaluation metrics. This work introduces a feature-free deep learning framework that predicts ADRs using only drug–ADR incidence matrices, which allows early-stage assessment even when auxiliary features are missing. Methods: FASA (Feature-Agnostic Stacked Autoencoders) was developed and trained solely on binary drug-ADR incidence matrices. FASA includes a cardinality-preserving regularization term that constrains reconstructed ADR vectors to follow realistic label-count distributions, preventing degenerate solutions and encouraging the model to learn meaningful structure from sparse data. Performance was evaluated via cross-validation, and the area under the precision-recall curve was reported, as it is well suited to extremely sparse pharmacovigilance data. Results: On the harmonized WPLMF dataset (1,177 drugs and 4,247 ADRs), the method achieves an AUPR of 0.7150, surpassing all baseline models reported in the original study, including MCS-MKL, FGRMF, IDSE-HE, Galeano, LogitMF and WPLMF, which obtains 0.6553 under the same five-fold protocol. On the raw SIDER benchmark, FASA reaches an AUPR of 0.6456, again outperforming previously published results on the unmodified matrix. Conclusion: These findings show that carefully regularized deep architectures can recover meaningful pharmacological structure directly from sparse incidence data. FASA offers a straightforward and competitive approach for large-scale ADR prediction using only drug-ADR incidence matrices, without requiring chemical, biological, or phenotypic features, and generalizes across datasets with varying levels of curation.

Version published to 10.21203/rs.3.rs-8952090/v1 on Research Square
Apr 1, 2026

DMPKformer: An Interpretable Multimodal Deep Learning Framework for Reliable ADMET Property Prediction

This article has 6 authors:
1. A.S. Ben Geoffrey
2. Abhishek Singh
3. Sowmya Kanchan
4. Samir Anapat
5. Kishan Gurram
6. Nagaraj M Kulkarni
This article has no evaluationsLatest version May 29, 2026
BiLSTM-Powered Bilinear Attention for Protein–Ligand Prediction

This article has 4 authors:
1. Chih-Yang Cheng
2. Yi-An Chen
3. Feng-Yin Li
4. Suyong Re
This article has no evaluationsLatest version May 13, 2026
GraphTox: A Semi-Supervised Pre-Trained Framework for Peptide Toxicity Prediction using Geometric Graph Transformer and ReLoRA based finetuning

This article has 3 authors:
1. Soumyadeep Bhaduri
2. Debraj Das
3. Pralay Mitra
This article has no evaluationsLatest version May 27, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

DMPKformer: An Interpretable Multimodal Deep Learning Framework for Reliable ADMET Property Prediction

BiLSTM-Powered Bilinear Attention for Protein–Ligand Prediction

GraphTox: A Semi-Supervised Pre-Trained Framework for Peptide Toxicity Prediction using Geometric Graph Transformer and ReLoRA based finetuning