CardioSafe: Multi-task prediction of cardiac ion channel activity with reverse-leak audited benchmarking

M. Jovanović
L. Weidener
M. Brkić
E. Ulgac
A. Meduri

Abstract

Drug-induced inhibition of the hERG potassium channel is the leading cause of cardiac safety-related drug attrition, but the Comprehensive in vitro Proarrhythmia Assay (CiPA) framework requires activity data on multiple cardiac ion channels to assess proarrhythmic risk. We present CardioSafe, a three-branch multi-task neural network with cross-attention fusion that integrates chemical fingerprints, ChemBERTa embeddings, and predicted L1000 transcriptomic features to predict blocker status and potency for hERG, Nav1.5, and Cav1.2, with an exploratory IKs head. CardioSafe was trained on the largest publicly reported multi-channel cardiac ion channel dataset, combining ChEMBL 36 with the hERGCentral database (331,127 hERG, 3,160 Nav1.5, 1,138 Cav1.2, and 115 IKs compounds), curated under a pharmacology-aware policy that retains censored measurements and inhibition-percentage votes. Under Tanimoto-similarity-controlled splits, CardioSafe outperforms the leading published comparators (CToxPred2 and CardioGenAI) on the data-rich hERG head; on the smaller Nav1.5 and Cav1.2 heads the standard evaluation is statistically inconclusive. A reverse-leak audit revealed that 22% of Nav1.5 and 21% of Cav1.2 test compounds were present in published comparators’ training data (92% as exact compound matches); after removing these contaminated compounds, CardioSafe’s lead on Nav1.5 and Cav1.2 also reaches statistical significance, demonstrating that prior cross-publication benchmarks for these channels were inflated by training-data overlap.
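The reverse-leak audit described above can be illustrated with a minimal sketch. It is not the authors' implementation: the compound keys, the set-of-on-bits fingerprint representation, the `reverse_leak_audit` helper, and the 0.95 near-match threshold are all assumptions made for illustration. The abstract states only that most overlaps were exact compound matches; how the remaining overlaps were detected is not specified, so a Tanimoto near-match rule is used here as a stand-in.

```python
from typing import Dict, FrozenSet, List, Tuple

# Hypothetical representation: a fingerprint is the set of its "on" bit indices.
Fingerprint = FrozenSet[int]


def tanimoto(a: Fingerprint, b: Fingerprint) -> float:
    """Tanimoto (Jaccard) similarity between two sets of on-bits."""
    if not a and not b:
        return 0.0
    shared = len(a & b)
    return shared / (len(a) + len(b) - shared)


def reverse_leak_audit(
    test: Dict[str, Fingerprint],
    comparator_train: Dict[str, Fingerprint],
    near_threshold: float = 0.95,  # assumed cutoff, not taken from the paper
) -> Tuple[List[str], List[str], List[str]]:
    """Split test compounds into exact leaks, near leaks, and clean compounds.

    Keys are assumed to be canonical compound identifiers, so key equality
    means the same compound appears in the comparator's training data.
    """
    exact, near, clean = [], [], []
    for key, fp in test.items():
        if key in comparator_train:
            exact.append(key)
        elif any(tanimoto(fp, other) >= near_threshold
                 for other in comparator_train.values()):
            near.append(key)
        else:
            clean.append(key)
    return exact, near, clean


# Toy data: "A" is an exact leak, "B" matches training compound "X" bit for bit.
test_set = {
    "A": frozenset({1, 2, 3, 4}),
    "B": frozenset({1, 2, 3, 5}),
    "C": frozenset({10, 11, 12}),
    "D": frozenset({20, 21}),
}
train_set = {
    "A": frozenset({1, 2, 3, 4}),
    "X": frozenset({1, 2, 3, 5}),
    "Y": frozenset({30, 31}),
}

exact, near, clean = reverse_leak_audit(test_set, train_set)
leak_rate = (len(exact) + len(near)) / len(test_set)
exact_share = len(exact) / (len(exact) + len(near))
print(exact, near, clean, leak_rate, exact_share)
```

In this toy case half of the test compounds are flagged, and half of those flags are exact matches. Metrics would then be recomputed on the `clean` subset only, which mirrors the "after removing these contaminated compounds" step in the abstract.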

Scientific contribution

We present the first multi-task neural network jointly predicting blocker activity for the three primary CiPA cardiac ion channels (hERG, Nav1.5, Cav1.2) within a single architecture. We introduce a reverse-leak audit methodology that reveals systematic test-set contamination in cross-publication cardiac safety benchmarks, establishing a stricter evaluation protocol. We provide an empirical test of predicted L1000 transcriptomic features as auxiliary input for cardiac ion channel prediction and document a well-characterized negative result.

CardioSafe encodes each query SMILES with three branches (chemical fingerprints + descriptors, pretrained ChemBERTa, and predicted L1000 transcriptomic signatures), fuses them via a cross-attention block with four learnable per-channel query tokens, and emits binary blocker calls plus pChEMBL regression for hERG, Nav1.5, Cav1.2, and (exploratory) IKs.
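The fusion step in this description can be sketched in a few lines. This is a simplified illustration rather than the published architecture: it uses a single attention head, treats the branch embeddings directly as keys and values with no learned projections, draws random vectors in place of trained query tokens and real branch outputs, and omits the classification and regression heads. The names `cross_attention`, `CHANNELS`, and the dimension of 8 are assumptions.

```python
import math
import random
from typing import List

Vector = List[float]

CHANNELS = ["hERG", "Nav1.5", "Cav1.2", "IKs"]  # one query token per channel


def dot(u: Vector, v: Vector) -> float:
    return sum(x * y for x, y in zip(u, v))


def softmax(scores: List[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries: List[Vector], tokens: List[Vector]) -> List[Vector]:
    """Scaled dot-product cross-attention.

    Each query attends over the branch tokens; the tokens serve as both
    keys and values (a simplification: no learned projection matrices).
    """
    dim = len(tokens[0])
    fused = []
    for q in queries:
        weights = softmax([dot(q, t) / math.sqrt(dim) for t in tokens])
        fused.append([sum(w * t[i] for w, t in zip(weights, tokens))
                      for i in range(dim)])
    return fused


rng = random.Random(0)
dim = 8

# Stand-ins for the three branch embeddings of one molecule:
# fingerprints + descriptors, ChemBERTa, and predicted L1000 signature.
branch_tokens = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(3)]

# Stand-ins for the four learnable per-channel query tokens.
channel_queries = [[rng.gauss(0, 1) for _ in range(dim)] for _ in CHANNELS]

fused = cross_attention(channel_queries, branch_tokens)
for name, vec in zip(CHANNELS, fused):
    print(name, [round(x, 3) for x in vec])
```

Each channel ends up with its own fused vector, a weighted mix of the three branches, which a per-channel head could then map to a blocker call and a pChEMBL estimate. Giving each channel a separate query lets the weighting over branches differ by channel.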

Version published to 10.64898/2026.05.06.723181 on bioRxiv, May 12, 2026

Related articles

MeTAL enables multiparametric risk prediction for human KCNH2 variants

This article has 14 authors:
1. Barbara Ribeiro de Oliveira
2. Elouan Voisin
3. Maxence Marbouty
4. Achille Gregoire
5. Malak Alameh
6. Jérôme Montnach
7. Aurélie Thollet
8. Flavien Charpentier
9. Isabelle Denjoy
10. Vincent Probst
11. Gildas Loussouarn
12. Isabelle Baró
13. Michel De Waard
14. Rupamanjari Majumder
This article has no evaluations. Latest version May 4, 2026

A Deep Learning-Based Scoring Framework for Large-Scale Multi-Donor Cardiotoxicity Screening

This article has 8 authors:
1. Danny Vu
2. Andrew Kowalczewski
3. Sarah D. Burnett
4. Courtney Sakolish
5. Xiyuan Liu
6. Huaxiao Yang
7. Ivan Rusyn
8. Zhen Ma
This article has no evaluations. Latest version Apr 26, 2026

A Consensus-Driven Stacking Ensemble Framework for Interpretable Cardiovascular Risk Prediction and Clinical Deployment

This article has 11 authors:
1. Shafak Shahriar Sozol
2. Bipul Chandra Dev Nath
3. F. M. Shafiullah Fahim
4. Nusrat Nizam Suzana
5. Jannatul Ferdous Mirza
6. Syed Ahmmed
7. Fatima-Tuz Zohra
8. Abu Hena Abid Zafr
9. Mohammed Nasir Uddin
10. M. Rubaiyat Hossain Mondal
11. Abu Sayed Md. Latiful Hoque
This article has no evaluations. Latest version May 26, 2026
