HEART: Hierarchical ensemble model using augmented representations and tabular learning for coronary artery disease prediction

Dimitrios Papakyriakopoulos
Pantelis Z. Lappas
Manolis N. Kritikos

Abstract

Coronary Artery Disease (CAD) remains one of the most widespread and life-threatening cardiovascular diseases and ranks among the leading causes of mortality worldwide. Its high prevalence highlights the urgent need for effective early detection, yet diagnosis often relies on invasive or imperfect screening tools that delay intervention and increase risk. To address this challenge, we introduce HEART, a novel machine learning framework that combines structured clinical knowledge with advanced ensemble learning and data-centric augmentation to enhance early CAD prediction. HEART is a two-level ensemble model in which nine diverse models act as base learners: Logistic Regression, Elastic Net Regression, Support Vector Machine, K-Nearest Neighbors, Radius Neighbors, Extra Trees, LightGBM, TabNet and TabPFN. Their predictions are combined by a TabPFN meta-learner that captures complex interactions among model outputs. We use Mutual Information (MI) for feature selection. To address class imbalance and limited data, we use a hybrid augmentation strategy that combines the synthetic minority oversampling technique (SMOTE) with class-specific Autoencoder reconstructions. Evaluated on the Sani Z-Alizadeh dataset, which augmentation expands to a final set of 1,000 samples, HEART achieves the top accuracy, 91%, under fully nested stratified ten-fold cross-validation relative to the other nine classifiers.
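The SMOTE half of the augmentation strategy named in the abstract can be illustrated with a minimal sketch. It assumes the standard SMOTE idea of interpolating between a minority-class sample and one of its nearest minority-class neighbours; the function name `smote_like_oversample`, its parameters and the toy rows are hypothetical and are not taken from the HEART implementation, and the class-specific Autoencoder reconstructions are not covered here.

```python
import math
import random


def smote_like_oversample(minority, n_new, k=3, seed=0):
    """Create n_new synthetic rows by interpolating between a minority row
    and one of its k nearest minority neighbours (the core SMOTE idea).

    Hypothetical helper for illustration only; not the authors' code.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)           # seeded for reproducibility
    k = min(k, len(minority) - 1)       # cannot have more neighbours than rows
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority rows by Euclidean distance to the base row.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        # Pick a random point on the segment between base and neighbour.
        gap = rng.random()
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Toy, made-up minority-class rows with two numeric features (assumption).
minority_rows = [[1.0, 2.0], [1.5, 1.8], [2.0, 2.2], [1.2, 2.5]]
new_rows = smote_like_oversample(minority_rows, n_new=6, k=2, seed=42)
```

Because every synthetic row lies on a segment between two real minority rows, it stays inside the range of the observed minority data. In a cross-validated evaluation, oversampling of this kind is normally applied to the training folds only, so that synthetic rows never reach a test fold; the abstract does not state how HEART handles this step.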

Version published to 10.21203/rs.3.rs-8239358/v1 on Research Square
Feb 24, 2026

