Advancing Cardiovascular Disease Diagnosis: A Robust ML Ecosystem Integrating Early Detection, Responsible AI Framework, and Causal Inference

Kazi Sakib Hasan
Irfan Sadi Dhrubo

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Cardiovascular disease (CVD) remains a leading global health threat, responsible for one in five deaths worldwide. Early detection is critical to mitigate morbidity and mortality, yet traditional diagnostic methods often rely on reactive clinical assessments, missing opportunities for preventive intervention. In this study, we developed a machine learning (ML) ecosystem to enhance CVD diagnosis through two key approaches: (1) an early warning system using non-clinical, self-reported features for accessible risk stratification, and (2) specialized diagnostic models integrating clinical and non-clinical data. Our framework leverages advanced ML techniques, including tabular neural networks (TabNet, TabPFN) and ensemble methods (XGBoost, Random Forest), validated on multi-regional datasets. SHAP analysis identified ECG-related features as dominant predictors of CVD risk, with ST-segment slope (+0.93) and ST depression (+0.63) exhibiting the strongest effects. Counterfactual explanations from the non-clinical model further revealed actionable preventive measures: reducing exercise-induced angina and chest pain severity, alongside increasing exercise heart rate, could shift predictions from diseased to healthy, highlighting the model’s utility for lifestyle interventions. To address ethical and clinical trustworthiness, we incorporated interpretability tools (SHAP, counterfactuals), fairness mitigation (FairLearn), and uncertainty quantification (Bayesian Neural Networks). Causal inference identified key predictors such as exercise-induced angina (ATE: 0.36) and ST slope (ATE: 0.33), informing a hybrid ensemble model that achieved 89% accuracy while reducing dimensionality. Notably, our analysis revealed limitations of synthetic data augmentation in clinical contexts. The system aligns with FDA Good ML Practices and EU Trustworthy AI guidelines, offering a scalable solution for early detection and equitable diagnosis.

Version published to 10.1101/2025.06.17.25329798 on medRxiv
Jun 18, 2025

Evaluating Feature Selection Methods and Feature Contributions for Cardiovascular Disease Risk Prediction

This article has 2 authors:
1. Suraiya Akhter
2. John H. Miller
This article has no evaluationsLatest version Jul 14, 2025
Enhancing CVD Risk Prediction: Integrating ECG Signals with Conventional Models Using AI

This article has 6 authors:
1. Maryam Mahdavi
2. Anoshirvan Kazemnejad
3. Abbas Asosheh
4. Davood Khalili
5. Kamyab Hosseinpour
6. Ahmadreza Tajari
This article has no evaluationsLatest version Jun 26, 2025
Enhanced machine learning and hybrid ensemble approaches for coronary heart disease prediction

This article has 4 authors:
1. Maurice Wanyonyi
2. Zakayo Ndiku Morris
3. Faith Mueni Musyoka
4. Dominic Makaa Kitavi
This article has no evaluationsLatest version Jul 3, 2025

Listed in

Abstract

Article activity feed

Related articles

Evaluating Feature Selection Methods and Feature Contributions for Cardiovascular Disease Risk Prediction

Enhancing CVD Risk Prediction: Integrating ECG Signals with Conventional Models Using AI

Enhanced machine learning and hybrid ensemble approaches for coronary heart disease prediction