Condition-Specific Readmission Risk Stratification in a Predominantly Black Statewide Cohort Using Machine Learning: Development of Subtype-Specific Models for Heart Failure, Acute Myocardial Infarction, Atrial Fibrillation/Flutter, and Hypertensive Heart Disease

Ismail El Moudden
Michael Bittner
Sunita Dodani

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Cardiovascular disease (CVD) readmissions impose substantial clinical and economic burden. Machine learning (ML) may improve risk stratification, yet most predictive models aggregate CVD subtypes into a single outcome and underrepresent Black populations. Using Virginia Health Information database records (2010 to 2020), we analyzed 157,791 discharge records from 123,272 unique patients (96.6% Black) to develop condition-specific 30-day readmission models for heart failure (HF; n = 91,752), acute myocardial infarction (AMI; n = 34,497), atrial fibrillation/flutter (AF/AFL; n = 18,424), and hypertensive heart disease (HHD; n = 13,118). Four algorithms (XGBoost, LightGBM, Random Forest, Elastic Net) plus a Super Learner ensemble were trained on patient-grouped 70/30 splits with and without Synthetic Minority Oversampling Technique balancing. Models incorporated validated clinical indices (LACE, Charlson, Elixhauser) and administrative social determinants of health proxies. The overall 30-day readmission rate was 18.9%. Best area under the receiver operating characteristic curve (AUC) values by condition were HF 0.708 (95% CI, 0.701 to 0.716), AMI 0.706 (95% CI, 0.691 to 0.721), AF/AFL 0.732 (95% CI, 0.715 to 0.750), and HHD 0.758 (95% CI, 0.735 to 0.777). XGBoost was the top-performing algorithm for three of four subtypes. The LACE Index, Charlson Comorbidity Index, and insurance type were consistently the strongest predictors. Algorithm-native, aggregated, and SHAP-based importance measures converged on these key features. In this largest-to-date, predominantly Black statewide cohort, condition-specific ML models achieved moderate-to-high discrimination for HF, AMI, AF/AFL, and HHD. Key clinical indices and administrative social determinants proxies emerged as dominant predictors, highlighting modifiable targets and high-risk subgroups. These findings support the development of precision, equity-informed readmission interventions and provide a scalable framework for deploying ML-driven decision support in safety-net and minority-serving healthcare systems.

Version published to 10.21203/rs.3.rs-9098008/v1 on Research Square
Mar 19, 2026

Rising Mortality from Atrial Fibrillation and COPD Comorbidity in the United States, 1999–2024: Implications for Cardiopulmonary Care in South Asia

This article has 10 authors:
1. Ammad Uddin
2. Muhammad Salik Uddin
3. Abbeha Talib
4. S M Aleem Hussain
5. Rida Shakeel
6. Muhammad Tahir
7. Shaheer Bin Shafiq
8. Ahmed Anwaar Uddin
9. Hermann Yokolo
10. Mohid Zulfiqar
This article has no evaluationsLatest version Apr 15, 2026
Rising Female Mortality in Combined Ischemic Heart Disease and Chronic Kidney Disease in the United States (1999–2023)

This article has 6 authors:
1. Hassaan Abid
2. Muhammad Vazaym
3. Muhammad Jawad
4. Gaaitri Lohano
5. Syed Mohamin Abbas Shah
6. Naveed Ahmed Khan
This article has no evaluationsLatest version Mar 27, 2026
Does Ethnicity Alter the Hazard of Stroke in Patients withMajor Modifiable Cardiometabolic Risk Factors? A population-based Longitudinal Studyusing Electronic Health Records in South London

This article has 2 authors:
1. Marc Delord
2. Abdel Douiri
This article has no evaluationsLatest version Mar 24, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Rising Mortality from Atrial Fibrillation and COPD Comorbidity in the United States, 1999–2024: Implications for Cardiopulmonary Care in South Asia

Rising Female Mortality in Combined Ischemic Heart Disease and Chronic Kidney Disease in the United States (1999–2023)

Does Ethnicity Alter the Hazard of Stroke in Patients withMajor Modifiable Cardiometabolic Risk Factors? A population-based Longitudinal Studyusing Electronic Health Records in South London