Predicting 30-Day Heart Failure Readmissions Using Machine Learning: Insights From the Kansas Health Information Network (KHIN)

Minjung Kim
Jun Yan
Jason Wasfy
Robert H. Aseltine

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Background

Heart failure (HF) is a major contributor to inpatient hospital utilization, with persistently high 30-day readmission rates. Existing prediction tools are frequently restricted to primary-diagnosis HF admissions, potentially excluding clinically relevant HF-related hospitalizations.

Objectives

To develop and validate risk prediction models using machine learning (ML)-based risk prediction models to predict 30-day readmissions among patients with HF using the Kansas Health Information Network, a statewide health information exchange.

Methods

This retrospective cohort study analyzed HF hospitalizations using predictors including demographics, comorbidities, laboratory results, medications, clinical quality metrics for diabetes and kidney disease management, and prior healthcare utilization. Five ML models, including regularized logistic regression, random forest, extreme gradient boosting, categorical boosting, and deep neural network, were trained using stratified 5-fold cross-validation. Model performance was evaluated on an independent test set using the area under the receiver operating characteristic curve (AUROC), area under the precision-recall curve (AUPRC), misclassification rate (MCR), and Brier score.

Results

Among 2,734 HF patients, the 30-day readmission rate was 27%. The XGBoost model achieved the best discrimination (AUROC=0.75; AUPRC=0.58; MCR=0.21). Patients in the highest-risk decile had a positive predictive value of 76%, accounted for approximately one-third of all 30-day readmissions, and had a 3.3-fold enrichment compared with baseline risk. The key predictors included prior hospital utilization, diabetes and kidney disease management indicators, and comorbidity burden.

Conclusions

Risk stratification using routinely collected clinical data identified a subgroup at elevated risk for 30-day readmission. These findings support the potential role of data-driven risk prediction to inform targeted transitional care.

CLINICAL PERSPECTIVES

What is New?

This study is among the first to utilize health information exchange data to develop machine learning models for predicting 30-day heart failure readmissions, including patients with HF as either a primary or secondary diagnosis. Unlike prior models restricted to primary HF admissions, this approach includes hospitalizations in any diagnosis position, reflecting real-world acute-care utilization. Notably, our machine learning model correctly identified approximately three in four patients classified as high-risk in the top decile of predicted scores, highlighting its ability to identify individuals at greatest likelihood of early readmission

What are the Clinical Implications?

Accurately identifying a group of high-risk patients has meaningful implications for care delivery. By correctly capturing actual readmissions within a small, prioritized segment of the population, ML-driven risk stratification can guide clinicians and care management teams in targeting transitional care resources where they are most needed. Such focused intervention strategies may allow for efficient targeting of limited health care resources, improve post-discharge outcomes, enhance operational efficiency, and potentially reduce preventable HF readmissions.

Version published to 10.64898/2026.05.18.26353537 on medRxiv
May 21, 2026

Developing a Heart Failure Readmission Model from Inpatient Electronic Medical Record Data

This article has 11 authors:
1. Elliot A. Martin
2. Seungwon Lee
3. Robin L. Walker
4. Eda Pitka
5. Morteza Zangeneh Soroush
6. Justin A. Ezekowitz
7. Jonathan G. Howlett
8. Nowell M. Fine
9. Jeffrey A. Bakal
10. Hude Quan
11. Cathy A. Eastwood
This article has no evaluationsLatest version Jul 21, 2026
Calibrated and Interpretable Machine Learning for ICU Mortality Prediction Using First 24-Hour Clinical Data

This article has 3 authors:
1. Abdallah Alsammani
2. Merasia Johnson
3. Jessica Elrefaei
This article has no evaluationsLatest version Jun 2, 2026
Explainable, personalised prediction of emergency readmission and mortality following hospitalisation in patients with heart failure

This article has 7 authors:
1. B Gallego
2. L Huberts
3. J Yu
4. V Blake
5. L Jorm
6. L Liu
7. S-Y Ooi
This article has no evaluationsLatest version Jul 17, 2026

Discuss this preprint

Listed in

Abstract

Background

Objectives

Methods

Results

Conclusions

CLINICAL PERSPECTIVES

What is New?

What are the Clinical Implications?

Article activity feed

Related articles

Developing a Heart Failure Readmission Model from Inpatient Electronic Medical Record Data

Calibrated and Interpretable Machine Learning for ICU Mortality Prediction Using First 24-Hour Clinical Data

Explainable, personalised prediction of emergency readmission and mortality following hospitalisation in patients with heart failure