A Natural Language Processing-Based Approach for Early Detection of Heart Failure Onset using Electronic Health Records

Yuxi Liu
Zhen Tan
Zhenhao Zhang
Song Wang
Jingchuan Guo
Huan Liu
Tianlong Chen
Jiang Bian

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Objectives

This study set out to develop and validate a risk prediction tool for the early detection of heart failure (HF) onset using real-world electronic health records (EHRs).

Background

While existing HF risk assessment models have shown promise in clinical settings, they are often tailored to specific medical conditions, limiting their generalizability. Moreover, most methods rely on hand-crafted features, making it difficult to capture the high-dimensional, sparse, and temporal nature of EHR data, thus reducing their predictive accuracy.

Methods

A total of 2,561 HF and 5,493 matched control patients were identified from the OneFlorida Clinical Research Consortium. We employed a suite of natural language processing (NLP) models, including Bag of Words, Skip-gram, and ClinicalBERT, to generate EHR embeddings, which were used as inputs for five prediction models. Model calibration was assessed under three calibration scenarios: no recalibration, recalibration in the large, and logistic recalibration.

Results

The XGBoost model demonstrated the best overall performance, achieving an AUROC of 0.7672, an F1 score of 0.5547, an AUPRC of 0.6382, and a Matthews correlation coefficient of 0.3993. The most impactful predictors included diagnoses, procedures, medications, lab tests, and patient age. Model performance varied across gender, race, and ethnicity subgroups. Logistic recalibration significantly improved model calibration in the overall cohort and demographic subgroups.

Conclusions

Our NLP-based approach demonstrated strong predictive performance and clinical relevance, highlighting its potential for integration into real-world clinical applications to facilitate early detection and proactive management of individuals at risk for HF.

Version published to 10.1101/2025.04.04.25325211v1 on medRxiv
Apr 6, 2025

Evaluating ChatGPT for Disease Prediction: A Comparative Study on Heart Disease and Diabetes

This article has 1 author:
1. Ebtesam Alomari
This article has no evaluationsLatest version Apr 7, 2025
Validation of Natural Language Processing for Surgical Complication Surveillance: Detecting Eleven Postoperative Complications from Electronic Health Records

This article has 4 authors:
1. Emilie E. Dencker
2. Alexander Bonde
3. Anders Troelsen
4. Martin Sillesen
This article has no evaluationsLatest version Apr 7, 2025
Novel Insights into the Application of Large Language Models in the Diagnosis and Treatment of Complex Cardiovascular Diseases: A Comparative Study

This article has 13 authors:
1. Menglin Tian
2. Shaolong Li
3. Wenyin Du
4. Sen Yang
5. Xiaohua Zhao
6. Hao Xiong
7. Hongxi Li
8. Mei Lu
9. Yunyan Ying
10. Jilei Zhang
11. Qiwei Liao
12. Dong Yang
13. Fuding Guo
This article has no evaluationsLatest version Apr 3, 2025

Listed in

Abstract

Objectives

Background

Methods

Results

Conclusions

Article activity feed

Related articles

Evaluating ChatGPT for Disease Prediction: A Comparative Study on Heart Disease and Diabetes

Validation of Natural Language Processing for Surgical Complication Surveillance: Detecting Eleven Postoperative Complications from Electronic Health Records

Novel Insights into the Application of Large Language Models in the Diagnosis and Treatment of Complex Cardiovascular Diseases: A Comparative Study