A Comparative Analysis of Electronic Health Record and Electrocardiogram Waveform Data for Pulmonary Embolism Identification in Critically Ill Patients

Sampath Rapuri
Carl Harris
Kirby Gong
Robert D. Stevens

Abstract

Pulmonary embolism (PE) is one of the leading causes of preventable death among hospitalized patients, yet current risk assessment tests based on clinical variables have shown inconsistent validity or poor predictive power. More recent predictive models for PE using electronic health record (EHR) data are promising, but their reliance on comprehensive and integrated EHR data can limit real-time utility, creating a need for more accessible and rapid diagnostic tools. This study compares the performance of an EHR-based model, an electrocardiogram waveform (WF) model, and a fusion model combining both modalities for the identification of PE in critically ill patients. We use routinely acquired clinical and ECG waveform data from the 48 hours preceding PE suspicion, drawn from a retrospective dataset of 7,132 ICU admissions between 2008 and 2019 (4.60% PE prevalence). PE diagnoses were determined through ICD-9 or ICD-10 diagnostic coding. Our WF model, which uses a single 10-second, 12-lead ECG sample, showed predictive performance (AUROC 0.67; 95% CI, 0.64–0.70) comparable to that of our EHR-based model (AUROC 0.71; 95% CI, 0.68–0.74). However, a fusion model combining both modalities did not improve predictive performance (AUROC 0.67; 95% CI, 0.64–0.70). All of our models outperformed widely used risk stratification scores, namely the Revised Geneva score (AUROC 0.54; 95% CI, 0.51–0.57), the original Wells score (AUROC 0.61; 95% CI, 0.58–0.64), and the PE Rule Out Criteria (AUROC 0.56; 95% CI, 0.53–0.59). Our findings underscore the value of ECG waveform data for detecting PE in critically ill patients by demonstrating its predictive capability relative to existing benchmarks. After additional validation, these models may serve as valuable tools in PE diagnostic clinical workflows.
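For readers less familiar with the metric reported above, the following is a minimal sketch of how an AUROC and a 95% confidence interval can be computed for a binary outcome. It is an illustration only. The abstract does not state how its intervals were derived, so the percentile bootstrap used here is an assumption, and the function names `auroc` and `bootstrap_ci` as well as the synthetic cohort are hypothetical. The cohort size and prevalence in the demo simply mirror the figures in the abstract (4.60% of 7,132 admissions is about 328 cases); the scores are random numbers, not study data.

```python
import random


def auroc(labels, scores):
    """AUROC via the rank-sum (Mann-Whitney) formulation; ties count as 0.5."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        # Find the end of the current group of tied scores.
        j = i
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # 1-based average rank of the tie group
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("AUROC needs at least one positive and one negative")
    rank_sum_pos = sum(r for r, y in zip(ranks, labels) if y == 1)
    return (rank_sum_pos - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


def bootstrap_ci(labels, scores, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the AUROC (assumed method, see lead-in)."""
    rng = random.Random(seed)
    n = len(labels)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if 0 < sum(ys) < n:  # skip resamples that contain only one class
            stats.append(auroc(ys, [scores[i] for i in idx]))
    stats.sort()
    lower = stats[round(alpha / 2 * n_boot)]
    upper = stats[round((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


if __name__ == "__main__":
    # Synthetic cohort: size and prevalence mirror the abstract, scores are made up.
    rng = random.Random(42)
    labels = [1 if rng.random() < 0.046 else 0 for _ in range(7132)]
    scores = [rng.gauss(0.6 if y else 0.0, 1.0) for y in labels]
    point = auroc(labels, scores)
    lower, upper = bootstrap_ci(labels, scores, n_boot=200)
    print(f"AUROC {point:.2f} (95% CI, {lower:.2f}-{upper:.2f})")
```

With a prevalence this low, each bootstrap resample contains only a few hundred positive cases, which is one reason the interval matters as much as the point estimate when comparing models.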

Author Summary

Pulmonary embolism (PE) is a life-threatening condition in which an embolus obstructs blood flow in the arteries of the lung. Although recent advances in the treatment of PE have improved patient outcomes and reduced mortality, existing risk scoring systems still lack discriminatory power and fail to validate in specialized populations such as patients in the Intensive Care Unit (ICU). In this cohort study, we used a large, open-source clinical dataset to develop PE detection models for critically ill patients that outperform current benchmark risk stratification scores. Our approach leverages ECG waveform data obtained within the 48 hours before clinical suspicion of PE, potentially enabling earlier therapeutic intervention. Furthermore, by incorporating hand-crafted features during model training, our study provides detailed insight into some of the predictive factors of PE that can be derived from ECG waveform data.

Version published to 10.1101/2025.09.24.25335530 on medRxiv
Sep 27, 2025

Related articles

Independent Risk Factors and Predictive Modeling of Pulmonary Embolism in Patients with Acute Ischemic Stroke

This article has 9 authors:
1. Sangsang Chen
2. Shixin Wu
3. Jie Liu
4. Yanfang Liu
5. Yanlan Huang
6. Xiulin Huang
7. Peng Chen
8. Lishan Xu
9. Zhijian Liang
Latest version Jan 30, 2026

Diagnostic Adequacy of COPD in Primary Care: A Population-Based Analysis of Spirometry Use and Risk-Factor Documentation

This article has 6 authors:
1. Pedro Fonte
2. Inês Domingues
3. Benvinda Barbosa
4. Carina Ferreira
5. Thys van Der Molen
6. Jaime Correia de Sousa
Latest version Jan 30, 2026

Establishment and Validation of A Model for New-onset Atrial Fibrillation in Patients with STEMI; A Study Based on CMR

This article has 10 authors:
1. Muhammad Arshad Saeed
2. Perkash Kumar
3. Liqi Ge
4. Xiaoqin Hu
5. Fei Li
6. Baixiang Zhang
7. Yuan Lu
8. Rabia Ismail
9. Zhuoqi Zhang
10. Wensu Chen
Latest version Jan 21, 2026
