Development and External Validation of a High-Precision Model for Predicting ICU Admission from Emergency Department Triage

Nathan Nguyen
Andrew Chu
Debadutta Dash

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Objective

To develop, internally evaluate, and externally validate a machine-learning (ML) model predicting intensive care unit (ICU) admission or death using information available solely at emergency department (ED) triage. Performance was primarily assessed by area under the precision-recall curve (AUPRC) to address severe class imbalance.

Methods

We trained an XGBoost classifier on the Medical Information Mart for Intensive Care IV (MIMIC-IV) dataset. Positive outcomes were ICU admission or death within 6 hours of arrival. Features included vital signs, engineered physiological measures, clinician-assigned acuity, demographics, chief complaint, and home medications. Model performance was internally evaluated through group-stratified five-fold cross-validation and externally validated on the Multimodal Clinical Monitoring in the Emergency Department (MC-MED) dataset.

Results

In the internal validation (MIMIC-IV, 350,241 visits; 11,745 ICU/death), the model achieved an AUPRC of 0.736 (95% CI: 0.728–0.743), AUROC of 0.966 (95% CI: 0.965–0.968), and accuracy of 0.936 (95% CI: 0.936–0.938). On external validation (MC-MED, 42,624 visits; 1,503 ICU/death), the model retained robust performance with an AUPRC of 0.602 (95% CI: 0.578–0.624, 0.134 decrease), AUROC of 0.949 (95% CI: 0.944–0.955, 0.017 decrease), and accuracy of 0.928 (95% CI: 0.927-0.932, 0.007 decrease), demonstrating promising generalizability despite institutional, temporal, and patient demographic differences.

Conclusions

This study presents one of the first triage ML models externally validated on a distinct ED cohort, achieving a new benchmark for AUPRC in flagging critically ill patients within minutes. Future directions include multi-site training and validation to further enhance real-world generalizability and clinical applicability.

Version published to 10.1101/2025.07.22.25332000 on medRxiv
Jul 23, 2025

Diagnostic Codes in AI prediction models and Label Leakage of Same-admission Clinical Outcomes

This article has 5 authors:
1. Bashar Ramadan
2. Ming-Chieh Liu
3. Michael C. Burkhart
4. William F. Parker
5. Brett K. Beaulieu-Jones
This article has no evaluationsLatest version Aug 13, 2025
Improving Hospital Length of Stay Prediction through Heterogeneous Data Integration from MIMIC-III Records

This article has 6 authors:
1. Ahmad F. Al Musawi
2. Pratip Rana
3. Sibtanu Raha
4. William C. Sleeman IV
5. Rishabh Kapoor
6. Preetam Ghosh
This article has no evaluationsLatest version Aug 26, 2025
Machine learning models to detect opioid misuse in Emergency Department patients at triage

This article has 9 authors:
1. Chirag Chhablani
2. Usman Shahid
3. Natalie Parde
4. Sami Muslmani
5. Huiyi Hu
6. Dillon Thorpe
7. Majid Afshar
8. Niranjan Karnik
9. Neeraj Chhabra
This article has no evaluationsLatest version Jul 18, 2025

Listed in

Abstract

Objective

Methods

Results

Conclusions

Article activity feed

Related articles

Diagnostic Codes in AI prediction models and Label Leakage of Same-admission Clinical Outcomes

Improving Hospital Length of Stay Prediction through Heterogeneous Data Integration from MIMIC-III Records

Machine learning models to detect opioid misuse in Emergency Department patients at triage