Machine Learning to Predict Mortality and Critical Events in a Cohort of Patients With COVID-19 in New York City: Model Development and Validation


Abstract

Background

COVID-19 has infected millions of people worldwide and is responsible for several hundred thousand fatalities. The COVID-19 pandemic has necessitated thoughtful resource allocation and early identification of high-risk patients. However, effective methods to meet these needs are lacking.

Objective

The aims of this study were to analyze the electronic health records (EHRs) of patients who tested positive for COVID-19 and were admitted to hospitals in the Mount Sinai Health System in New York City; to develop machine learning models for making predictions about the hospital course of the patients over clinically meaningful time horizons based on patient characteristics at admission; and to assess the performance of these models at multiple hospitals and time points.

Methods

We used Extreme Gradient Boosting (XGBoost) and baseline comparator models to predict in-hospital mortality and critical events at time windows of 3, 5, 7, and 10 days from admission. Our study population included harmonized EHR data from five hospitals in New York City for 4098 COVID-19–positive patients admitted from March 15 to May 22, 2020. The models were first trained on patients from a single hospital (n=1514) admitted on or before May 1, externally validated on patients from four other hospitals (n=2201) admitted on or before May 1, and prospectively validated on all patients admitted after May 1 (n=383). Finally, we applied model interpretability techniques to identify and rank the variables that drive model predictions.
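
As a rough illustration of this setup, the sketch below trains and cross-validates an XGBoost mortality classifier for one time horizon. All names (`df`, the `died_within_{h}d` label columns) and hyperparameter values are illustrative assumptions; the paper's exact features and settings are not reproduced here.

```python
# Minimal sketch of the multi-horizon training setup; column names and
# hyperparameters are illustrative assumptions, not the paper's settings.
import pandas as pd
import xgboost as xgb
from sklearn.model_selection import StratifiedKFold, cross_val_score

HORIZONS = [3, 5, 7, 10]  # days from admission, as in the study

def train_mortality_model(df: pd.DataFrame, horizon: int) -> xgb.XGBClassifier:
    """Fit an XGBoost classifier predicting death within `horizon` days,
    using only admission-time features (hypothetical column layout)."""
    label_cols = [f"died_within_{h}d" for h in HORIZONS]
    X = df.drop(columns=label_cols)
    y = df[f"died_within_{horizon}d"]

    model = xgb.XGBClassifier(
        n_estimators=200,      # illustrative values only
        max_depth=4,
        learning_rate=0.05,
        eval_metric="auc",
    )
    # 5-fold cross-validated AUC-ROC, the metric reported in the Results
    cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
    aucs = cross_val_score(model, X, y, cv=cv, scoring="roc_auc")
    print(f"{horizon}-day mortality AUC-ROC: {aucs.mean():.2f}")
    return model.fit(X, y)
```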

Results

Upon cross-validation, the XGBoost classifier outperformed baseline models, with an area under the receiver operating characteristic curve (AUC-ROC) for mortality of 0.89 at 3 days, 0.85 at 5 and 7 days, and 0.84 at 10 days. XGBoost also performed well for critical event prediction, with an AUC-ROC of 0.80 at 3 days, 0.79 at 5 days, 0.80 at 7 days, and 0.81 at 10 days. In external validation, XGBoost achieved an AUC-ROC of 0.88 at 3 days, 0.86 at 5 days, 0.86 at 7 days, and 0.84 at 10 days for mortality prediction. Similarly, for critical event prediction, the unimputed XGBoost model achieved an AUC-ROC of 0.78 at 3 days, 0.79 at 5 days, 0.80 at 7 days, and 0.81 at 10 days. Trends in performance on the prospective validation sets were similar. At 7 days, acute kidney injury on admission, elevated lactate dehydrogenase (LDH), tachypnea, and hyperglycemia were the strongest drivers of critical event prediction, while higher age, anion gap, and C-reactive protein were the strongest drivers of mortality prediction.
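
Feature rankings like these are commonly derived from tree ensembles with SHAP values; the sketch below assumes that approach (and the `model` and `X_valid` names from the training sketch above) rather than reproducing the paper's exact interpretability pipeline.

```python
# Sketch: ranking the drivers of a fitted XGBoost model with SHAP values.
# `model` and `X_valid` are assumed from the training sketch above; SHAP
# as the interpretability method is an assumption, not the paper's spec.
import numpy as np
import shap

explainer = shap.TreeExplainer(model)         # exact SHAP for tree ensembles
shap_values = explainer.shap_values(X_valid)  # shape: (patients, features)

# Rank features by mean absolute contribution across patients
mean_abs = np.abs(shap_values).mean(axis=0)
ranking = sorted(zip(X_valid.columns, mean_abs), key=lambda t: -t[1])
for name, score in ranking[:10]:
    print(f"{name}: {score:.3f}")
```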

Conclusions

We trained machine learning models to predict mortality and critical events in patients with COVID-19 at different time horizons, and validated them both externally and prospectively. These models identified at-risk patients and uncovered underlying relationships that predicted their outcomes.

Article activity feed

  1. SciScore for 10.1101/2020.04.26.20073411:

    Please note, not all rigor criteria are appropriate for all manuscripts.

    Table 1: Rigor

    Institutional Review Board Statement: This study has been approved by the Institutional Review Board at the Icahn School of Medicine at Mount Sinai (IRB-20-03271).
    Randomization: For each fold, hyperparameter tuning was performed by randomized grid searching directed towards maximizing the sensitivity metric over 2,000 discrete grid options (see the sketch after this table).
    Blinding: not detected.
    Power Analysis: not detected.
    Sex as a biological variable: not detected.
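
    The Randomization entry maps naturally onto scikit-learn's RandomizedSearchCV with sensitivity (recall) as the scoring metric. The sketch below assumes that mapping; the parameter grid is illustrative, not the study's 2,000 discrete options.

    ```python
    # Hedged sketch of per-fold randomized hyperparameter search that
    # maximizes sensitivity (recall); the grid below is illustrative.
    from scipy.stats import randint, uniform
    from sklearn.model_selection import RandomizedSearchCV
    import xgboost as xgb

    param_distributions = {
        "max_depth": randint(2, 8),
        "n_estimators": randint(50, 500),
        "learning_rate": uniform(0.01, 0.3),
        "subsample": uniform(0.5, 0.5),
    }
    search = RandomizedSearchCV(
        xgb.XGBClassifier(eval_metric="logloss"),
        param_distributions,
        n_iter=100,         # the study drew from ~2,000 discrete options
        scoring="recall",   # sensitivity = recall of the positive class
        cv=5,
        random_state=0,
    )
    # search.fit(X_train, y_train)  # X_train, y_train assumed available
    ```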

    Table 2: Resources

    No key resources detected.


    Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).


    Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:
    The results of our models should be considered in light of several limitations. First, we base our predictions solely on a patient’s admission labs (i.e. within 36 hours); while this restriction encourages the use of this model in patient triage, events during a patient’s hospital stay after admission may drive their clinical course away from the prior probability. Furthermore, not all patient labs are drawn at admission, which introduces an element of missingness in our dataset. For example, unlike the general patient population, patients on anticoagulation therapy, who likely have comorbidities increasing their baseline risk, will have coagulation labs (PT, PTT) taken on admission. However, the shift away from predicting death by the model in the absence of PT/PTT (Figure 3) suggests that missingness in coagulation labs is a proxy for this lower baseline risk secondary to not having comorbid conditions that require anticoagulation therapy. Additionally, patients admitted to the hospital later in the crisis were both beneficiaries of improved patient care protocols from experiential learning, but also victims of resource constraints from overburdened hospitals. These effects, while possibly negated by our large sample size, may also induce temporal variation between patient outcomes. Furthermore, inherent limitations exist when using EHRs, especially those integrated from multiple hospitals. In order to facilitate timely dissemination of our results, we chose not to manually...
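
    The missingness point above is worth making concrete: XGBoost handles missing values natively, learning a default split direction for NaNs at each tree node, so an absent lab can itself become a signal. A self-contained toy demonstration on synthetic data (not the study's):

    ```python
    # Toy demonstration that XGBoost trains and predicts through NaNs:
    # each split learns a default direction for missing values, so
    # missingness itself can carry signal (synthetic data, not study data).
    import numpy as np
    import xgboost as xgb

    rng = np.random.default_rng(0)
    X = rng.normal(size=(500, 3))
    y = (X[:, 0] > 0).astype(int)
    X[rng.random(500) < 0.3, 1] = np.nan  # knock out 30% of one "lab" column

    model = xgb.XGBClassifier(n_estimators=50, eval_metric="logloss").fit(X, y)
    print(model.predict_proba(np.array([[0.5, np.nan, 0.0]]))[:, 1])
    ```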

    Results from TrialIdentifier: No clinical trial numbers were referenced.


    Results from Barzooka: We did not find any issues relating to the usage of bar graphs.


    Results from JetFighter: We did not find any issues relating to colormaps.


    Results from rtransparent:
    • Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
    • Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
    • No protocol registration statement was detected.

    About SciScore

    SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.