Performance and Robustness of Machine Learning-based Radiomic COVID-19 Severity Prediction

Stephen S.F. Yip
Zan Klanecek
Shotaro Naganawa
John Kim
Andrej Studen
Luciano Rivetti
Robert Jeraj

This article has been Reviewed by the following groups

Read the full article

Listed in

Evaluated articles (ScreenIT)

Abstract

Objectives

This study investigated the performance and robustness of radiomics in predicting COVID-19 severity in a large public cohort.

Methods

A public dataset of 1110 COVID-19 patients (1 CT/patient) was used. Using CTs and clinical data, each patient was classified into mild, moderate, and severe by two observers: (1) dataset provider and (2) a board-certified radiologist. For each CT, 107 radiomic features were extracted. The dataset was randomly divided into a training (60%) and holdout validation (40%) set. During training, features were selected and combined into a logistic regression model for predicting severe cases from mild and moderate cases. The models were trained and validated on the classifications by both observers. AUC quantified the predictive power of models. To determine model robustness, the trained models was cross-validated on the inter-observer’s classifications.

Results

A single feature alone was sufficient to predict mild from severe COVID-19 with and (p<< 0.01). The most predictive features were the distribution of small size-zones (GLSZM-SmallAreaEmphasis) for provider’s classification and linear dependency of neighboring voxels (GLCM-Correlation) for radiologist’s classification. Cross-validation showed that both . In predicting moderate from severe COVID-19 , first-order-Median alone had sufficient predictive power of . For radiologist’s classification, the predictive power of the model increased to as the number of features grew from 1 to 5. Cross-validation yielded and .

Conclusions

Radiomics significantly predicted different levels of COVID-19 severity. The prediction was moderately sensitive to inter-observer classifications, and thus need to be used with caution.

Key points

Interpretable radiomic features can predict different levels of COVID-19 severity
Machine Learning-based radiomic models were moderately sensitive to inter-observer classifications, and thus need to be used with caution

SciScore for 10.1101/2020.09.07.20189977: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

NIH rigor criteria are not applicable to paper type.

Table 2: Resources

Software and Algorithms
Sentences	Resources
During training, maximum relevance minimum redundancy (MRMR) algorithm and recursive feature elimination (RFE) method were used for radiomic feature selection implemented in MATLAB fscmrmr function and Python’s Scikit-learn, respectively [32].	MATLAB suggested: (MATLAB, RRID:SCR_001622) Python’s suggested: (PyMVPA, RRID:SCR_006099) Scikit-learn suggested: (scikit-learn, RRID:SCR_002577)

Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).

Results from LimitationRecognizer: An explicit …

SciScore for 10.1101/2020.09.07.20189977: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

NIH rigor criteria are not applicable to paper type.

Table 2: Resources

Software and Algorithms
Sentences	Resources
During training, maximum relevance minimum redundancy (MRMR) algorithm and recursive feature elimination (RFE) method were used for radiomic feature selection implemented in MATLAB fscmrmr function and Python’s Scikit-learn, respectively [32].	MATLAB suggested: (MATLAB, RRID:SCR_001622) Python’s suggested: (PyMVPA, RRID:SCR_006099) Scikit-learn suggested: (scikit-learn, RRID:SCR_002577)

Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).

Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.

Results from TrialIdentifier: No clinical trial numbers were referenced.

Results from Barzooka: We did not find any issues relating to the usage of bar graphs.

Results from JetFighter: Please consider improving the rainbow (“jet”) colormap(s) used on page 15. At least one figure is not accessible to readers with colorblindness and/or is not true to the data, i.e. not perceptually uniform.

Results from rtransparent:

Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
No protocol registration statement was detected.

Read the original source

Version published to 10.1101/2020.09.07.20189977v1 on medRxiv
Sep 9, 2020

Machine learning models for the prediction of COVID-19 prognosis in the primary health care setting

This article has 8 authors:
1. Joan Barrot
2. Joan A. Caylà
3. Manel Mata-Cases
4. Jordi Real
5. Bogdan Vlacho
6. Josep Franch-Nadal
7. Didac Mauricio
8. the COVID-19 Working Group in Primary Health Care
This article has no evaluationsLatest version May 9, 2025
Development and validation of CT-based clinical-radiomics nomogram for predicting abdominal aortic aneurysms progression

This article has 7 authors:
1. Ru Tan
2. Maobo Wang
3. Bing Kang
4. Xinxin Yu
5. Guohua Zhao
6. Shuai Zhang
7. Ximing Wang
This article has no evaluationsLatest version May 20, 2025
Recognition of Gouty Arthritis Using a Deep Learning Radiomics Model with Ultrasound Images: A Multicenter Study

This article has 7 authors:
1. Minghang Lin
2. Lei Yan
3. Ning Lin
4. Qianni Chen
5. Mei He
6. Zhuhua Li
7. Shuqiang Chen
This article has no evaluationsLatest version May 13, 2025

This article has been Reviewed by the following groups

Listed in

Abstract

Objectives

Methods

Results

Conclusions

Key points

Article activity feed

Related articles

Machine learning models for the prediction of COVID-19 prognosis in the primary health care setting

Development and validation of CT-based clinical-radiomics nomogram for predicting abdominal aortic aneurysms progression

Recognition of Gouty Arthritis Using a Deep Learning Radiomics Model with Ultrasound Images: A Multicenter Study