COVID-19 Risk Stratification and Mortality Prediction in Hospitalized Indian Patients: Harnessing clinical data for public health benefits

Shanmukh Alle
Akshay Kanakan
Samreen Siddiqui
Akshit Garg
Akshaya Karthikeyan
Priyanka Mehta
Neha Mishra
Partha Chattopadhyay
Priti Devi
Swati Waghdhare
Akansha Tyagi
Bansidhar Tarai
Pranjal Pratim Hazarik
Poonam Das
Sandeep Budhiraja
Vivek Nangia
Arun Dewan
Ramanathan Sethuraman
C. Subramanian
Mashrin Srivastava
Avinash Chakravarthi
Johnny Jacob
Madhuri Namagiri
Varma Konala
Debasish Dash
Tavpritesh Sethi
Sujeet Jha
Anurag Agrawal
Rajesh Pandey
P. K. Vinod
U. Deva Priyakumar

Listed in

Evaluated articles (ScreenIT)

Abstract

The variability in the clinical course and prognosis of COVID-19 highlights the need for risk stratification of patient sub-groups based on clinical data. In this study, clinical data from a cohort of hospitalized Indian COVID-19 patients are used to develop risk stratification and mortality prediction models. We analyzed a set of 70 clinical parameters, including physiological and hematological parameters, to develop machine learning models and identify biomarkers. We also compared the Indian and Wuhan cohorts and analyzed the role of steroids. In addition, a bootstrap-averaged ensemble of Bayesian networks was learned to construct an explainable model for discovering actionable influences on mortality and days to outcome. We identified blood parameters, diabetes, co-morbidity and SpO2 levels as important risk stratification features, whereas mortality prediction depends only on blood parameters. XGBoost yielded the best performance on risk stratification (AUC score 0.83) and logistic regression yielded the best performance on mortality prediction (AUC score 0.92). Blood coagulation parameters (ferritin, D-dimer and INR) and immune and inflammation parameters (IL6, LDH and neutrophil (%)) are common features for both risk and mortality prediction. Compared with Wuhan patients, Indian patients with extreme blood parameters showed a higher survival rate. Analyses of medications suggest that a higher proportion of survivors and mild patients who were administered steroids had extreme neutrophil and lymphocyte percentages. The ensemble-averaged Bayesian network structure revealed serum ferritin to be the most important predictor of mortality and Vitamin D to influence severity independently of days to outcome. The findings are important for effective triage when healthcare infrastructure is under strain.
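
For readers unfamiliar with the AUC scores quoted in the abstract, the following is a minimal sketch of how such a score can be computed from a model's predicted risks. It is not the authors' code: the function name roc_auc and the toy labels and scores are illustrative assumptions, and the pairwise formulation is one standard way to define the area under the ROC curve.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    AUC equals the probability that a randomly chosen positive case
    (label 1, e.g. a death) receives a higher score than a randomly
    chosen negative case (label 0, e.g. a survivor); ties count as 0.5.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")

    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # tie counts as half
    return wins / (len(pos) * len(neg))


# Toy data (hypothetical): outcome labels and predicted mortality risks.
toy_labels = [0, 0, 1, 1]
toy_scores = [0.1, 0.4, 0.35, 0.8]
toy_auc = roc_auc(toy_labels, toy_scores)
print(toy_auc)
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation, so the reported 0.83 and 0.92 indicate that the models rank higher-risk patients above lower-risk ones in most pairs.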

Version published to 10.1371/journal.pone.0264785
Mar 17, 2022
ScreenIT
Mar 1, 2021
SciScore for 10.1101/2020.12.19.20248524:
Please note, not all rigor criteria are appropriate for all manuscripts.
Table 1: Rigor
NIH rigor criteria are not applicable to paper type.
Table 2: Resources
No key resources detected.
Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).
Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.
Results from TrialIdentifier: No clinical trial numbers were referenced.
Results from Barzooka: We found bar graphs of continuous data. We recommend replacing bar graphs with more informative graphics, as many different datasets can lead to the same bar graph. The actual data may suggest different conclusions from the summary statistics. For more information, please see Weissgerber et al (2015).
Results from JetFighter: We did not find any issues relating to colormaps.
Results from rtransparent:
Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
No protocol registration statement was detected.
About SciScore
SciScore is an automated tool that is designed to assist expert reviewers by finding and presenting formulaic information scattered throughout a paper in a standard, easy to digest format. SciScore checks for the presence and correctness of RRIDs (research resource identifiers), and for rigor criteria such as sex and investigator blinding. For details on the theoretical underpinning of rigor criteria and the tools shown here, including references cited, please follow this link.
Version published to 10.1101/2020.12.19.20248524 on medRxiv
Dec 22, 2020

Related articles

Clinical Study Protocol of the ‘Biomarkers of Severity of COVID-19 Patients’ (BIOMARCOVID) Project

This article has 18 authors:
1. Tuan-Anh Dinh
2. Corentin Leroy
3. Marion Brandolini-Bunlon
4. Sylvie Berthier
5. Candice Trocme
6. Nelle Varoquaux
7. Caroline Plazy
8. Antoine Vilotitch
9. Charles Terra
10. Bertrand Toussaint
11. Jean-Luc Bosson
12. Florence Castelli
13. Estelle Pujos-Guillot
14. Pauline Le Faouder
15. Justine Bertrand-Michel
16. Marion Le Marechal
17. Olivier Epaulard
18. Audrey Le Gouellec
This article has no evaluations. Latest version: Jun 17, 2026

Machine learning-based predictive clinical model for Shigella spp. infection in children with diarrhea

This article has 20 authors:
1. Francisco Sousa Junior
2. Jose Quirino Silva Filho
3. Alexandre Havt Binda
4. Gagandeep Kang
5. Margaret N. Kosek
6. Pascal O. Bessong
7. Amidou Samie
8. Rashidul Haque
9. Estomih R. Mduma
10. Jose P. Leite
11. Ladaporn Bodhidatta
12. Najeeha T. Iqbal
13. Nicola Page
14. Ireen Kiwelu
15. Zulfiqar A. Bhutta
16. Tahmeed Ahmed
17. Elizabeth Rogawski McQuade
18. James A. Platts-Mills
19. Eric R. Houpt
20. Aldo A. M. Lima
This article has no evaluations. Latest version: Jul 9, 2026

Pre-pandemic blood profiles predict COVID-19 hospitalization and death a decade later

This article has 1 author:
1. Laurence A. Jacobs
This article has no evaluations. Latest version: May 29, 2026
