Gene Expression Risk Scores for COVID-19 Illness Severity

Derick R Peterson
Andrea M Baran
Soumyaroop Bhattacharya
Angela R Branche
Daniel P Croft
Anthony M Corbett
Edward E Walsh
Ann R Falsey
Thomas J Mariani

This article has been Reviewed by the following groups

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

Evaluated articles (ScreenIT)

Abstract

Background

The correlates of coronavirus disease 2019 (COVID-19) illness severity following infection with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) are incompletely understood.

Methods

We assessed peripheral blood gene expression in 53 adults with confirmed SARS-CoV-2 infection clinically adjudicated as having mild, moderate, or severe disease. Supervised principal components analysis was used to build a weighted gene expression risk score (WGERS) to discriminate between severe and nonsevere COVID-19.

Results

Gene expression patterns in participants with mild and moderate illness were similar, but significantly different from severe illness. When comparing severe versus nonsevere illness, we identified >4000 genes differentially expressed (false discovery rate < 0.05). Biological pathways increased in severe COVID-19 were associated with platelet activation and coagulation, and those significantly decreased with T-cell signaling and differentiation. A WGERS based on 18 genes distinguished severe illness in our training cohort (cross-validated receiver operating characteristic-area under the curve [ROC-AUC] = 0.98), and need for intensive care in an independent cohort (ROC-AUC = 0.85). Dichotomizing the WGERS yielded 100% sensitivity and 85% specificity for classifying severe illness in our training cohort, and 84% sensitivity and 74% specificity for defining the need for intensive care in the validation cohort.

Conclusions

These data suggest that gene expression classifiers may provide clinical utility as predictors of COVID-19 illness severity.

Version published to 10.1093/infdis/jiab568
Nov 30, 2021

SciScore for 10.1101/2021.08.24.457521: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

Ethics	Field Sample Permit: Sample Collection and Processing: Approximately 3 ml of whole blood was collected in a Tempus™ Blood RNA Tube at the time of enrollment and stored at -80C until the time of processing.
Sex as a biological variable	not detected.
Randomization	not detected.
Blinding	not detected.
Power Analysis	not detected.

Table 2: Resources

Software and Algorithms
Sentences	Resources
Sequences were aligned against the human genome version hg38 using the Splice Transcript Alignment to a Reference (STAR) algorithm [11], and counts were generated using HTSeq [12].	STAR suggested: (STAR, RRID:SCR_004463) HTSeq suggested: (HTSeq, RRID:SCR_005514)
Pathway analysis of significantly differentially expressed …

SciScore for 10.1101/2021.08.24.457521: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

Ethics	Field Sample Permit: Sample Collection and Processing: Approximately 3 ml of whole blood was collected in a Tempus™ Blood RNA Tube at the time of enrollment and stored at -80C until the time of processing.
Sex as a biological variable	not detected.
Randomization	not detected.
Blinding	not detected.
Power Analysis	not detected.

Table 2: Resources

Software and Algorithms
Sentences	Resources
Sequences were aligned against the human genome version hg38 using the Splice Transcript Alignment to a Reference (STAR) algorithm [11], and counts were generated using HTSeq [12].	STAR suggested: (STAR, RRID:SCR_004463) HTSeq suggested: (HTSeq, RRID:SCR_005514)
Pathway analysis of significantly differentially expressed genes was performed using ENRICHR [13].	ENRICHR suggested: (Enrichr, RRID:SCR_001575)

Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).

Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:

Our current study has several limitations which are worth noting, including its relatively small sample size, the non-standardized interval between symptom onset and sample collection, and blood collection at one time point. The complexity of the clinical data among hospitalized participants (i.e. admissions only for isolation, persons with chronic oxygen requirements, COVID testing for procedures) made objective criteria to distinguish mild from moderate disease difficult, necessitating the need for clinical adjudication. Lastly, certain laboratory studies were not available for all subjects.

Results from TrialIdentifier: No clinical trial numbers were referenced.

Results from Barzooka: We did not find any issues relating to the usage of bar graphs.

Results from JetFighter: We did not find any issues relating to colormaps.

Results from rtransparent:

Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
No protocol registration statement was detected.

Results from scite Reference Check: We found no unreliable references.

Read the original source

Version published to 10.1101/2021.08.24.457521 on bioRxiv
Aug 24, 2021

A Preliminary Prognostic Model for Predicting Poor Prognosis in COVID-19 Integrating Lung Epithelial Injury (KL-6) with Routine Care Markers

This article has 7 authors:
1. Yunlai Liang
2. Kun Wang
3. Lu Long
4. Qizhuo Hou
5. Wenze Yu
6. Kangkang Huang
7. Bin Yi
This article has no evaluationsLatest version Feb 3, 2026
Trends of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) antibody prevalence in selected regions across Ghana

This article has 25 authors:
1. Peter Kojo Quashie
2. Joe Kimanthi Mutungi
3. Francis Dzabeng
4. Daniel Oduro-Mensah
5. Precious C. Opurum
6. Kesego Tapela
7. Aniefiok John Udoakang
8. WACCBIP COVID-19 Team
9. Ivy Asante
10. Lily Paemka
11. Frederick Kumi-Ansah
12. Osbourne Quaye
13. Emmanuela Amoako
14. Ralph Armah
15. Charlyne Kilba
16. Nana Afia Boateng
17. Michael Ofori
18. George B. Kyei
19. Yaw Bediako
20. Nicaise Ndam
21. James Abugri
22. Patrick Ansah
23. William K. Ampofo
24. Francisca Mutapi
25. Gordon A. Awandare
Reviewed by ScreenIT

This article has 1 evaluationAppears in 1 listLatest version Jan 15, 2026Latest activity May 2, 2021
An HLA Association With COVID-19 Vaccine Reactogenicity Correlates With Fewer SARS-CoV-2 Infections and Monocyte Activation

This article has 35 authors:
1. Jill Hollenbach
2. Anshika Srivastava
3. Demetra Chatzileontiadou
4. Anurag Adhikari
5. Rayo Suseno
6. Sean Lin
7. Juliano Boquett
8. Jamie Tuibeo
9. Tasneem Yusufali
10. Noah Peyser
11. Ticiana Farias
12. Katherine Kichula
13. Andrea Nguyen
14. Irvin Jose
15. Dhilshan Jayasinghe
16. Katerina Tarassi
17. Elissavet Kontou
18. Janesha Maddumage
19. Kleio Ampelakiotou
20. Alexandra Tsirogianni
21. Michael Dewar-Oldis
22. Peter Barnard
23. Joe Sabatino
24. Dimitrios Zoulas
25. Emily Ariens
26. Timothy Mercer
27. Emma Grant
28. Lloyd D'Orsogna
29. Corey Smith
30. Paul Norman
31. Gregory Marcus
32. Jeffrey Olgin
33. Mark J. Pletcher
34. Martin Maiers
35. Stephanie Gras
This article has no evaluationsLatest version Dec 17, 2025

This article has been Reviewed by the following groups

Discuss this preprint

Listed in

Abstract

Background

Methods

Results

Conclusions

Article activity feed

Related articles

A Preliminary Prognostic Model for Predicting Poor Prognosis in COVID-19 Integrating Lung Epithelial Injury (KL-6) with Routine Care Markers

Trends of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) antibody prevalence in selected regions across Ghana

An HLA Association With COVID-19 Vaccine Reactogenicity Correlates With Fewer SARS-CoV-2 Infections and Monocyte Activation