COVID-19 Biomarkers in research: Extension of the OncoMX cancer biomarker data model to capture biomarker data from other diseases

N Gogate
D Lyman
K.A Crandall
R Kahsay
D.A Natale
S Sen
R Mazumder

This article has been Reviewed by the following groups

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

Evaluated articles (ScreenIT)

Abstract

Scientists, medical researchers, and health care workers have mobilized worldwide in response to the outbreak of COVID-19, caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2; SCoV2). Preliminary data have captured a wide range of host responses, symptoms, and lingering problems post-recovery within the human population. These variable clinical manifestations suggest differences in influential factors, such as innate and adaptive host immunity, existing or underlying health conditions, co-morbidities, genetics, and other factors. As COVID-19-related data continue to accumulate from disparate groups, the heterogeneous nature of these datasets poses challenges for efficient extrapolation of meaningful observations, hindering translation of information into clinical applications. Attempts to utilize, analyze, or combine biomarker datasets from multiple sources have shown to be inefficient and complicated, without a unifying resource. As such, there is an urgent need within the research community for the rapid development of an integrated and harmonized COVID-19 Biomarker Knowledgebase. By leveraging data collection and integration methods, backed by a robust data model developed to capture cancer biomarker data we have rapidly crowdsourced the collection and harmonization of COVID-19 biomarkers. Our resource currently has 138 unique biomarkers. We found multiple instances of the same biomarker substance being suggested as multiple biomarker types during our extensive cross-validation and manual curation. As a result, our Knowledgebase currently has 265 biomarker type combinations. Every biomarker entry is made comprehensive by bringing in together ancillary data from multiple sources such as biomarker accessions (canonical UniProtKB accession, PubChem Compound ID, Cell Ontology ID, Protein Ontology ID, NCI Thesaurus Code, and Disease Ontology ID), BEST biomarker category, and specimen type (Uberon Anatomy Ontology) unified with ontology standards. Our preliminary observations show distinct trends in the collated biomarkers. Most biomarkers are related to the immune system (SAA,TNF-∝, and IP-10) or coagulopathies (D-dimer, antithrombin, and VWF) and a few have already been established as cancer biomarkers (ACE2, IL-6, IL-4 and IL-2). These trends align with proposed hypotheses of clinical manifestations compounding the complexity of COVID-19 pathobiology. We explore these trends as we put forth a COVID-19 biomarker resource that will help researchers and diagnosticians alike. All biomarker data are freely available from https://data.oncomx.org/covid19 .

SciScore for 10.1101/2020.09.09.196220: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

Institutional Review Board Statement	not detected.
Randomization	not detected.
Blinding	not detected.
Power Analysis	not detected.
Sex as a biological variable	not detected.

Table 2: Resources

Software and Algorithms
Sentences	Resources
Data Collection and Compilation: Curators searched for “COVID-19 biomarkers” in Google Scholar that were publicly available after January 2020.	Google Scholar suggested: (Google Scholar, RRID:SCR_008878)
Biomarker accession could include the canonical UniProtKB accession45, PubChem Compound ID46, Cell Ontology ID47,	UniProtKB suggested: (UniProtKB, RRID:SCR_004426)

Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share …

SciScore for 10.1101/2020.09.09.196220: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

Institutional Review Board Statement	not detected.
Randomization	not detected.
Blinding	not detected.
Power Analysis	not detected.
Sex as a biological variable	not detected.

Table 2: Resources

Software and Algorithms
Sentences	Resources
Data Collection and Compilation: Curators searched for “COVID-19 biomarkers” in Google Scholar that were publicly available after January 2020.	Google Scholar suggested: (Google Scholar, RRID:SCR_008878)
Biomarker accession could include the canonical UniProtKB accession45, PubChem Compound ID46, Cell Ontology ID47,	UniProtKB suggested: (UniProtKB, RRID:SCR_004426)

Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).

Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.

Results from TrialIdentifier: No clinical trial numbers were referenced.

Results from Barzooka: We did not find any issues relating to the usage of bar graphs.

Results from JetFighter: We did not find any issues relating to colormaps.

Results from rtransparent:

Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
No protocol registration statement was detected.

Read the original source

Version published to 10.1101/2020.09.09.196220 on bioRxiv
Sep 10, 2020

A Preliminary Prognostic Model for Predicting Poor Prognosis in COVID-19 Integrating Lung Epithelial Injury (KL-6) with Routine Care Markers

This article has 7 authors:
1. Yunlai Liang
2. Kun Wang
3. Lu Long
4. Qizhuo Hou
5. Wenze Yu
6. Kangkang Huang
7. Bin Yi
This article has no evaluationsLatest version Feb 3, 2026
Integrated Transcriptomic Analysis Reveals Distinct Immune Response Signatures and Prognostic Biomarkers in SARS-CoV-2 Infection

This article has 3 authors:
1. Haixia Li
2. Weiyan Zhu
3. Lu Yu
This article has no evaluationsLatest version Jan 28, 2026
DIVERSITY AND CLINICAL CORRELATIONS OF SARS-CoV-2 VARIANT DURING THE INTRODUCTION OF THE DELTA VARIANT IN GUATEMALA

This article has 13 authors:
1. Claudia Carranza
2. Lucia Ortiz
3. Maria Eugenia Castellanos
4. Ana Silvia Gonzalez-Reiche
5. Renata Mendizabal-Cabrera
6. Zain Khalil
7. Adriana van DeGuchte
8. Keith Farrugia
9. Mariana Herrera
10. Ernesto Mena
11. Celia Cordon-Rosales
12. Harm van Bakel
13. Daniel R. Perez
Reviewed by Access Microbiology

This article has 3 evaluationsLatest version Feb 3, 2026Latest activity Jul 20, 2025

This article has been Reviewed by the following groups

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Preliminary Prognostic Model for Predicting Poor Prognosis in COVID-19 Integrating Lung Epithelial Injury (KL-6) with Routine Care Markers

Integrated Transcriptomic Analysis Reveals Distinct Immune Response Signatures and Prognostic Biomarkers in SARS-CoV-2 Infection

DIVERSITY AND CLINICAL CORRELATIONS OF SARS-CoV-2 VARIANT DURING THE INTRODUCTION OF THE DELTA VARIANT IN GUATEMALA