A Viral Fragmentation Signature for SARS-CoV-2 in Clinical Samples Correlating with Contagiousness

Abstract

The viral load of SARS-CoV-2 in clinical samples as measured by the primary diagnostic tool of RT-PCR is an imperfect readout for infection potential as most targeted assays designed for sensitivity, indiscriminately detect short and long RNA fragments, although infectivity is embodied only in the whole virus and its intact genome. Here, we used next-generation sequencing (NGS) to characterize 155 clinical samples and show sensitive and quantitative detection of viral RNA which confirmed subgenomic RNA in 57.6% of samples and provided a novel method to determine relative integrity of viral RNA in samples. The relative abundance of long fragments quantified as a viral fragmentation score was positively associated with viral load and inversely related to time from disease onset. An empirically determined score cut-off for presence of substantially fragmented RNA was able to identify 100% of samples collected after 8 days of illness with poor infection potential in line with current clinical understanding of infectiousness of SARS-CoV-2. The quantification of longer fragments in addition to existing short targets in an NGS or RT-PCR-based assay could provide a valuable readout of infection potential simultaneous to the detection of any fragments of SARS-CoV-2 RNA in test samples.

SciScore for 10.1101/2021.01.11.21249265: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

Institutional Review Board Statement	not detected.
Randomization	Each forward primer additionally includes on the 5’ end, a random 10 nucleotide sequence to serve as molecular barcode.
Blinding	not detected.
Power Analysis	not detected.
Sex as a biological variable	not detected.

Table 2: Resources

Software and Algorithms
Sentences	Resources
A total of 155 samples, corresponding to 48 individuals were analyzed by NGS.	NGS suggested: (PM4NGS, RRID:SCR_019164)
Barcode sequences were clustered based on sequence and amplicon identity, and consensus calling was done for each molecular tag (or barcode) cluster, by first performing global alignment among all associated reads using MAFFT.	MAFFT suggested: (MAFFT, RRID:SCR_0…

SciScore for 10.1101/2021.01.11.21249265: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

Institutional Review Board Statement	not detected.
Randomization	Each forward primer additionally includes on the 5’ end, a random 10 nucleotide sequence to serve as molecular barcode.
Blinding	not detected.
Power Analysis	not detected.
Sex as a biological variable	not detected.

Table 2: Resources

Software and Algorithms
Sentences	Resources
A total of 155 samples, corresponding to 48 individuals were analyzed by NGS.	NGS suggested: (PM4NGS, RRID:SCR_019164)
Barcode sequences were clustered based on sequence and amplicon identity, and consensus calling was done for each molecular tag (or barcode) cluster, by first performing global alignment among all associated reads using MAFFT.	MAFFT suggested: (MAFFT, RRID:SCR_011811)
Insert length analysis and calculation of viral fragmentation score: For each positive sample with sequencing data, SAMtools was used to capture insert sizes from alignment files with the following specifications: samtools view -f65 -F2048 SampleXYZ_consensus.bam \| cut -f 1,4,9 > SampleXYZ_f65F2048.txt where f65 = filter in read paired, first in pair (Reads which are paired and insert sizes for first read in read pair and second in read in read pair will have same same insert size with “negative” length); F2048 = filter out reads with supplementary alignment (removes most subgenomic RNA reads which have part of reads with supplementary alignment and removes chimeric reads.	samtools suggested: (SAMTOOLS, RRID:SCR_002105)
Regression analysis and curve fitting was done using Prism 8.0.1.	Prism suggested: (PRISM, RRID:SCR_005375)

Results from OddPub: We did not detect open data. We also did not detect open code. Researchers are encouraged to share open data when possible (see Nature blog).

Results from LimitationRecognizer: We detected the following sentences addressing limitations in the study:

There are important limitations to this study. The use of the exact cut-off of VFS to categorize samples into clinical groups may not universally apply. Nonetheless, the basic premise of the argument that more intact virus would reflect in more longer fragments in a clinical sample would universally apply. As the study’s main conclusions are based on NGS, with some comparison to RT-PCR results, without evidence from virus culture studies it cannot be ruled out that some of the observed infectivity measures are correlated and result from a feature of the specific NGS method. In conclusion, we have applied NGS to comprehensively characterize longitudinal samples collected from different sites. NGS is an enabling tool that provides sequence-related information for which it is primarily designed, and also information from size and length dimensions. Based on this, we identify fragment length differences among clinical samples which are correlated to clinical features of infectiousness of SARS-CoV-2, quantification of which could be incorporated as relevant and straightforward measure to determine infection potential.

Results from TrialIdentifier: No clinical trial numbers were referenced.

Results from Barzooka: We did not find any issues relating to the usage of bar graphs.

Results from JetFighter: We did not find any issues relating to colormaps.

Results from rtransparent:

Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
Thank you for including a protocol registration statement.

Read the original source

A Viral Fragmentation Signature for SARS-CoV-2 in Clinical Samples Correlating with Contagiousness

This article has been Reviewed by the following groups

Discuss this preprint

Listed in

Abstract

Article activity feed

Persistent Immune Dysregulation during Long COVID is Manifested in Antibodies Targeting Envelope and Nucleocapsid Proteins

DIVERSITY AND CLINICAL CORRELATIONS OF SARS-CoV-2 VARIANT DURING THE INTRODUCTION OF THE DELTA VARIANT IN GUATEMALA

Assessing Mass Screening as an Effective Tool for Pandemic Management

This article has been Reviewed by the following groups

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Persistent Immune Dysregulation during Long COVID is Manifested in Antibodies Targeting Envelope and Nucleocapsid Proteins

DIVERSITY AND CLINICAL CORRELATIONS OF SARS-CoV-2 VARIANT DURING THE INTRODUCTION OF THE DELTA VARIANT IN GUATEMALA

Assessing Mass Screening as an Effective Tool for Pandemic Management