Analysis of SARS-CoV-2 known and novel subgenomic mRNAs in cell culture, animal model, and clinical samples using LeTRS, a bioinformatic tool to identify unique sequence identifiers

Xiaofeng Dong
Rebekah Penrice-Randal
Hannah Goldswain
Tessa Prince
Nadine Randle
I'ah Donovan-Banfield
Francisco J Salguero
Julia Tree
Ecaterina Vamos
Charlotte Nelson
Jordan Clark
Yan Ryan
James P Stewart
Malcolm G Semple
J Kenneth Baillie
Peter J M Openshaw
Lance Turtle
David A Matthews
Miles W Carroll
Alistair C Darby
Julian A Hiscox

This article has been Reviewed by the following groups

Read the full article

Listed in

Evaluated articles (ScreenIT)

Abstract

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has a complex strategy for the transcription of viral subgenomic mRNAs (sgmRNAs), which are targets for nucleic acid diagnostics. Each of these sgmRNAs has a unique 5′ sequence, the leader–transcriptional regulatory sequence gene junction (leader–TRS junction), that can be identified using sequencing. High-resolution sequencing has been used to investigate the biology of SARS-CoV-2 and the host response in cell culture and animal models and from clinical samples. LeTRS, a bioinformatics tool, was developed to identify leader–TRS junctions and can be used as a proxy to quantify sgmRNAs for understanding virus biology. LeTRS is readily adaptable for other coronaviruses such as Middle East respiratory syndrome coronavirus or a future newly discovered coronavirus. LeTRS was tested on published data sets and novel clinical samples from patients and longitudinal samples from animal models with coronavirus disease 2019. LeTRS identified known leader–TRS junctions and identified putative novel sgmRNAs that were common across different mammalian species. This may be indicative of an evolutionary mechanism where plasticity in transcription generates novel open reading frames, which can then subject to selection pressure. The data indicated multiphasic abundance of sgmRNAs in two different animal models. This recapitulates the relative sgmRNA abundance observed in cells at early points in infection but not at late points. This pattern is reflected in some human nasopharyngeal samples and therefore has implications for transmission models and nucleic acid–based diagnostics. LeTRS provides a quantitative measure of sgmRNA abundance from sequencing data. This can be used to assess the biology of SARS-CoV-2 (or other coronaviruses) in clinical and nonclinical samples, especially to evaluate different variants and medical countermeasures that may influence viral RNA synthesis.

Version published to 10.1093/gigascience/giac045
Jan 1, 2022

SciScore for 10.1101/2021.03.03.433753: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

Institutional Review Board Statement	not detected.
Randomization	not detected.
Blinding	not detected.
Power Analysis	not detected.
Sex as a biological variable	not detected.
Cell Line Authentication	not detected.

Table 2: Resources

Experimental Models: Cell Lines
Sentences	Resources
The LeTRS was also tested with a combined Nanopore cDNA ARTIC v3 amplicon dataset of 7 published viral cell culture samples (barcode01-barcode07) 16, and a dataset from a published direct RNA Nanopore sequencing analysis Vero cells infected with SARS-CoV-2 or an uninfected negative control 2.	Vero suggested: CLS Cat# 605372/p622_VERO, RRID:CVCL_0059)
Software and Algorithms
Sentences	Resources
The generated raw FastQ files (2 x 250 bp) were …

SciScore for 10.1101/2021.03.03.433753: (What is this?)

Please note, not all rigor criteria are appropriate for all manuscripts.

Table 1: Rigor

Institutional Review Board Statement	not detected.
Randomization	not detected.
Blinding	not detected.
Power Analysis	not detected.
Sex as a biological variable	not detected.
Cell Line Authentication	not detected.

Table 2: Resources

Experimental Models: Cell Lines
Sentences	Resources
The LeTRS was also tested with a combined Nanopore cDNA ARTIC v3 amplicon dataset of 7 published viral cell culture samples (barcode01-barcode07) 16, and a dataset from a published direct RNA Nanopore sequencing analysis Vero cells infected with SARS-CoV-2 or an uninfected negative control 2.	Vero suggested: CLS Cat# 605372/p622_VERO, RRID:CVCL_0059)
Software and Algorithms
Sentences	Resources
The generated raw FastQ files (2 x 250 bp) were trimmed to remove Illumina adapter sequences using Cutadapt v1.2.1 26.	Cutadapt suggested: (cutadapt, RRID:SCR_011841)
The reads were further trimmed to remove low quality bases, using Sickle v1.200 27 with a minimum window quality score of 20.	Sickle suggested: (Sickle, RRID:SCR_006800)
The LeTRS was also tested with a combined Nanopore cDNA ARTIC v3 amplicon dataset of 7 published viral cell culture samples (barcode01-barcode07) 16, and a dataset from a published direct RNA Nanopore sequencing analysis Vero cells infected with SARS-CoV-2 or an uninfected negative control 2.	SARS-CoV-2 suggested: (Active Motif Cat# 91351, RRID:AB_2847848)
Sequencing data alignment and basic filtering: LeTRS controlled Hisat2 v2.1.0 28 to map the paired-end Illumina reads against the SARS-CoV-2 reference genome (NC_045512.2) with the default setting, and Minimap2 v2.1 29 to align the Nanopore cDNA reads and direct RNA-seq reads on the viral genome using Minimap2 with “–ax splice” and “-ax splice -uf -k14” parameters, respectively.	Minimap2 suggested: (Minimap2, RRID:SCR_018550)
LeTRS provided 10 known leader-TRS junctions to improve alignment accuracy by using “--known-splicesite-infile” function in Hisat2 and “--junc-bed” function in Minimap2, but this application could be optionally switched off by users.	Hisat2 suggested: (HISAT2, RRID:SCR_015530)
In order to remove low mapping quality and mis-mapped reads before searching the leader-TRS junction sites, LeTRS used Samtools v1.9 30 to have basic filtering for the reads in the output Sam/Bam files according to their alignment states as shown (Table 9 - basic filtering).	Samtools suggested: (SAMTOOLS, RRID:SCR_002105)
Leader-TRS junction plotting: LeTRS-plot was developed as an automatic plotting tool that interfaces with the R package ggplot2 v3.3.3 to view the leader-TRS junctions in the tables generated by LeTRS (Figure 3-5).	ggplot2 suggested: (ggplot2, RRID:SCR_014601)

Results from OddPub: Thank you for sharing your code.

Results from LimitationRecognizer: An explicit section about the limitations of the techniques employed in this study was not found. We encourage authors to address study limitations.

Results from TrialIdentifier: No clinical trial numbers were referenced.

Results from Barzooka: We found bar graphs of continuous data. We recommend replacing bar graphs with more informative graphics, as many different datasets can lead to the same bar graph. The actual data may suggest different conclusions from the summary statistics. For more information, please see Weissgerber et al (2015).

Results from JetFighter: We did not find any issues relating to colormaps.

Results from rtransparent:

Thank you for including a conflict of interest statement. Authors are encouraged to include this statement when submitting to a journal.
Thank you for including a funding statement. Authors are encouraged to include this statement when submitting to a journal.
No protocol registration statement was detected.

Read the original source

Version published to 10.1101/2021.03.03.433753 on bioRxiv
Mar 3, 2021

A conserved long-range RNA interaction in SARS-CoV-2 recruits ADAR1 to enhance virus proliferation

This article has 18 authors:
1. Siwy Ling Yang
2. Louis DeFalco
3. Sainan Wang
4. Yi Hao Wong
5. Jian Han
6. Chee Keng Mok
7. Kiat Yee Tan
8. Su Ying Lim
9. Zhiya Zhao
10. Yu Zhang
11. Jovi Jian An Lim
12. Joy S. Xiang
13. Radoslaw Sobota
14. Lin-Fa Wang
15. Justin Jang Hann Chu
16. Andres Merits
17. Roland G. Huber
18. Yue Wan
This article has no evaluationsLatest version Jul 8, 2025
Coronavirus protein interaction mapping in bat and human cells identifies molecular and genetic switches for immune evasion and replication

This article has 57 authors:
1. Jyoti Batra
2. Magdalena Rutkowska
3. Yuan Zhou
4. Chengjin Ye
5. Rithika Adavikolanu
6. Janet M. Young
7. Durga Anand
8. Sooraj Verma
9. Martin Gordon
10. Shivali Malpotra
11. Anastasija Cupic
12. Thomas Kehrer
13. Jack M Moen
14. Declan M. Winters
15. Ajda Rojc
16. Ignacio Mena
17. Sadaf Aslam
18. Carles Martinez-Romero
19. Isabela Conde Viñas
20. Zain Khalil
21. Keith Farrugia
22. Atoshi Banerjee
23. Dafna Tussia-Cohen
24. Melanie Dos Santos
25. Sourobh Maji
26. Monita Muralidharan
27. Helene Foussard
28. Irene P. Chen
29. Rotem Fuchs
30. CJ San Felipe
31. Lorena Zuliani-Alvarez
32. Promisree Choudhury
33. Kirsten Obernier
34. Ségolène Gracias
35. Rahul Suryawanshi
36. Carlos Ibáñez
37. Javier Juste
38. Lars Pache
39. Taha Y. Taha
40. Nolwenn Jouvenet
41. Kliment A. Verba
42. James S Fraser
43. Caroline Demeret
44. Robert M. Stroud
45. Harm van Bakel
46. Melanie Ott
47. Tzachi Hagai
48. Ben Polacco
49. Danielle L Swaney
50. Ignacia Echeverria
51. Mehdi Bouhaddou
52. Manon Eckhardt
53. Harmit S. Malik
54. Luis Martinez-Sobrido
55. Lisa Miorin
56. Adolfo García-Sastre
57. Nevan J Krogan
This article has no evaluationsLatest version Jul 28, 2025
Utility of long read Nanopore sequencing as a tool for rapid insight into the host response in patients infected with SARS-CoV-2 or influenza A virus at point-of-care

This article has 18 authors:
1. Natasha Y. Rickett
2. Catherine Hartley
3. Ghada T. Shawli
4. Andrew George Shapanis
5. Aaron Ions Gardner
6. Nicholas Harding
7. Jelmer Legebeke
8. Jenny Lord
9. Stephen Poole
10. Nathan J. Brendish
11. Anthony P. Williams
12. Fabio Strazzeri
13. James P. R. Schofield
14. Paul J. Skipp
15. Tristan W. Clark
16. Diana Baralle
17. Julian A. Hiscox
18. Rebekah Penrice-Randal
This article has no evaluationsLatest version Aug 11, 2025

This article has been Reviewed by the following groups

Listed in

Abstract

Article activity feed

Related articles

A conserved long-range RNA interaction in SARS-CoV-2 recruits ADAR1 to enhance virus proliferation

Coronavirus protein interaction mapping in bat and human cells identifies molecular and genetic switches for immune evasion and replication

Utility of long read Nanopore sequencing as a tool for rapid insight into the host response in patients infected with SARS-CoV-2 or influenza A virus at point-of-care