Benchmarking long-read RNA-sequencing technologies with LongBench: a cross-platform reference dataset profiling cancer cell lines with bulk and single-cell approaches

Yupei You
Ashleigh Solano
James Lancaster
Margaux David
Changqing Wang
Shian Su
Camilla Pasquali
Kathleen Zeglinski
Reza Ghamsari
Manveer Chauhan
Josie Gleeson
Yair D. J. Prawer
Jin Ng
Bénédicte Dubois
Isabelle Cleynen
Marie-Liesse Asselin-Labat
Kate D. Sutherland
Michael B. Clark
Quentin Gouil
Matthew E. Ritchie

Abstract

Long-read RNA sequencing enables full-length transcript profiling and improved isoform resolution, but variable platforms and evolving chemistries demand careful benchmarking for reliable application. We present LongBench, a matched, multi-platform reference dataset spanning bulk, single-cell, and single-nucleus transcriptomics across eight human lung cancer cell lines with synthetic spike-in controls. LongBench incorporates three state-of-the-art long-read protocols alongside Illumina short reads: Oxford Nanopore Technologies (ONT) PCR-cDNA, ONT direct RNA, and PacBio Kinnex. We systematically evaluate transcript capture, quantification accuracy, differential expression, isoform usage, variant detection, and allele-specific analyses. Our results show high concordance in gene-level differential analyses across protocols, but reduced consistency for transcript-level and isoform analyses due to length- and platform-dependent biases. Single-cell long-read data are highly concordant with bulk data for high-confidence features, though single-nucleus data show reduced feature detection. LongBench provides one of the largest publicly available long-read benchmarking resources, enabling rigorous cross-platform evaluation and guiding technology selection for transcriptomic research.

Version published to 10.1101/2025.09.11.675724 on bioRxiv
Sep 12, 2025

Related articles

Optimizing bioinformatic workflows to extract clinically usable gene expression data from targeted RNA sequencing panels: comparison with total RNAseq

This article has 12 authors:
1. Xiaokang Pan
2. Ashley Patton
3. Yi Seok Chang
4. Ryan Stevens
5. Nehad Mohamed
6. Matthew Hunt
7. Daniel Chappell
8. Yan Hu
9. Cecelia Miller
10. Weiqiang Zhao
11. Matthew Avenarius
12. Dan Jones
This article has no evaluations
Latest version Feb 3, 2026

DoseH-seq: A single-cell multiome platform to decode gene-dosage logic driving developmental reversion and cell fate reprogramming

This article has 25 authors:
1. Ying Yang
2. Ralph Patrick
3. Xiaoli Chen
4. Stacey Anderson
5. Jingyu Zhang
6. Yifei Huang
7. Mohammadhossein Esmaeili
8. Kanupriya Tiwari
9. Shivangi Wani
10. Monisha Ganesan
11. Hsin-Yi Chou
12. Dominique Power
13. Cassy M Spiller
14. Sas Loganathan
15. Solal Chauquet
16. Michael Piper
17. Majid Alhomrani
18. Walaa Alsanie
19. Sonia Shah
20. Josephine Bowles
21. Jessica C Mar
22. Shyuan T Ngo
23. Melanie D White
24. Marina Naval-Sanchez
25. Christian M Nefzger
This article has no evaluations
Latest version Dec 23, 2025

Cell-type-specific transcriptomic signatures associated with Alzheimer’s disease in the ROSMAP cohort: a single-nucleus RNA-seq pseudobulk analysis.

This article has 1 author:
1. Jose Israel Nadal Vidal
This article has no evaluations
Latest version Jan 6, 2026
