A systematic benchmark of bioinformatics methods for single-cell and spatial RNA-seq Nanopore long-read data

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Alternative splicing plays a crucial role in transcriptomic complexity, yet remains difficult to resolve at the single-cell level due to the limitations of short-read technologies. Coupling single-cell with long-read sequencing offers full-length transcript coverage, enabling more accurate isoform detection. Multiple specialized computational tools tailored for single-cell and spatial long-read transcriptomics have been developed, with diverse strategies. To compare the effectiveness of these approaches, we generated paired short-read and Nanopore long-read single-cell datasets, tailored for benchmarking bioinformatics tools. We evaluated ten state-of-the-art methods, spanning four analytical dimensions: barcodes and UMI detection, demultiplexing and UMI clustering, gene-level expression profiling, and isoform detection and quantification. Using real and simulated datasets across different protocols, sequencing depths and chemistries, we assessed the accuracy, robustness, and scalability of each tool. Our results revealed method-specific trade-offs, and highlight the importance of sequencing quality and UMI correction strategies. This benchmark provides a practical resource for optimizing isoform analysis and accurate gene expression profiling in single-cell and spatial transcriptomics using long-read sequencing. Our benchmarking workflow is designed to be reusable, thereby enabling method developers to compare their own approaches against the set of reference methods evaluated in this work.

Article activity feed