Enhancing transcriptome expression quantification through accurate assignment of long RNA sequencing reads with TranSigner

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Recently developed long-read RNA sequencing technologies promise to provide a more accurate and comprehensive view of transcriptomes compared to short-read sequencers, primarily due to their capability to achieve full-length sequencing of transcripts. However, realizing this potential requires computational tools tailored to process long reads, which exhibit a higher error rate than short reads. Existing methods for assembling and quantifying long-read data often disagree on expressed transcripts and their abundance levels, leading researchers to lack confidence in the transcriptomes produced using this data. One approach to address the uncertainties in transcriptome assembly and quantification is by assigning reads to transcripts, enabling a more detailed characterization of transcript support at the read level. Here, we introduce TranSigner, a versatile tool that assigns long reads to any input transcriptome. TranSigner consists of three consecutive modules performing: read alignment to the given transcripts, transcripts, computation of compatibility scores based on alignment scores and positions, and execution of an expectation-maximization algorithm to probabilistically assign read fractions to transcripts while estimating transcript abundances. Using simulated and experimental datasets from three well studied organisms — Homo Sapiens, Arabidopsis thaliana and Mus musculus — we demonstrate that TranSigner achieves higher accuracy in transcript abundance estimation and read assignment compared to existing tools.

Article activity feed