LoRTIA Plus: a chemistry-agnostic, feature-first software package for long-read transcriptome annotation

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Long-read RNA sequencing (lrRNA-seq) enables direct reconstruction of full-length transcripts, yet existing annotation tools show variable performance across genomes and library chemistries, particularly for novel isoforms. We present LoRTIA Plus, a chemistry-agnostic suite for transcriptome annotation and reconstruction from lrRNA-seq data. LoRTIA Plus first detects and filters transcription start sites (TSSs), transcription end sites (TESs), and introns using adapter-aware and quality-based criteria, and evaluates read support before assembling high-confidence transcript models. We benchmarked LoRTIA Plus against bambu, FLAIR, IsoQuant, and NAGATA on KSHV transcriptomes with dense overlap, using a validated literature-supported boundary set, and on transcriptomes from three human cell lines from the Long Read RNA-seq Genome Annotation Assessment Project (LRGASP) sequenced with five long-read chemistries. On KSHV, LoRTIA Plus achieved the highest F1 scores for TSSs, TESs, and transcripts in both direct-cDNA and direct-RNA datasets by improving recall without sacrificing precision. Across human datasets, LoRTIA Plus consistently ranked among the top boundary annotators across all chemistries and was the best-performing tool in PCR-based libraries, while remaining highly competitive on native RNA. Junction- and isoform-level analyses show that LoRTIA Plus yields a rich, reproducible repertoire of novel isoforms and transcript boundaries from viral to human transcriptomes.

Article activity feed