Full-length isoform constructor (FLIC) – a tool for isoform discovery based on long reads

Alexandra M. Kasianova
Anna V. Klepikova
Oleg A. Gusev
Guzel R. Gazizova
Maria D. Logacheva
Aleksey A. Penin

Abstract

Advances in high-throughput sequencing have revealed the complexity of the transcriptome landscape in eukaryotes. An inherent part of this complexity is the presence of multiple isoforms generated by alternative splicing and by the use of alternative transcription start sites (TSSs) and polyadenylation sites. However, currently available tools have limited capacity to infer full-length isoforms. To address this problem, we developed a new pipeline, FLIC (Full-Length Isoform Constructor). FLIC is based on long-read transcriptome data and integrates three key features: 1) using concordance between biological replicates to filter out noise and artifacts; 2) employing peak calling to precisely identify transcription start and polyadenylation sites; 3) enabling robust isoform reconstruction with minimal reliance on existing annotations. We evaluated FLIC using a dedicated set of real and simulated Arabidopsis thaliana cDNA sequencing data. The results demonstrate that FLIC accurately reconstructs known and novel isoforms and outperforms existing tools, especially in the absence of reference annotations. A direct comparison with CAGE, currently regarded as the gold standard for TSS identification, shows that FLIC is equally accurate while being much less time-consuming. Thus, FLIC provides a valuable tool for comprehensive transcript characterization, particularly for non-model organisms or when dealing with incomplete or inaccurate annotations.
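
To make the first two features more concrete, the following is a minimal Python sketch of how peak calling on read start positions might be combined with a replicate-concordance filter. It is an illustration under stated assumptions, not FLIC's actual implementation: the function names (`call_peaks`, `concordant_peaks`), the parameters (`min_reads`, `window`, `tolerance`, `min_replicates`), and the toy coordinates are all hypothetical, and a simple local-maximum rule stands in for whatever peak caller the pipeline really uses.

```python
from collections import Counter


def call_peaks(starts, min_reads=3, window=10):
    """Return positions that are local maxima of read-start counts.

    Hypothetical stand-in for a peak caller: a position is a peak if it has
    at least `min_reads` read starts and no position within +/- `window` has
    more (ties go to the leftmost position).
    """
    counts = Counter(starts)
    peaks = []
    for pos, n in sorted(counts.items()):
        if n < min_reads:
            continue
        neighbourhood = [
            (counts[p], -p)
            for p in range(pos - window, pos + window + 1)
            if p in counts
        ]
        if max(neighbourhood) == (n, -pos):
            peaks.append(pos)
    return peaks


def concordant_peaks(replicates, min_replicates=2, tolerance=5, **kwargs):
    """Keep peaks supported by at least `min_replicates` replicates.

    Two peaks count as the same site if they lie within `tolerance` bases.
    The first peak seen for a site is kept as its representative.
    """
    per_rep = [call_peaks(r, **kwargs) for r in replicates]
    kept = []
    for peaks in per_rep:
        for pos in peaks:
            support = sum(
                any(abs(pos - q) <= tolerance for q in other)
                for other in per_rep
            )
            is_new = all(abs(pos - k) > tolerance for k in kept)
            if support >= min_replicates and is_new:
                kept.append(pos)
    return sorted(kept)


# Toy read 5' end coordinates for three replicates (made-up numbers).
rep1 = [100] * 5 + [102] * 2 + [500] * 4
rep2 = [101] * 6 + [900] * 3
rep3 = [100] * 4 + [500] * 1

print(concordant_peaks([rep1, rep2, rep3]))  # [100]
```

In this toy input, the site near position 100 is called in all three replicates and is retained, while the peaks at 500 and 900 each appear in only one replicate and are discarded as likely noise. The same idea would apply to read 3' ends for polyadenylation sites.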

Version published to 10.1101/2025.05.27.656444 on bioRxiv
May 31, 2025
