Comprehensive Transcriptome Quality Assessment Using CATS: Reference-free and Reference-based Approaches

Kristian Bodulić
Kristian Vlahoviček

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Accurate assessment of transcriptome assembly quality is critical to ensure the reliability of subsequent transcriptomic analyses. We present CATS (Comprehensive Assessment of Transcript Sequences), a tool offering both reference-free (CATS-rf) and reference-based (CATS-rb) transcriptome quality evaluation pipelines. CATS-rf maps RNA-seq reads back to the assembled transcripts and computes four interpretable scoring components that capture common assembly errors. CATS-rb assesses transcriptome completeness via alignment to a reference genome, supporting both annotation-free and annotation-based scoring. We benchmarked CATS on 672 transcriptomes from simulated and public RNA-seq data. CATS-rf outperformed existing tools in both transcript-level accuracy assessment and demonstrated high sensitivity to diverse assembly error types. CATS-rb produced robust transcriptome completeness estimates even without external annotation, with its scoring metrics strongly reflecting assembly quality. These results highlight CATS as an accurate, interpretable, and broadly applicable framework for evaluating transcriptome assemblies.

Version published to 10.1101/2025.07.22.666112 on bioRxiv
Jul 25, 2025

Benchmarking RNA-seq Tools for Real-World Diagnostic Applications

This article has 15 authors:
1. Sarah Silverstein
2. Kaushik Ganapathy
3. Sandra Donkervoort
4. Veronique Bolduc
5. Ying Hu
6. Justin Moy
7. Prech Uapinyoying
8. Svetlana Gorokhova
9. Vijay Ganesh
10. Ben Weisburd
11. Rotem OrBach
12. A. Reghan Foley
13. Pejman Mohammadi
14. David Adams
15. Carsten Bonnemann
This article has no evaluationsLatest version Jan 29, 2026
Optimizing bioinformatic workflows to extract clinically usable gene expression data from targeted RNA sequencing panels: comparison with total RNAseq

This article has 12 authors:
1. Xiaokang Pan
2. Ashley Patton
3. Yi Seok Chang
4. Ryan Stevens
5. Nehad Mohamed
6. Matthew Hunt
7. Daniel Chappell
8. Yan Hu
9. Cecelia Miller
10. Weiqiang Zhao
11. Matthew Avenarius
12. Dan Jones
This article has no evaluationsLatest version Feb 3, 2026
A Benchmarking Framework to Catalyze Individual Human Genome Projects

This article has 3 authors:
1. Manjushri kalpande
2. Apoorva Ganesh
3. Subhashini Srinivasan
This article has no evaluationsLatest version Dec 17, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Benchmarking RNA-seq Tools for Real-World Diagnostic Applications

Optimizing bioinformatic workflows to extract clinically usable gene expression data from targeted RNA sequencing panels: comparison with total RNAseq

A Benchmarking Framework to Catalyze Individual Human Genome Projects