Expanding and improving analyses of nucleotide recoding RNA-seq experiments with the EZbakR suite

Isaac W. Vock
Justin W. Mabin
Martin Machyna
Alexandra Zhang
J. Robert Hogg
Matthew D. Simon

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Nucleotide recoding RNA sequencing methods (NR-seq; TimeLapse-seq, SLAM-seq, TUC-seq, etc.) are powerful approaches for assaying transcript population dynamics. In addition, these methods have been extended to probe a host of regulated steps in the RNA life cycle. Current bioinformatic tools significantly constrain analyses of NR-seq data. To address this limitation, we developed EZbakR ( https://github.com/isaacvock/EZbakR ), an R package to facilitate a more comprehensive set of NR-seq analyses, and fastq2EZbakR ( https://github.com/isaacvock/fastq2EZbakR ), a Snakemake pipeline for flexible preprocessing of NR-seq datasets, collectively referred to as the EZbakR suite. Together, these tools generalize many aspects of the NR-seq analysis workflow. The fastq2EZbakR pipeline can assign reads to a diverse set of genomic features (e.g., genes, exons, splice junctions), and EZbakR can perform analyses on any combination of these features. EZbakR extends standard NR-seq mutational modeling to support multi-label analyses (e.g., s ⁴ U and s ⁶ G dual labeling), and implements an improved hierarchical model to better account for transcript-to-transcript variance in metabolic label incorporation. EZbakR also generalizes dynamical systems modeling of NR-seq data to support analyses of premature mRNA processing and flow between subcellular compartments. Finally, EZbakR implements flexible and well-powered comparative analyses of all estimated parameters via design matrix-specified generalized linear modeling. The EZbakR suite will thus allow researchers to make full, effective use of NR-seq data.

Version published to 10.1371/journal.pcbi.1013179
Jul 3, 2025
Version published to 10.1101/2024.10.14.617411 on bioRxiv
Oct 17, 2024

Discuss this preprint

Listed in

Abstract

Article activity feed