AutoRNAseq: Automated Bulk RNA-seq Analysis Pipeline
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Summary
Improved accessibility of high-throughput RNA sequencing has increased the amount of data generated each year. This increase in data creates a need for reproducible pipelines that can process RNA-seq data consistently across experiments. AutoRNAseq addresses this need by providing a Snakemake-based workflow for bulk RNA-seq analysis by automating data retrieval, quality control, and gene quantification. Unlike existing RNA-seq workflows that require users to coordinate multiple pipelines and pre-configure reference data, AutoRNAseq provides a single, end-to-end workflow that automates data acquisition, reference preparation, quality control, alignment, and quantification with minimal user intervention. AutoRNAseq is applicable to any domain requiring consistently processed RNA-seq datasets, including bioinformatics, computational biology, and drug-response studies.
Availability and Implementation
AutoRNAseq is implemented in Snakemake and available at https://gitlab.com/unebraska/lagbh-public/autornaseq . Documentation and example configuration files are provided in the GitLab README file and this paper’s Supplementary Information. The code to reproduce the statistics presented here is in the GitLab repository under the “publication” folder.