Determining sufficient sequencing depth in RNA-Seq differential expression studies

Andrew J. Bass
David G. Robinson
John D. Storey

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

@cwarden45's saved articles (cwarden45)

Abstract

RNA-Seq studies require a sufficient read depth to detect biologically important genes. Sequencing below this threshold will reduce statistical power while sequencing above will provide only marginal improvements in power and incur unnecessary sequencing costs. Although existing methodologies can help assess whether there is sufficient read depth, they are unable to guide how many additional reads should be sequenced to reach this threshold. We provide a new method called superSeq that models the relationship between statistical power and read depth. We apply the superSeq framework to 393 RNA-Seq experiments (1,021 total contrasts) in the Expression Atlas and find the model accurately predicts the increase in statistical power gained by increasing the read depth. Based on our analysis, we find that most published studies (> 70%) are undersequenced, i.e., their statistical power can be improved by increasing the sequencing read depth. In addition, the extent of saturation is highly dependent on statistical methodology: only 9.5%, 29.5%, and 26.6% of contrasts are saturated when using DESeq2 , edgeR , and limma , respectively. Finally, we also find that there is no clear minimum per-transcript read depth to guarantee saturation for an entire technology. Therefore, our framework not only delineates key differences among methods and their impact on determining saturation, but will also be needed even as technology improves and the read depth of experiments increases. Researchers can thus use superSeq to calculate the read depth to achieve required statistical power while avoiding unnecessary sequencing costs.

Version published to 10.1101/635623 on bioRxiv
May 13, 2019

Single-cell RNA-seq using UltraMarathonRT expands the known transcriptome

This article has 8 authors:
1. Chia-Ling Chou
2. Anastasiya Grinko
3. Li-Tao Guo
4. Alexander M. Leipold
5. Teresa Rummel
6. Florian Erhard
7. Anna Marie Pyle
8. Antoine-Emmanuel Saliba
This article has no evaluationsLatest version Oct 6, 2025
Q40 sequencing reduces costs and enhances detection of low- frequency somatic variants

This article has 12 authors:
1. Shumeng Duan
2. Yaqing Liu
3. Xiaorou Guo
4. Zhiyin An
5. Ruiwen Ma
6. Qiaochu Chen
7. Yanming Xie
8. Qingwang Chen
9. Ying Yu
10. Lianhua Dong
11. Leming Shi
12. Yuanting Zheng
This article has no evaluationsLatest version Sep 23, 2025
Gel-free library preparation for next-generation RNA sequencing and small RNA quantification

This article has 4 authors:
1. Peter Dedon
2. Ruixi Chen
3. Lili Liu
4. Bo Cao
This article has no evaluationsLatest version Oct 8, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Single-cell RNA-seq using UltraMarathonRT expands the known transcriptome

Q40 sequencing reduces costs and enhances detection of low- frequency somatic variants

Gel-free library preparation for next-generation RNA sequencing and small RNA quantification