Profiling ranked list enrichment scoring in sparse data elucidates algorithmic tradeoffs

Alexander T. Wenzel
John Jun
Pablo Tamayo
Jill P. Mesirov

Abstract

Gene Set Enrichment Analysis (GSEA) is a method for quantifying pathway and process activation in groups of samples, and its single-sample version (ssGSEA) scores activation using mRNA abundance in a single sample. GSEA and ssGSEA were developed for “bulk” samples profiled with technologies such as microarrays and bulk RNA-sequencing (RNA-seq), rather than for individual cells. The growing use of single-cell RNA-sequencing (scRNA-seq) raises the possibility of using ssGSEA to quantify pathway and process activation in individual cells. However, scRNA-seq data is much sparser than bulk RNA-seq data. Here we show that ssGSEA as designed for bulk data is subject to score uncertainty and other technical issues when applied to individual cells from scRNA-seq data. We also show that ssGSEA can be applied robustly to “pseudobulk” aggregates of a few hundred to a few thousand cells, provided appropriate normalization is used. Finally, in comparing this approach to other ranked list enrichment methods, we find that the UCell method is the most robust to sparsity. We have made the aggregate-cell version of ssGSEA available as a Python package and GenePattern module, and will also modularize UCell for use on GenePattern.
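
To make the pseudobulk idea in the abstract concrete, here is a minimal sketch of aggregating sparse single-cell counts into groups and normalizing the result. It is not the authors' package: the function names (`pseudobulk`, `cpm`, `zero_fraction`), the simulated Poisson counts, the group size of 200 cells, and the choice of counts-per-million as the normalization are all assumptions made for illustration, since the abstract does not say which normalization is appropriate.

```python
import numpy as np

def pseudobulk(counts, groups):
    """Sum raw counts over the cells in each group (hypothetical helper).

    counts: (n_cells, n_genes) array of non-negative counts.
    groups: length-n_cells array of group labels.
    Returns (labels, summed), where summed has shape (n_groups, n_genes).
    """
    counts = np.asarray(counts)
    labels, inverse = np.unique(np.asarray(groups), return_inverse=True)
    summed = np.zeros((labels.size, counts.shape[1]), dtype=np.float64)
    # Accumulate each cell's row into the row of its group.
    np.add.at(summed, inverse, counts)
    return labels, summed

def cpm(mat):
    """Scale each row to counts per million (assumed normalization)."""
    totals = mat.sum(axis=1, keepdims=True)
    totals[totals == 0] = 1.0  # leave all-zero rows as zeros
    return mat / totals * 1e6

def zero_fraction(mat):
    """Fraction of entries that are exactly zero, a simple sparsity measure."""
    return float(np.mean(np.asarray(mat) == 0))

# Simulated sparse data: 600 cells x 50 genes, low Poisson rate (assumption).
rng = np.random.default_rng(0)
counts = rng.poisson(0.1, size=(600, 50))
groups = np.repeat(["a", "b", "c"], 200)

labels, summed = pseudobulk(counts, groups)
normalized = cpm(summed)

print("per-cell zero fraction:  ", zero_fraction(counts))
print("pseudobulk zero fraction:", zero_fraction(summed))
```

Summing before normalizing keeps the aggregate a count-like quantity; the pseudobulk profiles have far fewer zeros than the individual cells, which is the property a rank-based score such as ssGSEA depends on.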

Version published to 10.1101/2024.06.03.597180v1 on bioRxiv
Jun 4, 2024

Related articles

Experimental and Computational Methods for Allelic Imbalance Analysis from Single-Nucleus RNA-seq Data

This article has 24 authors:
1. Sean K Simmons
2. Xian Adiconis
3. Nathan Haywood
4. Jacob Parker
5. Zechuan Lin
6. Zhixiang Liao
7. Idil Tuncali
8. Aziz Al'Khafaji
9. Asa Shin
10. Karthik Jagadeesh
11. Kirk Gosik
12. Michael Gatzen
13. Jonathan T Smith
14. Daniel N El Kodsi
15. Yuliya Kuras
16. Clare Baecher-Allan
17. Geidy E Serrano
18. Thomas G Beach
19. Kiran Garimella
20. Orit Rozenblatt-Rosen
21. Aviv Regev
22. Xianjun Dong
23. Clemens Scherzer
24. Joshua Z Levin
This article has no evaluations
Latest version Aug 16, 2024

pyVIPER: A fast and scalable Python package for rank-based enrichment analysis of single-cell RNASeq data

This article has 9 authors:
1. Alexander L.E. Wang
2. Zizhao Lin
3. Luca Zanella
4. Lukas Vlahos
5. Miquel Anglada Girotto
6. Aziz Zafar
7. Heeju Noh
8. Andrea Califano
9. Alessandro Vasciaveo
This article has no evaluations
Latest version Aug 27, 2024

Interpretable scRNA-seq Analysis with Intelligent Gene Selection

This article has 8 authors:
1. Tianhao Ni
2. Xinyu Zhang
3. Kaixiu Jin
4. Guanxiong Pei
5. Nan Xue
6. Guanao Yan
7. Taihao Li
8. Bingjie Li
This article has no evaluations
Latest version Sep 3, 2024
