scCoBench: Benchmarking single cell RNA-seq co-expression using promoter-reporter lines

Tran N. Chau
Kook Hui Ryu
Razan Alajoleen
Bastiaan O. R. Bargmann
John Schiefelbein
Song Li

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Single-cell RNA sequencing (scRNA-seq) has become a powerful tool for uncovering transcriptomic heterogeneity and reconstructing gene regulatory networks in complex tissues. However, the sparsity, high noise levels, and dropout events inherent to scRNA-seq data pose challenges for accurate inference of gene-gene relationships. In this study scCoBench, we systematically benchmark correlation metrics, pseudo bulk analysis, and imputation methods using promoter-reporter and native gene pairs as internal controls to evaluate the performance of ten widely used gene-gene co-expression measurements. Interestingly, we found that commonly used data scaling and normalization approaches lead to lower correlation between promoter reporter and native gene pairs in most of the co-expression methods. Moreover, we assess the impact of five popular imputation techniques, including scImpute, SAVER, Autoencoder (AE), Variational Autoencoder (VAE), and Generative Adversarial Network (GAN) on recovering biologically relevant co-expression patterns. Our results demonstrate that imputation models not only markedly enhance correlation between each promoter-reporter and native gene pair but also increase the number of cells co-expressing both genes. Imputation also improved transcription factor target gene correlations and revealed stronger associations among genes within the same protein complex. This work highlights the utility of promoter-reporter systems for benchmarking computational methods and underscores the potential of deep learning-based imputation to improve the biologically relevant signals of scRNA-seq data.

Version published to 10.1101/2025.05.26.656221 on bioRxiv
May 30, 2025

DoseH-seq: A single-cell multiome platform to decode gene-dosage logic driving developmental reversion and cell fate reprogramming

This article has 25 authors:
1. Ying Yang
2. Ralph Patrick
3. Xiaoli Chen
4. Stacey Anderson
5. Jingyu Zhang
6. Yifei Huang
7. Mohammadhossein Esmaeili
8. Kanupriya Tiwari
9. Shivangi Wani
10. Monisha Ganesan
11. Hsin-Yi Chou
12. Dominique Power
13. Cassy M Spiller
14. Sas Loganathan
15. Solal Chauquet
16. Michael Piper
17. Majid Alhomrani
18. Walaa Alsanie
19. Sonia Shah
20. Josephine Bowles
21. Jessica C Mar
22. Shyuan T Ngo
23. Melanie D White
24. Marina Naval-Sanchez
25. Christian M Nefzger
This article has no evaluationsLatest version Dec 23, 2025
Understanding Pathways in Bioinformatics, Genomics, and Health Applications

This article has 1 author:
1. Diptarup Mallick
This article has no evaluationsLatest version Jan 19, 2026
Self-supervised Graph Contrastive Learning for scRNA-seq Clustering

This article has 1 author:
1. Tong Wu
This article has no evaluationsLatest version Dec 11, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

DoseH-seq: A single-cell multiome platform to decode gene-dosage logic driving developmental reversion and cell fate reprogramming

Understanding Pathways in Bioinformatics, Genomics, and Health Applications

Self-supervised Graph Contrastive Learning for scRNA-seq Clustering