scRGP: Prediction of Single-cell Genetic Perturbation Transcriptional Responses based on Rank in Multiple Scenarios
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Single-cell perturbation sequencing technologies (e.g., Perturb-seq, CROP-seq), which integrate CRISPR-based gene editing with single-cell transcriptome profiling, have revolutionized the analysis of transcriptomic changes induced by genetic perturbations at single-cell resolution. These technologies serve as a powerful tool for identifying key genes that inhibit tumor growth or reverse cancer cell phenotypes. However, they face two major challenges: data explosion with high experimental costs, and data complexity characterized by high dimensionality, noise, sparsity, and heterogeneity. To address these challenges, we developed the single-cell Rank-based Genetic Perturbation predictor (scRGP), the first deep learning framework leveraging gene expression rank-order information for this task. scRGP demonstrates superior performance in terms of robustness, cross-cell-line perturbation prediction, and high-throughput screening. Specifically, scRGP achieves an approximately 10-16 percentage points improvement in Pearson correlation coefficient (PCC) over state-of-the-art methods (e.g., GEARS and scFoundation) for single- and double-gene perturbation predictions, while also extending prediction capability to triple-gene perturbations. Furthermore, it outperforms these methods by approximately 5-9 percentage points in cross-cell-line predictions. These advancements promise to shift the paradigm of single-cell perturbation studies from experiment-driven to computation-driven approaches, providing new support for functional genomics and precision medicine.