Interpretable machine learning enables de novo mapping of cell type-specific RNA splicing regulation from scRNA-seq data

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Context-dependent regulation of alternative splicing (AS), largely mediated by RNA-binding proteins (RBPs), is a key post-transcriptional mechanism shaping diverse biological processes. Experimental approaches for probing splicing regulation, such as CLIP-based assays and RBP perturbations, suffer from low throughput, poor physiological relevance, and bulk resolution that overlooks cellular heterogeneity. Here, we present CASREL (cell-specific alternative splicing regulation inference via explainable learning), a machine learning framework that reconstructs RBP-AS regulatory circuitry directly from single-cell RNA sequencing data without reliance on prior RBP-RNA binding annotations. CASREL integrates ensemble learning with SHAP-based interpretation to uncover RBP-AS associations, leveraging unique features of single-cell splicing profiles, including polarized isoform usage, minimal averaging of regulatory programs, and resilience to expression noise. Applications across diverse tissues and cell types demonstrate its robustness, accuracy, and biological relevance, establishing CASREL as a powerful method for de novo mapping of cell-specific RNA splicing regulation in physiological contexts.

Article activity feed