Single-cell disentangled representations for perturbation modeling and treatment effect estimation
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Dissecting cell-state-specific changes in gene regulation following perturbations is crucial for understanding biological mechanisms. However, single-cell sequencing provides only unmatched snapshots of cells under different conditions. This destructive measurement process hinders the estimation of individualized treatment effects (ITEs), which are essential for pinpointing these heterogeneous mechanistic responses. We present scDRP, a generative framework that lever-ages disentangled representation learning to separate perturbation-dependent and perturbation-independent latent variables via a sparsity regularized β -VAE. Assuming quantile-preserving effects of perturbations conditional on confounders, scDRP performs conditional optimal transport in the latent space to infer counterfactual states and estimate ITEs. Applied to simulated and real single-cell perturbation datasets, scDRP accurately estimates treatment effects and individual counterfactual responses, revealing cell type-specific functional gene module dynamics. Specifically, it captures distinct cellular patterns under rhinovirus and cigarette-smoke extract exposures, reveals heterogeneous responses to interferon stimulation across diverse immune cell types and identified distinct functional module activation in chronic myeloid leukemia cells following CRISPR knockouts targeting different genes. scDRP also generalizes to unseen perturbation doses and combinations. Our framework provides a principled computational approach to elucidate heterogeneous causal relationships from single-cell perturbation data, promoting to a deeper understanding of cellular and molecular mechanisms.