FADVI: disentangled representation learning for robust integration of single-cell and spatial omics data
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Integrating single-cell and spatial omics data remains challenging due to strong batch effects across experiments and platforms. Existing methods focus on minimizing these effects but cannot disentangle technical variation from true biological signals. Here, we present FADVI, a variational autoencoder framework partitioning the latent space into batch-specific, label-related, and residual subspaces. By combining supervised classification, adversarial training, and cross-covariance penalty, FADVI enforces independent representations that preserve biological variation while correcting batch effects. Benchmarking across scRNA-seq, scATAC-seq, and high-resolution spatial transcriptomics datasets, FADVI consistently outperformed state-of-the-art integration methods. FADVI also enables feature attribution, revealing genes associated with cell type identity and batch variation. Together, these results demonstrate that FADVI provides robust, interpretable integration for large-scale single-cell and spatial omics data, offering a powerful framework for downstream analysis and discovery.