FADVI: disentangled representation learning for robust integration of single-cell and spatial omics data

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Integrating single-cell and spatial omics data remains challenging due to strong batch effects across experiments and platforms. Existing methods focus on minimizing these effects but cannot disentangle technical variation from true biological signals. Here, we present FADVI, a variational autoencoder framework partitioning the latent space into batch-specific, label-related, and residual subspaces. By combining supervised classification, adversarial training, and cross-covariance penalty, FADVI enforces independent representations that preserve biological variation while correcting batch effects. Benchmarking across scRNA-seq, scATAC-seq, and high-resolution spatial transcriptomics datasets, FADVI consistently outperformed state-of-the-art integration methods. FADVI also enables feature attribution, revealing genes associated with cell type identity and batch variation. Together, these results demonstrate that FADVI provides robust, interpretable integration for large-scale single-cell and spatial omics data, offering a powerful framework for downstream analysis and discovery.

Article activity feed