Biologically Guided Variational Inference for Interpretable Multimodal Single-Cell Integration and Mechanistic Discovery

Lucas Arnoldt
Julius Upmeier zu Belzen
Luis Herrmann
Khue Nguyen
Fabian Theis
Benjamin Wild
Roland Eils

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Multi-omics technologies allow for a detailed characterization of cell types and states across multiple omics layers, helping to identify features that differentiate biological conditions, such as chemical or CRISPR-based perturbations. However, current tools employing variational inference on single-cell datasets, including methods for paired and mosaic integration, transfer learning, and modality imputation, typically act as black boxes. This lack of interpretability makes it challenging to evaluate whether biological variation is preserved, which can compromise downstream analyses. Here, we introduce NetworkVI, a sparse deep generative model designed for the integration and interpretation of multimodal single-cell data. NetworkVI utilizes biological prior knowledge as an inductive bias, specifically it relies on gene-gene interactions inferred from topologically associated domains and structured ontologies like the Gene Ontology to aggregate gene embeddings to cell embeddings, enhancing the interpretability at the gene and subcellular level. While achieving state-of-the-art data integration, modality imputation, and cell label transfer via query-to-reference mapping benchmarks across bimodal and trimodal datasets, NetworkVI additionally excels in providing biologically meaningful modality- and cell type-specific interpretations. NetworkVI aids researchers in identifying associations between genes and biological processes and uncovers immune evasion mechanisms in a Perturb CITE-seq dataset of melanoma cells. NetworkVI will support researchers in interpreting cellular disease mechanisms, guiding biomarker discovery, and ultimately aiding the development of targeted therapies in large-scale single-cell multimodal atlases.

NetworkVI is available at http://github.com/LArnoldt/networkVI .

Version published to 10.1101/2025.06.10.657924 on bioRxiv
Jun 12, 2025

Accurate, scalable, and unified single-cell atlas integration with scBIOT

This article has 2 authors:
1. Haihui Zhang
2. Peiwu Qin
This article has no evaluationsLatest version Jan 19, 2026
Discovering cell types and states from reference atlases with heterogeneous single-cell ATAC-seq features

This article has 2 authors:
1. Xiuwei Zhang
2. Yuqi Cheng
This article has no evaluationsLatest version Dec 10, 2025
Deep Learning Architectures for Multi-Omics Data Integration: Bridging Biomarker Discovery and Clinical Translation

This article has 2 authors:
1. Akshay Krishnan Pushparaj
2. Malarmathi Muthukumar
This article has no evaluationsLatest version Jan 26, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Accurate, scalable, and unified single-cell atlas integration with scBIOT

Discovering cell types and states from reference atlases with heterogeneous single-cell ATAC-seq features

Deep Learning Architectures for Multi-Omics Data Integration: Bridging Biomarker Discovery and Clinical Translation