Semi-parametric Empirical Bayes Method for Multiplet Detection in snATAC-seq with Probabilistic Multi-omic Integration

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Multiplets, formed when multiple cells are captured in a droplet, produce hybrid molecular profiles that confound single-cell analyses. Detecting multiplets in single-nucleus ATAC-seq (snATAC-seq) data is particularly challenging due to sparsity and overdispersion of chromatin accessibility measurements. We introduce SEBULA, a semi-parametric empirical Bayes model that yields well-calibrated posterior probabilities for multiplet detection, enabling principled false discovery rate control. SEBULA also integrates probabilistic evidence with complementary signals from other modalities, such as scRNA-seq. Benchmarking on simulations and seven annotated trimodal DOGMA-seq datasets demonstrates SEBULA’s superior performance. The open-source software is computationally efficient.

Article activity feed