CMDdemux: an efficient single cell demultiplexing method
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Multiplexing technologies label cells with molecular tags, allowing cells from different donors to be pooled together for sequencing. Although this approach enhances cell throughput, eliminates batch effects, and enables doublet detection, limitations of hashtag-based labelling can still lead to low-quality data. Existing demultiplexing methods can accurately assign donor identities in high-quality datasets, but they often fail on low-quality data. To address this, we developed CMDdemux, a method comprising three key steps: within-cell centered log-ratio (CLR) normalization of hashtag count data, K-medoids clustering, and classification of cells based on Mahalanobis distance. By integrating both hashing and mRNA data, CMDdemux achieves high accuracy in distinguishing singlets, doublets, and negatives. It also provides visualization tools to help users inspect potentially misclassified droplets. We benchmarked CMDdemux against existing methods using a range of high- and low-quality datasets. Results show that CMDdemux consistently outperforms other approaches, demonstrating robust performance on both high- and low-quality data where other methods fail. CMDdemux is particularly effective in handling diverse types of low-quality multiplexing data across different multiplexing technologies.