Cleanet: robust doublet detection in cytometry data based on protein expression patterns

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Flow and mass cytometry experiments are essential for profiling immune cells at single cell resolution. Better understanding of human immunology increasingly involves analyzing studies at the scale of hundreds or thousands of samples, with data analysis a significant bottleneck. This trend increases the demand for automated analysis methods. In particular, a common preprocessing step in cytometry data analysis is distinguishing single cells from doublets (or multiplets), events in which two (or more) cells pass simultaneously through the detector. Typically, doublets are identified on two-dimensional density plots, using their high measured values for DNA intercalators (mass cytometry) or scattering channels (flow cytometry). Despite its popularity, this bivariate gating method is sometimes imprecise: for example, we show that bivariate gating of mass cytometry data can mistake single eosinophils for doublets, due to their high DNA content. Taking inspiration from methods already used in single cell transcriptomics, but not in the cytometry community, we propose an alternative approach. Our method, called Cleanet, first simulates doublet events, then identifies true events with protein expression similar to the simulated doublets. This simple method is completely automated and detects both homotypic and heterotypic doublets. We validate it in datasets acquired with mass and flow cytometry; moreover, we verify with imaging flow cytometry that events predicted to be doublets truly consist of multiple cells. Cleanet can also classify doublets based on their component cell types, which potentially enables the study of cell-cell interactions, mining extra information out of doublet events that would otherwise be discarded. As a proof of concept, we demonstrate that Cleanet can detect a treatment-specific increase in interactions between two cell lines. By automating doublet detection and classification, we aim to streamline the data analysis in large cytometry studies and provide a more accurate picture of both immune cell populations and cell-cell interactions.

Article activity feed