MAssively-Parallel Flow cytometry Xplorer (MAPFX): A Toolbox for Analysing Data from the Massively-Parallel Cytometry Experiments

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Massively-Parallel Cytometry (MPC) experiments allow cost-effective quantification of more than 200 surface proteins at single-cell resolution. The Infinity Flow (Inflow) analysis protocol was developed to measure highly informative protein ‘ backbone ’ markers on all cells in all wells distributed across three 96-well plates, along with well-specific exploratory protein ‘ infinity ’ markers. Backbone markers can be used to impute the infinity markers on cells in all other wells using machine learning methods. This protocol offers unprecedented opportunities for more comprehensive classification of cell types. However, some aspects of the protocol can be improved, including methods for background correction and removal of unwanted variation. Here, we propose MAPFX as an end-to-end toolbox that carefully pre-processes the raw data from MPC experiments, and further imputes the ‘missing’ infinity markers in the wells without those measurements. Our pipeline starts by performing background correction on raw intensities to remove the noise from electronic baseline restoration and fluorescence compensation by adapting a normal-exponential convolution model. Unwanted technical variation, from sources such as well effects, is then removed using a log-normal model with plate, column, and row factors, after which infinity markers are imputed using the informative backbone markers as predictors. The completed dataset can then be used for clustering and other statistical analyses. Unique features of our approach include performing background correction prior to imputation and removing unwanted variation from the data at the cell-level, while explicitly accounting for the potential association between biology and unwanted factors. We benchmark our pipeline against alternative pipelines and demonstrate that our approach is better at preserving biological signals, removing unwanted variation, and imputing unmeasured infinity markers.

Article activity feed