PILOT-GM-VAE: Patient-Level Analysis of single cell Disease Atlas with Optimal Transport of Gaussian Mixture Variational Autoencoders
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Motivation
The analysis of single cell disease atlases represents a challenge due to the presence of batch effects, low quality of disease samples, and the multi-scale nature of the data, i.e., samples are described by different cell distributions. Because of these, few computational approaches are performing sample-level disease progression analysis so far.
Results
Here, we introduce Patient-Level Analysis with Optimal Transport based on Gaussian Mixture Variational Autoencoders (PILOT-GM-VAE). PILOT-GM-VAE explores the power of GM-VAE to estimate models describing complex single cell distributions through efficient optimal transport algorithms for estimating the distance between Gaussian Mixtures. Extensive benchmarking on several single cell disease atlases and competing approaches demonstrates the performance of PILOT-GM-VAE in sample-level clustering, sample-level trajectory inference, and batch correction tasks. Moreover, we performed a case study on a breast cancer disease atlas, where PILOT-GM-VAE highlighted cellular and molecular changes associated with breast cancer disease progression.
Availability
The software, code, and data for benchmarking are available at https://github.com/CostaLab/PILOT-GM-VAE/tree/main