PILOT-GM-VAE: Patient-Level Analysis of single cell Disease Atlas with Optimal Transport of Gaussian Mixture Variational Autoencoders

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Motivation

The analysis of single cell disease atlases represents a challenge due to the presence of batch effects, low quality of disease samples, and the multi-scale nature of the data, i.e., samples are described by different cell distributions. Because of these, few computational approaches are performing sample-level disease progression analysis so far.

Results

Here, we introduce Patient-Level Analysis with Optimal Transport based on Gaussian Mixture Variational Autoencoders (PILOT-GM-VAE). PILOT-GM-VAE explores the power of GM-VAE to estimate models describing complex single cell distributions through efficient optimal transport algorithms for estimating the distance between Gaussian Mixtures. Extensive benchmarking on several single cell disease atlases and competing approaches demonstrates the performance of PILOT-GM-VAE in sample-level clustering, sample-level trajectory inference, and batch correction tasks. Moreover, we performed a case study on a breast cancer disease atlas, where PILOT-GM-VAE highlighted cellular and molecular changes associated with breast cancer disease progression.

Availability

The software, code, and data for benchmarking are available at https://github.com/CostaLab/PILOT-GM-VAE/tree/main

Article activity feed