DiffusionST: A deep generative diffusion model-based framework for enhancing spatial transcriptomics data quality and identifying spatial domains

Yaxuan Cui
Yang Cui
Ruheng Wang
Zheyong Zhu
Xin Zeng
Kenta Nakai
Feifei Cui
Zilong Zhang
Hua Shi
Yan Chen
Xiucai Ye
Tetsuya Sakurai
Leyi Wei

Abstract

Recent advances in spatial transcriptomics technology have generated substantial volumes of spatial transcriptome data. However, the quality of these data is often compromised by the limitations of current sequencing technologies. To address this issue, we propose DiffusionST, a method for imputing spatial transcriptomics data and clustering the imputed data. The method combines a graph convolutional network (GCN) with a newly designed loss function, denoises the data using the zero-inflated negative binomial (ZINB) distribution, and enhances the data with a diffusion model to improve clustering accuracy. DiffusionST demonstrates superior clustering accuracy compared with six of the most popular spatial transcriptomics clustering algorithms, and it also excels in data imputation relative to five single-cell RNA sequencing (scRNA-seq) imputation algorithms. Additionally, DiffusionST’s robustness to noise is quantitatively validated by manually introducing random dropout noise into the dataset, where the model significantly enhances the quality of the spatial transcriptomic data. Moreover, DiffusionST is well suited to high-resolution spatial transcriptomics data and has been shown, through survival analysis and cell-cell communication studies, to dissect spatial domains within breast cancer tissues. These findings provide strong evidence of DiffusionST’s efficacy in handling spatial transcriptomic data, especially data with strong noise, making it a valuable tool in this field.
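
The robustness check mentioned in the abstract, manually introducing random dropout noise, can be illustrated with a small sketch. The abstract does not describe the exact procedure, so everything here is an assumption made for illustration: the function name `add_random_dropout`, the `rate` parameter, the choice to zero out a fixed fraction of the nonzero entries, and the toy Poisson count matrix. It is not the authors' implementation.

```python
import numpy as np


def add_random_dropout(counts, rate, seed=0):
    """Zero out a random fraction `rate` of the nonzero entries.

    Hypothetical helper: one plausible way to simulate dropout noise in a
    spots-by-genes count matrix. Returns the noisy copy and a boolean mask
    marking the entries that were dropped.
    """
    rng = np.random.default_rng(seed)
    noisy = counts.copy()

    # Only observed (nonzero) entries can be dropped.
    rows, cols = np.nonzero(counts)
    n_drop = int(round(rate * rows.size))

    # Sample without replacement so exactly n_drop entries are zeroed.
    picked = rng.choice(rows.size, size=n_drop, replace=False)
    noisy[rows[picked], cols[picked]] = 0

    mask = np.zeros(counts.shape, dtype=bool)
    mask[rows[picked], cols[picked]] = True
    return noisy, mask


# Toy data (assumed): 50 spots x 20 genes with Poisson-distributed counts.
toy_rng = np.random.default_rng(42)
counts = toy_rng.poisson(lam=2.0, size=(50, 20))

noisy, mask = add_random_dropout(counts, rate=0.3)
```

The returned mask records which entries were removed, so an imputation method can be scored by comparing its predictions at the masked positions with the held-out original values.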

Version published to 10.1101/2025.06.12.659243 on bioRxiv
Jun 17, 2025
