Systematic evaluation of robustness to cell type mismatch of deconvolution methods for spatial transcriptomics data

Utkarsh M. Mahamune
Aldo Jongejan
Antoine H. C. van Kampen
Lisa G. M. van Baarsen
Perry D. Moerland

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Sequencing-based spatial transcriptomics (ST) approaches preserve spatial information but with limited cellular resolution. Single-cell RNA-sequencing (scRNA-seq) techniques, on the other hand, provide single-cell resolution but lose spatial resolution because of the tissue dissociation step. With these complementary strengths in mind, computational tools have been developed to combine scRNA-seq and ST data. These approaches use deconvolution to identify cell types and their reoctive proportions present at each location in ST data, with the aid of a scRNA-seq reference dataset. It has been suggested that deconvolution methods are sensitive to the absence of cell types in the scRNA-seq reference, a problem referred to as cell type mismatch.

Here, we used extensive simulations to systematically evaluate the robustness to cell type mismatch of six state-of-the-art deconvolution methods tailored for spatially resolved transcriptomics data, along with two deconvolution methods designed for bulk RNA-seq data. At baseline, that is, with no cell types missing from the reference data, cell2location, RCTD, and CARD were the best performing methods, while SPOTlight performed worst. By simulating various cell type mismatch scenarios, we found that the performance of deconvolution methods decreases proportionally to the number of cell types missing from the reference data. Moreover, for most deconvolution methods the decrease in performance is similar relative to their baseline performance. We also observed that those methods that perform well at baseline tend to assign the proportions of a missing cell type to the transcriptionally most similar cell types present in the reference data.

This study highlights the adverse effects of cell type mismatch on the performance of deconvolution methods for ST data and stresses the need for methods that are more robust to this type of mismatch.

Version published to 10.1101/2025.08.12.669903 on bioRxiv
Aug 15, 2025

Microenvironment-aware transcriptome reconstruction in spatial transcriptomics

This article has 7 authors:
1. Shi-Tong Yang
2. Pai Peng
3. Hui-Feng He
4. Meng-Guo Wang
5. Bo-Han Si
6. Xiao-Fei Zhang
7. Luonan Chen
This article has no evaluationsLatest version Jan 13, 2026
Comprehensive benchmarking of RNA velocity methods across single-cell datasets

This article has 6 authors:
1. Jin Liu
2. Yida Wu
3. Chuihan Kong
4. Xu Liao
5. Zhixiang Lin
6. Xiaobo Sun
This article has no evaluationsLatest version Feb 2, 2026
Discovering cell types and states from reference atlases with heterogeneous single-cell ATAC-seq features

This article has 2 authors:
1. Xiuwei Zhang
2. Yuqi Cheng
This article has no evaluationsLatest version Dec 10, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Microenvironment-aware transcriptome reconstruction in spatial transcriptomics

Comprehensive benchmarking of RNA velocity methods across single-cell datasets

Discovering cell types and states from reference atlases with heterogeneous single-cell ATAC-seq features