Automated registration of spatial expression data scales multimodal integration to large cohorts
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Recent advances in spatial proteomics enable quantification of the spatial distribution of protein expression across a variety of scales, resolutions, and multiplexing. Registering images from such technologies across modalities is an essential task that enables both the integration of complementary imaging technologies and validation of biological findings. This can be particularly challenging when the modalities capture fundamentally different types of data, such as light intensity, probe counts, or heavy metal counts. However, few datasets and methods address this problem at scale. Here, we introduce the largest dataset to date of cross-modality imaging of both cell line and tissue slides suitable for benchmarking registration methods. We further present Twocan, a Bayesian optimization framework that enables robust automated registration between immunofluorescence imaging and highly multiplex spatial proteomics data. Our method achieves significantly higher registration success rates compared to existing approaches across our comprehensive dataset of 954 image pairs.