spEMO: Leveraging Multi-Modal Foundation Models for Analyzing Spatial Multi-Omic and Histopathology Data
Abstract
Recent advances in pathology foundation models (PFMs), pretrained on large-scale histopathological images, have significantly accelerated progress in disease-centered applications. In parallel, spatial multi-omic technologies measure gene and protein expression at high spatial resolution, offering a rich understanding of tissue context. However, current models fall short of effectively integrating these complementary data modalities. To fill this gap, we introduce spEMO, a novel computational system that unifies embeddings from pathology foundation models and large language models (LLMs) to analyze spatial multi-omic data. By incorporating multimodal representations, spEMO outperforms models trained on single-modality data across a broad range of downstream tasks, including spatial domain identification, spot-type classification, whole-slide disease-state prediction and interpretation, inference of multicellular interactions, and automated medical report generation. Its strong performance across these tasks demonstrates spEMO's value in both biological and clinical applications. Additionally, we propose a new evaluation task, multi-modal alignment, to assess the information-retrieval capabilities of pathology foundation models; this task provides a principled benchmark for evaluating and improving model architectures. Collectively, spEMO represents a step forward in building holistic, interpretable, and generalizable AI systems for spatial biology and pathology.