Self-Supervised AI Reveals a Hidden Landscape of Prognostic Spatial Patterns in Multiplex Immunofluorescence Images

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Modern spatial proteomic methods, such as multiplex immunofluorescence (mIF) imaging, offer a data-rich view of spatial biology in intact tissues. However, interpreting its complexity is a major bottleneck, limiting its potential for biological discovery and clinical translation. Current computational methods often rely on segmentation-based approaches that discard crucial morphological information and are limited to testing pre-defined hypotheses. Here, we introduce a self-supervised learning (SSL) framework that enables hypothesis-agnostic, context-aware discovery of biomarkers directly from mIF images. Our approach extracts rich feature representations that capture holistic architectural patterns, which integrate cellular morphology, marker interactions, and microenvironmental context without human supervision. Applying this framework to over 7,000 mIF tissue images from over 1,800 patients in two distinct cancer types, we demonstrate superior prognostic performance over conventional segmentation analyses. The method autonomously identified previously unknown and potentially clinically actionable biological patterns. In lung adenocarcinoma, these include a Ki67-mediated immune evasion phenotype, a subcellular pattern of GLB1 expression which aligns with low-grade EGFR-driven tumours, and distinct modes of tumour-immune interaction in PD-L1+ patients. We also find a regulatory T-cell-mediated immunosupressive environment promoting tumour budding in colorectal carcinoma. Our work establishes SSL as a powerful, scalable, and unbiased platform to decode tissue ecosystems while being fully explainable without pre-defined hypotheses. This paradigm shift transforms high-plex imaging from a hypothesis-testing tool into a hypothesis-generating engine that can accelerate the discovery of next-generation spatial biomarkers.

Article activity feed