Cyst-X: AI-Powered Pancreatic Cancer Risk Prediction from Multicenter MRI in Centralized and Federated Learning
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Pancreatic cancer, projected to become the second-deadliest malignancy in Western societies by 2030, requires urgent innovations in early detection and risk stratification. Intraductal papillary mucinous neoplasms (IPMNs) represent critical precursor lesions, but current clinical guidelines demonstrate suboptimal accuracy in malignancy prediction, leading to either unnecessary surgeries or missed opportunities for early intervention. In this paper, we present Cyst-X, an artificial intelligence (AI) framework that accurately predicts IPMN malignant transformation using multicenter magnetic resonance imaging (MRI) data. Unlike most previous approaches that rely on computed tomography (CT), our method capitalizes on the superior soft tissue contrast of MRI, allowing for more precise identification of subtle imaging biomarkers. We developed and validated deep learning models on 723 T1-weighted and 738 T2-weighted MRI scans from 764 patients among seven international institutions, demonstrating significantly superior performance (AUC=0.82) compared to current clinical (Kyoto) guidelines (AUC=0.75) and expert radiologists. AI-based imaging features correlate strongly with clinically recognized malignancy markers, providing potential biologically relevant insights. This approach holds promise to significantly refine clinical decision-making, reduce unnecessary surgeries, and better identify high-risk IPMN patients for timely intervention. Our approach integrates a novel pancreas segmentation network with robust classification models that identify subtle imaging biomarkers associated with malignancy risk. Importantly, we demonstrate that these models retain high performance in a privacy-preserving federated learning setting, where institutions collaboratively train AI models without exchanging patient data to address key regulatory and ethical barriers. We publicly release the Cyst-X dataset--the first large-scale, multi-center pancreatic cyst MRI collection--to accelerate research in this field. This study addresses a critical clinical need while establishing technical foundations for privacy-preserving AI in radiology that could transform pancreatic cancer management through earlier intervention and reduced unnecessary procedures. The dataset can be accessed at https://osf.io/74vfs/, and the source code for our deep learning segmentation and classification models is available at https://github.com/NUBagciLab/Cyst-X.