A Pan-Organ Vision-Language Model for Generalizable 3D CT Representations

Abstract

Generalizable foundation models for computed tomographic (CT) medical imaging data are emerging AI tools anticipated to vastly improve clinical workflow efficiency. However, existing models are typically trained within narrow imaging contexts, including limited anatomical coverage, contrast settings, and clinical indications. These constraints reduce their ability to generalize across the broad spectrum of real-world presentations encountered in volumetric CT imaging data. We introduce Percival, a vision-language foundation model trained on over 400,000 CT volumes and paired radiology reports from more than 50,000 participants enrolled in the Penn Medicine BioBank. Percival employs a dual-encoder architecture with a transformer-based image encoder and a BERT-style language encoder, aligned via symmetric contrastive learning. Percival was validated on imaging data from over 20,000 participants, encompassing over 100,000 CT volumes. In image-text recall tasks, Percival outperforms models trained on limited anatomical windows. To assess Percival’s clinical knowledge, we evaluated its biologic, phenotypic, and prognostic relevance using laboratory-wide and phenome-wide association studies and survival analyses, uncovering a rich latent structure aligned with physiological measurements and disease phenotypes.
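The abstract states that the image and text encoders are aligned via symmetric contrastive learning. As a rough illustration only (the paper's actual objective, embedding dimension, and temperature are not given here), the sketch below shows a generic CLIP-style symmetric loss in PyTorch; the function name and all hyperparameters are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn.functional as F

def symmetric_contrastive_loss(image_emb: torch.Tensor,
                               text_emb: torch.Tensor,
                               temperature: float = 0.07) -> torch.Tensor:
    """Generic CLIP-style symmetric InfoNCE loss over a batch of paired embeddings.

    image_emb, text_emb: (batch, dim) outputs of the image and text encoders.
    """
    # L2-normalize so the dot product becomes cosine similarity.
    image_emb = F.normalize(image_emb, dim=-1)
    text_emb = F.normalize(text_emb, dim=-1)

    # Pairwise similarity logits between every image and every report, scaled by temperature.
    logits = image_emb @ text_emb.t() / temperature

    # Matching volume/report pairs lie on the diagonal of the logit matrix.
    targets = torch.arange(logits.size(0), device=logits.device)

    # Average the image-to-text and text-to-image cross-entropy terms (the "symmetric" part).
    loss_i2t = F.cross_entropy(logits, targets)
    loss_t2i = F.cross_entropy(logits.t(), targets)
    return (loss_i2t + loss_t2i) / 2

if __name__ == "__main__":
    # Toy batch: 8 hypothetical volume/report embedding pairs of dimension 512.
    img = torch.randn(8, 512)
    txt = torch.randn(8, 512)
    print(symmetric_contrastive_loss(img, txt).item())
```

Under this kind of objective, each CT volume embedding is pulled toward its own report embedding and pushed away from the other reports in the batch, and vice versa, which is what enables the image-text recall evaluation described above.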
