Integrating Multimodal Data with Large Foundation Models in Healthcare

Abstract

Large Foundation Models (LFMs) have rapidly become a transformative force in medical analysis, enabling unprecedented advances in the interpretation, integration, and reasoning over complex and heterogeneous healthcare data. By leveraging massive datasets encompassing medical images, electronic health records, clinical notes, and genomic sequences, LFMs provide a unified framework that transcends the limitations of traditional, narrowly scoped machine learning models. This survey comprehensively reviews state-of-the-art developments in LFMs tailored for medical applications, elucidating their architectural paradigms, training methodologies, and deployment strategies. We begin by examining the foundational building blocks that enable LFMs to handle multimodal data through modality-specific encoders and sophisticated fusion mechanisms, emphasizing the critical role of cross-attention and contrastive learning techniques in producing semantically aligned latent representations. The survey further explores practical applications, including diagnosis prediction, automated report generation, treatment recommendation, and personalized medicine, highlighting how LFMs enhance clinical decision-making by providing richer contextual understanding and reasoning capabilities.

Despite their promise, the integration of LFMs into clinical practice faces significant challenges related to interpretability, data privacy, fairness, and scalability. We delve into these issues in depth, discussing the implications of model opacity, bias amplification, regulatory constraints, and the scarcity of labeled medical data. Cutting-edge solutions such as federated learning, self-supervised pretraining, and fairness-aware algorithms are examined as potential mitigations. Ethical considerations are addressed to ensure responsible AI deployment that safeguards patient rights, promotes equitable healthcare, and fosters trust among medical professionals and patients alike.

Finally, the survey outlines future research opportunities, including advances in efficient training paradigms, improved model transparency, robust multimodal integration, and privacy-preserving technologies. The discussion underscores the necessity of interdisciplinary collaboration and human–AI partnership to realize the full potential of LFMs in improving health outcomes globally. Through this extensive analysis, we aim to provide researchers, clinicians, and policymakers with a holistic understanding of LFMs' capabilities, challenges, and prospects in the rapidly evolving landscape of medical artificial intelligence.
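To make the architectural pattern referenced in the abstract concrete, the sketch below illustrates, in a minimal and hypothetical form, how modality-specific encoders, a cross-attention fusion step, and a CLIP-style contrastive loss can be combined to align image and report embeddings in a shared latent space. It is not drawn from any specific surveyed model; the encoder architectures, dimensions, and class names (`ToyEncoder`, `CrossAttentionFusion`, `clip_style_loss`) are placeholder assumptions for illustration only.

```python
# Hypothetical sketch: multimodal alignment with modality-specific encoders,
# cross-attention fusion, and a symmetric contrastive (InfoNCE) objective.
import torch
import torch.nn as nn
import torch.nn.functional as F


class ToyEncoder(nn.Module):
    """Stand-in for a modality-specific encoder (e.g. a ViT for images, a BERT-style model for notes)."""
    def __init__(self, in_dim: int, latent_dim: int = 256):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(in_dim, 512), nn.GELU(), nn.Linear(512, latent_dim))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x)


class CrossAttentionFusion(nn.Module):
    """Text tokens attend to image tokens to produce a fused, pooled representation."""
    def __init__(self, latent_dim: int = 256, num_heads: int = 4):
        super().__init__()
        self.attn = nn.MultiheadAttention(latent_dim, num_heads, batch_first=True)

    def forward(self, text_tokens: torch.Tensor, image_tokens: torch.Tensor) -> torch.Tensor:
        fused, _ = self.attn(query=text_tokens, key=image_tokens, value=image_tokens)
        return fused.mean(dim=1)  # pool over the token dimension


def clip_style_loss(img_emb: torch.Tensor, txt_emb: torch.Tensor, temperature: float = 0.07) -> torch.Tensor:
    """Symmetric InfoNCE loss that pulls matching image/report pairs together in the latent space."""
    img_emb = F.normalize(img_emb, dim=-1)
    txt_emb = F.normalize(txt_emb, dim=-1)
    logits = img_emb @ txt_emb.t() / temperature
    targets = torch.arange(logits.size(0))
    return (F.cross_entropy(logits, targets) + F.cross_entropy(logits.t(), targets)) / 2


if __name__ == "__main__":
    batch, img_dim, txt_dim = 8, 1024, 768
    image_encoder, text_encoder = ToyEncoder(img_dim), ToyEncoder(txt_dim)
    images, reports = torch.randn(batch, img_dim), torch.randn(batch, txt_dim)

    # Encode each modality into a shared latent dimension, then align contrastively.
    img_emb, txt_emb = image_encoder(images), text_encoder(reports)
    loss = clip_style_loss(img_emb, txt_emb)

    # Fuse the two modalities with cross-attention over (toy, single-token) sequences.
    fusion = CrossAttentionFusion()
    fused = fusion(txt_emb.unsqueeze(1), img_emb.unsqueeze(1))
    print(loss.item(), fused.shape)
```

In practice, the token sequences would come from pretrained vision and language backbones rather than random features, but the pairing of a contrastive alignment objective with cross-attention fusion reflects the general design the abstract describes.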
