Multi-View Echocardiographic Embedding for Accessible AI Development

Takeshi Tohyama
Ahram Han
Dukyong Yoon
Kenneth Paik
Brian Gow
Nura Izath
Jacques Kpodonu
Leo Anthony Celi

Abstract

Background and Aims

Echocardiography serves as a cornerstone of cardiovascular diagnostics through multiple standardized imaging views. While recent AI foundation models demonstrate superior capabilities across cardiac imaging tasks, their massive computational requirements and reliance on large-scale datasets create accessibility barriers, limiting AI development to well-resourced institutions. Vector embedding approaches offer promising solutions by leveraging compact representations from original medical images for downstream applications. Furthermore, demographic fairness remains critical, as AI models may incorporate biases that confound clinically relevant features. We developed a multi-view encoder framework to address computational accessibility while investigating demographic fairness challenges.

Methods

We used the MIMIC-IV-ECHO dataset (7,169 echocardiographic studies) to develop a transformer-based multi-view encoder that aggregates view-level representations into study-level embeddings. The framework incorporated adversarial learning to suppress demographic information while maintaining clinical performance. We evaluated performance across 21 binary classification tasks encompassing echocardiographic measurements and clinical diagnoses, comparing the encoder against foundation model baselines and varying the adversarial weight.
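The aggregation step described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a single attention head, a learned study-level query vector (here called `cls_token`), and 512-dimensional view embeddings, and the weight matrices are random placeholders rather than trained parameters. Missing views are handled with a boolean mask so that absent views receive zero attention weight.

```python
import numpy as np


def softmax(x):
    """Numerically stable softmax over a 1-D array (handles -inf entries)."""
    shifted = x - np.max(x)
    exps = np.exp(shifted)
    return exps / np.sum(exps)


def aggregate_views(view_embeddings, view_mask, w_query, w_key, w_value, cls_token):
    """Pool per-view embeddings into one study-level embedding.

    view_embeddings: (n_views, d) array, one row per echocardiographic view.
    view_mask:       (n_views,) boolean array, False where a view is missing.
    cls_token:       (d,) query vector standing in for the whole study
                     (hypothetical name, assumed to be learned in training).
    Returns the pooled (d,) embedding and the (n_views,) attention weights.
    """
    if not np.any(view_mask):
        raise ValueError("at least one view must be present")
    d = view_embeddings.shape[1]
    query = cls_token @ w_query              # (d,)
    keys = view_embeddings @ w_key           # (n_views, d)
    values = view_embeddings @ w_value       # (n_views, d)
    scores = keys @ query / np.sqrt(d)       # scaled dot-product scores
    scores = np.where(view_mask, scores, -np.inf)  # missing views get weight 0
    weights = softmax(scores)
    return weights @ values, weights


# Usage with random placeholder data: 5 view slots, 2 of them missing.
rng = np.random.default_rng(0)
dim, n_views = 512, 5
views = rng.normal(size=(n_views, dim))
mask = np.array([True, True, False, True, False])
w_q, w_k, w_v = (rng.normal(scale=dim ** -0.5, size=(dim, dim)) for _ in range(3))
cls = rng.normal(size=dim)
study_vec, attn = aggregate_views(views, mask, w_q, w_k, w_v, cls)
```

Masking before the softmax is one simple way to let the same encoder accept studies with different numbers of available views; the actual architecture may handle this differently.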

Results

The multi-view encoder achieved a mean improvement of 9.0 AUC points (12.0% relative improvement) across clinical tasks compared to foundation model embeddings. Compared with the conventional approach, performance also remained robust when fewer echocardiographic views were available. However, adversarial learning showed limited effectiveness in reducing demographic shortcuts, and stronger adversarial weighting substantially compromised diagnostic performance.
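The two ways of reporting the gain (absolute AUC points and relative improvement) can be related with a short calculation. The baseline and improved AUC values below are hypothetical, chosen only because they reproduce the reported pair of numbers; the abstract does not state the baseline AUC, and if the relative improvement was averaged per task rather than computed on the mean AUC, the implied baseline would differ.

```python
def auc_points(baseline_auc, new_auc):
    """Absolute improvement, expressed in AUC points (AUC x 100)."""
    return (new_auc - baseline_auc) * 100


def relative_improvement(baseline_auc, new_auc):
    """Improvement as a fraction of the baseline AUC."""
    return (new_auc - baseline_auc) / baseline_auc


# Hypothetical values, not taken from the paper.
baseline, improved = 0.75, 0.84
points = auc_points(baseline, improved)
relative = relative_improvement(baseline, improved)
print(f"{points:.1f} AUC points, {relative:.1%} relative improvement")
```

Under these assumed values, a 9.0-point gain corresponds to a 12.0% relative improvement, which is consistent with a mean baseline AUC in the neighborhood of 0.75.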

Conclusions

Our framework democratizes advanced cardiac AI capabilities, enabling substantial diagnostic improvements without massive computational infrastructure. While algorithmic approaches to demographic fairness showed limitations, the multi-view encoder provides a practical pathway for broader AI adoption in cardiovascular medicine with enhanced efficiency in real-world clinical settings.

Structured Graphical Abstract

Key Question

Can multi-view encoder frameworks achieve superior diagnostic performance compared to foundation model embeddings while reducing computational requirements and maintaining robust performance with fewer echocardiographic views for cardiac AI applications?

Key Finding

The multi-view encoder achieved a 12.0% relative improvement (9.0 AUC points) across 21 cardiac tasks compared with foundation model baselines, using compact 512-dimensional vector embeddings and remaining robust with fewer echocardiographic views.
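To give a sense of how compact such embeddings are, the following back-of-the-envelope calculation estimates the storage needed for one 512-dimensional embedding per study. The study count and dimensionality come from the abstract; storing values as 32-bit floats and keeping exactly one vector per study are assumptions.

```python
N_STUDIES = 7169        # MIMIC-IV-ECHO studies, as reported in the abstract
EMBED_DIM = 512         # embedding dimensionality, as reported in the abstract
BYTES_PER_FLOAT32 = 4   # assumption: embeddings are stored as float32


def embedding_store_bytes(n_studies, dim, bytes_per_value):
    """Raw size of a dense embedding matrix, ignoring file-format overhead."""
    return n_studies * dim * bytes_per_value


total_bytes = embedding_store_bytes(N_STUDIES, EMBED_DIM, BYTES_PER_FLOAT32)
total_mib = total_bytes / (1024 ** 2)
print(f"{total_bytes} bytes (~{total_mib:.1f} MiB)")
# prints "14682112 bytes (~14.0 MiB)"
```

Under these assumptions the whole set of study-level embeddings fits in roughly 14 MiB, small enough to hold in memory on a laptop.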

Take-home Message

Vector embedding approaches with attention-based multi-view integration significantly improve cardiac diagnostic performance while reducing computational requirements, offering a pathway toward more efficient AI implementation in clinical settings.

Translational Perspective

Our proposed multi-view encoder framework overcomes critical barriers to the widespread adoption of artificial intelligence in echocardiography. By dramatically reducing computational requirements, the multi-view encoder approach allows smaller healthcare institutions to develop sophisticated AI models locally. The framework maintains robust performance with fewer echocardiographic examinations, which addresses real-world clinical constraints where comprehensive imaging is not feasible due to patient factors or time limitations. This technology provides a practical way to democratize advanced cardiac AI capabilities, which could improve access to cardiovascular care across diverse healthcare settings while reducing dependence on proprietary datasets and massive computational resources.
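As a rough illustration of what local development on precomputed embeddings could look like, the sketch below fits a logistic-regression classifier on frozen 512-dimensional vectors using plain gradient descent. It is a sketch under stated assumptions, not the authors' downstream pipeline: the embeddings and labels are synthetic, and the function names and hyperparameters (`train_logistic_probe`, `lr`, `epochs`, `l2`) are hypothetical.

```python
import numpy as np


def train_logistic_probe(x, y, lr=0.1, epochs=200, l2=1e-3):
    """Fit a logistic-regression probe on frozen embeddings.

    x: (n_samples, dim) embedding matrix; y: (n_samples,) labels in {0, 1}.
    Uses full-batch gradient descent with L2 regularization on the weights.
    """
    n_samples, dim = x.shape
    weights = np.zeros(dim)
    bias = 0.0
    for _ in range(epochs):
        probs = 1.0 / (1.0 + np.exp(-(x @ weights + bias)))
        weights -= lr * (x.T @ (probs - y) / n_samples + l2 * weights)
        bias -= lr * np.mean(probs - y)
    return weights, bias


def predict(x, weights, bias):
    """Return hard 0/1 predictions at a decision threshold of 0.5."""
    return (x @ weights + bias > 0).astype(int)


# Synthetic stand-in data: 200 "studies" with a linearly separable label.
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(200, 512))
true_direction = rng.normal(size=512)
labels = (embeddings @ true_direction > 0).astype(float)

w, b = train_logistic_probe(embeddings, labels)
train_accuracy = float(np.mean(predict(embeddings, w, b) == labels))
```

A classifier of this size trains in well under a second on a CPU, which is the kind of workload the framework is meant to make sufficient once embeddings are available. Real use would need a held-out evaluation set and AUC rather than training accuracy.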

Version published to 10.1101/2025.08.15.25333725 on medRxiv
Aug 19, 2025

