Eye-Guided Multimodal Fusion: Toward an Adaptive Learning Framework Using Explainable Artificial Intelligence

Sahar Moradizeyveh
Ambreen Hanif
Sidong Liu
Yuankai Qi
Amin Beheshti
Antonio Di Ieva

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Interpreting diagnostic imaging and identifying clinically relevant features remain challenging tasks, particularly for novice radiologists who often lack structured guidance and expert feedback. To bridge this gap, we propose an Eye-Gaze Guided Multimodal Fusion framework that leverages expert eye-tracking data to enhance learning and decision-making in medical image interpretation. By integrating chest X-ray (CXR) images with expert fixation maps, our approach captures radiologists’ visual attention patterns and highlights regions of interest (ROIs) critical for accurate diagnosis. The fusion model utilizes a shared backbone architecture to jointly process image and gaze modalities, thereby minimizing the impact of noise in fixation data. We validate the system’s interpretability using Gradient-weighted Class Activation Mapping (Grad-CAM) and assess both classification performance and explanation alignment with expert annotations. Comprehensive evaluations, including robustness under gaze noise and expert clinical review, demonstrate the framework’s effectiveness in improving model reliability and interpretability. This work offers a promising pathway toward intelligent, human-centered AI systems that support both diagnostic accuracy and medical training.

Version published to 10.3390/s25154575
Jul 24, 2025
Version published to 10.20944/preprints202505.1631.v1
May 20, 2025

A Survey of Contrastive Learning in Medical AI: Foundations, Biomedical Modalities, and Future Directions

This article has 6 authors:
1. George Obaido
2. Ibomoiye Domor Mienye
3. Kehinde Aruleba
4. Chidozie Williams Chukwu
5. Ebenezer Esenogho
6. Cameron Modisane
This article has no evaluationsLatest version Dec 26, 2025
Bridging Perception and Reasoning: An Evidence-Based Agentic System for Diagnosis and Treatment Recommendations of Vascular Anomalies

This article has 3 authors:
1. Yize Zhang
2. Yajing Qiu
3. Xiaoxi Lin
This article has no evaluationsLatest version Jan 19, 2026
Application of Deep Learning Strategies in the Standardization and Diagnostic Efficiency Enhancement of Chest X-ray Imaging

This article has 5 authors:
1. Wen Chang Tseng
2. Yung-Cheng Wang
3. Wei-Chi Chen
4. Sen-Ping Lin
5. Kang-Ping Lin
This article has no evaluationsLatest version Dec 18, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Survey of Contrastive Learning in Medical AI: Foundations, Biomedical Modalities, and Future Directions

Bridging Perception and Reasoning: An Evidence-Based Agentic System for Diagnosis and Treatment Recommendations of Vascular Anomalies

Application of Deep Learning Strategies in the Standardization and Diagnostic Efficiency Enhancement of Chest X-ray Imaging