DermFusionX: An Explainable CNN–MLP Late Fusion Framework for Multimodal Skin Lesion Classification

Abstract

Deep learning has achieved extraordinary performance in domains such as computer vision and natural language processing, and is now expanding into fields such as healthcare that demand high transparency. In clinical practice, dermatologists combine multiple data sources, such as patient metadata and lesion images, to reach a diagnosis. Motivated by this, we propose a multimodal approach to enhance skin lesion classification on the HAM10000 dataset. We conducted extensive experiments comparing unimodal models (using only metadata or only images) with multimodal models (combining both). Our evaluation covered several pre-trained Convolutional Neural Networks (e.g., ResNet50, VGG19, XceptionNet, InceptionV3) and a novel custom architecture, DermFusionX. Results demonstrate that multimodal models significantly outperform their unimodal counterparts, with DermFusionX achieving precision and recall of 90%. To ensure transparency, we employed explainable AI techniques (LIME and SHAP) to interpret the models' decisions.
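The abstract does not detail the internals of DermFusionX, but the title indicates a CNN–MLP late (decision-level) fusion design: a CNN scores the lesion image, an MLP scores the patient metadata, and their per-class probabilities are combined. The sketch below illustrates this pattern under stated assumptions; it is not the authors' implementation. The ResNet50 backbone (one of the paper's baselines) stands in for the image branch, META_DIM and the MLP layer sizes are hypothetical, and equal-weight probability averaging is just one common fusion rule.

```python
import torch
import torch.nn as nn
from torchvision import models

NUM_CLASSES = 7  # HAM10000 has 7 lesion categories
META_DIM = 3     # hypothetical: e.g., encoded age, sex, lesion site


class ImageBranch(nn.Module):
    """Pre-trained CNN backbone with a new classification head
    (ResNet50 used here as a stand-in for the paper's image branch)."""
    def __init__(self):
        super().__init__()
        backbone = models.resnet50(weights=models.ResNet50_Weights.DEFAULT)
        backbone.fc = nn.Linear(backbone.fc.in_features, NUM_CLASSES)
        self.backbone = backbone

    def forward(self, x):
        return self.backbone(x)  # (B, NUM_CLASSES) logits


class MetadataBranch(nn.Module):
    """Small MLP over tabular patient metadata (layer sizes assumed)."""
    def __init__(self):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(META_DIM, 64), nn.ReLU(),
            nn.Linear(64, 32), nn.ReLU(),
            nn.Linear(32, NUM_CLASSES),
        )

    def forward(self, x):
        return self.mlp(x)  # (B, NUM_CLASSES) logits


class LateFusion(nn.Module):
    """Decision-level fusion: each branch produces class probabilities,
    which are then averaged. The paper does not specify its fusion rule;
    equal-weight averaging is one standard choice."""
    def __init__(self):
        super().__init__()
        self.image_branch = ImageBranch()
        self.meta_branch = MetadataBranch()

    def forward(self, image, meta):
        p_img = torch.softmax(self.image_branch(image), dim=1)
        p_meta = torch.softmax(self.meta_branch(meta), dim=1)
        return 0.5 * (p_img + p_meta)  # fused class probabilities


# Quick shape check with dummy inputs.
model = LateFusion().eval()
with torch.no_grad():
    probs = model(torch.randn(2, 3, 224, 224), torch.randn(2, META_DIM))
print(probs.shape)  # torch.Size([2, 7])
```

A practical advantage of this decision-level design is that each branch can be trained and inspected independently, which also simplifies post-hoc explanation: LIME or SHAP can be applied to the image branch and the metadata branch separately to attribute a fused prediction to pixels and to individual metadata fields.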
