Primary Classification of Skin Diseases Using an Explainable Multimodal VLM with a DBSCAN-Centroid-Based Confidence Score

Abstract

This paper presents an explainable multimodal vision–language framework for the primary classification of skin diseases. Using compact vision–language models (VLMs)—Gemma 3 4B and Qwen 2.5 VL 7B—the system integrates synthetic skin tumor and lesion images with natural-language disease descriptions, grounding its predictions in lay-accessible dermatologic concepts to improve interpretability. Low-rank adaptation (LoRA) fine-tuning on the AI-Hub synthetic skin-tumor dataset demonstrates the feasibility of deploying such models in resource-constrained environments. Model performance is evaluated with standard quantitative metrics—accuracy, precision, recall, and F1-score—and a DBSCAN-centroid-based semantic confidence-scoring method is introduced that estimates a prediction's confidence from its similarity to cluster centroids in the image-embedding space. The experimental results show that lightweight multimodal VLMs can achieve stable and accurate performance on primary skin disease classification, indicating their potential as explainable, AI-assisted tools for dermatologic decision support.
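The abstract does not give the exact formulation of the DBSCAN-centroid-based confidence score. A minimal sketch of the general idea, assuming cosine similarity to the nearest cluster centroid (the function name, hyperparameters, and toy data below are illustrative, not taken from the paper):

```python
import numpy as np
from sklearn.cluster import DBSCAN


def centroid_confidence(train_emb, query_emb, eps=0.5, min_samples=5):
    """Cluster reference image embeddings with DBSCAN, then score a
    query embedding by cosine similarity to its nearest centroid."""
    labels = DBSCAN(eps=eps, min_samples=min_samples).fit_predict(train_emb)
    # one centroid per cluster; label -1 marks DBSCAN noise points
    centroids = np.array([train_emb[labels == k].mean(axis=0)
                          for k in set(labels) if k != -1])
    # cosine similarity between the query and every centroid
    sims = centroids @ query_emb / (
        np.linalg.norm(centroids, axis=1) * np.linalg.norm(query_emb))
    return sims.max()  # confidence = similarity to the closest centroid


# toy example: two tight clusters of 2-D "embeddings"
rng = np.random.default_rng(0)
a = rng.normal(loc=[0.0, 0.0], scale=0.05, size=(20, 2))
b = rng.normal(loc=[3.0, 3.0], scale=0.05, size=(20, 2))
emb = np.vstack([a, b])

# a query near the second cluster should receive a high score
conf = centroid_confidence(emb, np.array([3.0, 3.1]), eps=0.3, min_samples=4)
```

In this reading, a query embedding that falls far from every cluster centroid yields a low score, flagging the prediction as unreliable; the paper's actual formulation may differ in the distance metric or normalization.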
