Improving the Quality of Skin Lesion Data for Training Vision-Language Models
Abstract
Skin cancer diagnosis using machine learning faces significant challenges, primarily due to the lack of well-labelled and balanced skin lesion datasets. Most available datasets are limited to only two lesion types, melanoma and nevi, or exhibit severe class and skin tone imbalance, leading to biases during model training. Vision-language models (VLMs) face an additional challenge: these datasets also lack the semantic labelling needed for effective training. To address these challenges, several researchers have used generative adversarial networks (GANs) to generate realistic synthetic images. While this can improve diagnostic accuracy, it raises ethical and trust concerns, especially in clinical settings. Moreover, applying GANs to imbalanced datasets amplifies the existing biases. This paper proposes an alternative approach: curating and combining the existing public datasets HAM10000 and BCN20000 into a single well-labelled dataset called RHB, optimised for training Google’s Gemma 3 4B model.