Optimizing Deep Learning for Skin Cancer: A Comparative Study of Convolutional and Attention-Based Models
Abstract
Skin cancer is among the most prevalent malignancies worldwide, with over 1.5 million new cases estimated in 2022 alone according to GLOBOCAN data [1]. Despite the availability of dermoscopy, experienced dermatologists achieve a melanoma detection sensitivity of approximately 75–84% by visual examination, a rate that underscores the diagnostic limitations of unaided clinical assessment [2]. This study presents a systematic comparison of five deep learning architectures for the automated classification of seven skin lesion types on the HAM10000 dataset [6], which comprises 10,015 dermoscopic images. We evaluate five architectures spanning both convolutional and attention-based paradigms: ResNet-50, EfficientNet-B4, ConvNeXt-Base, Swin Transformer-Base, and Vision Transformer (ViT-B/16). To address the pronounced class imbalance inherent in the dataset, we employed patient-level data partitioning via GroupShuffleSplit to prevent lesion leakage across splits, together with a WeightedRandomSampler during training. All models were trained with AdamW optimization, label smoothing, and mixed-precision training. Transformer-based architectures were further stabilized through linear warmup scheduling and stochastic depth regularization. Our best single model, ViT-B/16, achieved a test accuracy of 85.66% and a macro AUC-ROC of 0.9629. An ensemble of EfficientNet-B4 and Swin Transformer-Base achieved the highest overall performance, with a test accuracy of 86.57%, a balanced accuracy of 79.98%, a macro F1-score of 0.7856, and a macro AUC-ROC of 0.9811. These results demonstrate that heterogeneous ensembles combining architecturally diverse models offer a meaningful improvement over individual classifiers in dermoscopic lesion classification.
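The leakage-safe split and class rebalancing described above can be sketched as follows. This is a minimal illustration, not the authors' code: the toy metadata frame and its column names (`lesion_id`, `dx`, mirroring the HAM10000 metadata CSV) are assumptions, and the inverse-frequency weights stand in for what PyTorch's `WeightedRandomSampler` would consume during training.

```python
# Hypothetical sketch of patient/lesion-level splitting (GroupShuffleSplit)
# and inverse-frequency sampling weights, as described in the abstract.
# Column names follow the HAM10000 metadata convention but are assumptions.
import pandas as pd
from sklearn.model_selection import GroupShuffleSplit

# Toy stand-in for the HAM10000 metadata table.
meta = pd.DataFrame({
    "image_id": [f"img_{i}" for i in range(10)],
    "lesion_id": ["les_0", "les_0", "les_1", "les_2", "les_2",
                  "les_3", "les_4", "les_5", "les_5", "les_6"],
    "dx": ["nv", "nv", "mel", "nv", "nv", "bkl", "mel", "nv", "nv", "bcc"],
})

# GroupShuffleSplit keeps every image of a given lesion in the same split,
# preventing lesion-level leakage between train and test.
gss = GroupShuffleSplit(n_splits=1, test_size=0.3, random_state=42)
train_idx, test_idx = next(gss.split(meta, groups=meta["lesion_id"]))
train, test = meta.iloc[train_idx], meta.iloc[test_idx]
assert set(train["lesion_id"]).isdisjoint(test["lesion_id"])

# Inverse class-frequency weights: rare classes are drawn more often.
# These per-sample weights would be passed to a WeightedRandomSampler.
counts = train["dx"].value_counts()
sample_weights = (1.0 / counts)[train["dx"]].to_numpy()
```

In a real pipeline the weights would be wrapped as `WeightedRandomSampler(sample_weights, num_samples=len(train), replacement=True)` and handed to the training `DataLoader`.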