Comparative analysis of convolutional and vision transformer models for automated leukocyte classification enhanced by generative color augmentation

João Kasprowicz
Alexandre Gonçalves Silva

Abstract

Manual differential leukocyte counting is a critical yet time-consuming and observer-dependent process in clinical hematology. This study presents a comparative analysis of You Only Look Once v11 (YOLOv11) and Vision Transformer (ViT) architectures for the classification of 14 leukocyte types and artifacts using a private clinical dataset. We further investigated the impact of HistAuGAN, a domain-specific data augmentation strategy designed to simulate real-world staining variability. Across experimental settings, ViT models achieved higher overall performance than YOLOv11 variants, and the application of HistAuGAN led to systematic improvements in both architectural families. The best-performing configuration, trained on the HistAuGAN-augmented dataset, achieved a macro F1-score of 98.36% and an overall accuracy of 99.75% on the validation set. To assess generalization capacity, this configuration was additionally evaluated on the public PBC and LISC datasets, demonstrating meaningful cross-dataset performance without architectural modification. Model interpretability was examined through attention- and activation-based saliency analyses, indicating that predictions were primarily driven by morphologically relevant leukocyte regions rather than background structures. These findings suggest that combining global-context modeling with domain-informed augmentation provides a robust and clinically coherent framework for fine-grained leukocyte classification.
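The abstract reports a macro F1-score (98.36%) that is lower than the overall accuracy (99.75%). A minimal sketch of why the two metrics can diverge on class-imbalanced data follows. The function name `accuracy_and_macro_f1`, the two class labels, and the toy counts are illustrative assumptions, not the study's data or evaluation code.

```python
def accuracy_and_macro_f1(y_true, y_pred):
    """Return (accuracy, macro F1) for two equal-length label sequences."""
    labels = sorted(set(y_true) | set(y_pred))
    pairs = list(zip(y_true, y_pred))

    # Accuracy: fraction of samples predicted correctly, dominated by frequent classes.
    accuracy = sum(t == p for t, p in pairs) / len(pairs)

    # Macro F1: unweighted mean of per-class F1, so every class counts equally.
    f1_scores = []
    for label in labels:
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        f1_scores.append(2 * tp / denom if denom else 0.0)
    macro_f1 = sum(f1_scores) / len(f1_scores)
    return accuracy, macro_f1


# Toy, hypothetical data: 98 samples of a common class and 2 of a rare one.
y_true = ["neutrophil"] * 98 + ["basophil"] * 2
# Every common-class sample is correct; one rare-class sample is missed.
y_pred = ["neutrophil"] * 98 + ["basophil", "neutrophil"]

acc, macro_f1 = accuracy_and_macro_f1(y_true, y_pred)
print(f"accuracy={acc:.4f} macro_f1={macro_f1:.4f}")
```

In this toy case a single error on the rare class leaves accuracy at 0.99 while macro F1 drops to roughly 0.83, which is why macro F1 is often the more demanding figure when some of the classes are rare.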

Version published to 10.1007/s11760-026-05309-2
Apr 24, 2026
Version published to 10.21203/rs.3.rs-7926842/v1 on Research Square
Nov 6, 2025

Related articles

Optimizing Deep Learning for Skin Cancer: A Comparative Study of Convolutional and Attention-Based Models
Author: Khaled Wael Ezzat
No evaluations. Latest version: Apr 8, 2026

GWO-Based Fed-UNet-CNN Model for Leukocyte Classification Across Developmental Stages
Author: Dilip Nallamasa
No evaluations. Latest version: Mar 24, 2026

A Comparative Study of an AI Model’s Robustness to Synthetic Data in Solving the Problem of Color Image Classification
Authors: Marina Barulina, Sergey Okunkov, Ivan Ulitin
No evaluations. Latest version: Apr 7, 2026
