A Comparative Study of an AI Model’s Robustness to Synthetic Data in Solving the Problem of Color Image Classification

Marina Barulina
Sergey Okunkov
Ivan Ulitin

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

This study examines the impact of data augmentation on machine learning perfor-mance, focusing on how synthetic data influences various neural network architec-tures. Common issues such as limited data, class imbalance, and poor coverage often lead to low model metrics, and data augmentation is frequently used to address these problems. The research aims to identify the optimal proportion of synthetic data, assess its effects across different architectures, and analyze the impact of augmenting only specific classes in a multi-class medical image classification task. Twelve widely used architectures were selected for the experiments, including classical convolutional networks, visual transformers, and the hybrid ConvNeXt model. Results showed that no universal optimal augmentation ratio exists, as model robust-ness to synthetic data varies, even within the same architecture family. Transformer and hybrid models demonstrated greater stability, while convolutional networks exhibited inconsistent behavior, likely due to higher sensitivity to data bias.

Version published to 10.20944/preprints202604.0294.v1
Apr 7, 2026

Selective State-Space Models in Medical Image Processing

This article has 3 authors:
1. Ali Emre Gök
2. Mustafa Yurdakul
3. Şakir Taşdemir
This article has no evaluationsLatest version Apr 14, 2026
ML-ConvNet: A Lightweight and Interpretable Unified Architecture for Medical Image Classification Across Modalities

This article has 10 authors:
1. Williams Ayivi
2. Xiaoling Zhang
3. Yeongx Yeong Hyeon Gu
4. Amil Aligayev
5. Ali Alqahtani
6. Wisdom Xornam Ativi
7. Francis Sam
8. Muhammed Amin Abdullah
9. Emmanuel Sarpong Addai Gyarteng
10. Mugahed A. Al-antari
This article has no evaluationsLatest version Mar 17, 2026
Improving Robust Image Classification Under Common Corruptions: A PDE-Regularized Variational Information Bottleneck Network

This article has 2 authors:
1. Gor Gharagyozyan
2. Mariam Haroutunian
This article has no evaluationsLatest version Mar 24, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Selective State-Space Models in Medical Image Processing

ML-ConvNet: A Lightweight and Interpretable Unified Architecture for Medical Image Classification Across Modalities

Improving Robust Image Classification Under Common Corruptions: A PDE-Regularized Variational Information Bottleneck Network