Data Quality Assessment of GAN and VAE-Generated Malicious Network Traffic for Cybersecurity Applications

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Data scarcity and class imbalance pose persistent challenges in cybersecurity AI, par-ticularly for intrusion detection systems, where real-world malicious network traffic is rare and sensitive. To address this, the present study explores the generation of synthetic network traffic using deep generative models, focusing on both Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs). Building upon recent advances in data synthesis, we introduce a systematic framework for Data Quality Assessment (DQA) to evaluate the realism and utility of generated malicious traffic. Our approach compares the outputs of GANs and VAEs not only in terms of statistical similarity to real attack patterns, but also by measuring their effect on the performance of super-vised/unsupervised Intrusion Detection Models. By embedding synthetic samples into the training process, we quantify improvements in classification accuracy, recall, and robustness under various threat scenarios. The outcomes of this work aim to enhance trust in synthetic data generation techniques, offering reliable augmentation strategies for cybersecurity applications under data-limited conditions.

Article activity feed