Trust-Aware Benchmarking of GAN, VAE, and Diffusion Models for Synthetic Data in Image and Tabular Domains

Anil Kumar Shukla

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Synthetic data generation using generative AI models offers a promising solution to challenges in data availability, privacy, and fairness. However, comparative insights across model families and data modalities remain limited, especially when trust-related dimensions such as fairness, privacy, and efficiency are considered alongside fidelity.This paper presents a trust-aware benchmarking study across both visual (CIFAR-10, MNIST) and tabular (Adult Income) domains, evaluating representative baselines—vanilla GAN/WGAN-GP, standard and β-VAE, DDPM and classifier-guided DDPM, and hybrids such as VAE-GAN and Latent Diffusion. Advanced models including StyleGAN2/3, BigGAN, CTGAN, Diffusion Transformers, and GigaGAN are acknowledged to situate the findings within the evolving landscape of foundation-scale generators.Performance is assessed using a multi-objective framework that integrates fidelity (FID, precision/recall), fairness (demographic parity), privacy leakage resistance, and computational efficiency. Results show that hybrid latent diffusion models achieve near-diffusion fidelity (FID 10.2 vs. 8.5 on CIFAR-10; 7.8 vs. 6.2 on MNIST) while reducing sampling time by over 70%. On tabular data, hybrids balance accuracy (84.7%) and fairness (0.93), whereas classical GANs and VAEs exhibit trade-offs between fidelity, efficiency, and fairness.To the best of our knowledge, this is the first study to benchmark GANs, VAEs, diffusion, and hybrid models across both image and tabular data using a unified, trust-aware evaluation framework. By providing reproducible, cross-domain comparisons, this work offers practical guidance for selecting and deploying generative models in trust-sensitive applications such as healthcare, finance, and autonomy.

Version published to 10.21203/rs.3.rs-7649434/v1 on Research Square
Oct 13, 2025

Fairness-Aware Deep Learning for Job Application Screening

This article has 1 author:
1. Timothy Adeyemi
This article has no evaluationsLatest version Mar 12, 2026
Synthetic Participants Generated by Large Language Models: A Systematic Literature Review

This article has 3 authors:
1. Eduard Kuric
2. Peter Demcak
3. Matus Krajcovic
This article has no evaluationsLatest version Mar 10, 2026
Persistence Landscapes Across Privacy Budgets for Explanation Methods Across Differential Privacy Mechanisms

This article has 1 author:
1. Paul Zheng
This article has no evaluationsLatest version Feb 16, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Fairness-Aware Deep Learning for Job Application Screening

Synthetic Participants Generated by Large Language Models: A Systematic Literature Review

Persistence Landscapes Across Privacy Budgets for Explanation Methods Across Differential Privacy Mechanisms