Extended Counterfactual Adversarial Examples for Mitigating Privacy Risk in Adversarially Robust Models
Abstract
In this paper, we propose extended Counterfactual Adversarial Example Generation (e-CAEG), an extended version of our conference paper published at APWeb-WAIM 2025. Building on the conference paper, we summarize the contributions of this work as follows. First, e-CAEG leverages latent-space representations to generate in-distribution adversarial examples for both targeted and untargeted scenarios. Second, e-CAEG acts as a regularizer that bridges the generalization gap by forcing the model to rely on robust semantic features. Finally, experiments on MNIST and Fashion-MNIST, supported by t-SNE distributional visualizations, demonstrate that our approach effectively lowers membership-inference accuracy to near-random levels while preserving model utility. Furthermore, we analyze the trade-offs between accuracy, robustness, and privacy, identifying an optimal balance achieved when approximately 95% of the training data consists of e-CAEG-generated examples.