DT-DFRS: Enhanced Data-Free Robustness Stealing via Dual Teacher Guidance in Black-Box Settings

Abstract

Model Stealing Attacks (MSAs) pose a significant privacy threat to Machine Learning as a Service (MLaaS). An MSA aims to craft a substitute model that matches the performance of a target model solely by querying the MLaaS interface. Various techniques have been proposed to steal not only the accuracy of target models but also their robustness against adversarial attacks. Since the training data, architecture, and parameters of these models are inaccessible due to privacy constraints, most approaches rely on distillation: a clone model is trained to imitate the behavior of the target model, effectively stealing its accuracy. Robustness Distillation (RD) addresses both the accuracy and the robustness of the distilled model. However, most existing approaches distill model accuracy alone while neglecting robustness, despite its importance in safety-critical scenarios. Additionally, many approaches require access to real or proxy datasets, which is often infeasible due to privacy constraints, and others assume the availability of Soft-Label (SL) predictions, which requires retrieving the output probabilities of the softmax layer rather than only the final class prediction. In this paper, we propose a novel Dual Teacher Data-Free Hard-Label Robustness Stealing attack (DT-DFRS) that enables robustness distillation without requiring real or proxy data while preserving the model's accuracy in hard-label settings. Our experiments demonstrate that DT-DFRS outperforms existing state-of-the-art data-free hard-label methods, improving over the baseline by 3.41% and 3.13% on the CIFAR-10 and CIFAR-100 datasets, respectively.
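To make the data-free hard-label setting concrete, the minimal PyTorch sketch below is not the paper's DT-DFRS method; it only illustrates the query loop the abstract describes, where a clone is trained on nothing but the class indices returned by a black-box victim. The victim and clone architectures, hyperparameters, and random-noise queries (standing in for a trained generator) are placeholder assumptions of our own.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    # Placeholder models: any classifiers with matching output dimension work.
    victim = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10))  # black-box target
    clone = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10))   # attacker's substitute

    def query_hard_label(model, x):
        """Simulate the MLaaS API: only the predicted class index is returned,
        not the softmax probabilities (i.e., the hard-label setting)."""
        with torch.no_grad():
            return model(x).argmax(dim=1)

    optimizer = torch.optim.Adam(clone.parameters(), lr=1e-3)

    for step in range(100):
        # Data-free setting: queries are synthesized. Random noise is used here
        # as a stand-in for samples from a learned generator.
        x = torch.randn(64, 3, 32, 32)
        y_hard = query_hard_label(victim, x)      # hard labels only
        loss = F.cross_entropy(clone(x), y_hard)  # train the clone to imitate the victim
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

Note that this sketch optimizes only clean-label agreement; a robustness-stealing attack such as the one proposed here would additionally need an objective that transfers the victim's behavior on adversarial inputs.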