Evaluating the Transferability of Adversarial Robustness to Target Domains
Abstract
Knowledge transfer is an effective method for learning, particularly useful when labeled data is limited or when training a model from scratch is too expensive. Most research on transfer learning focuses on achieving \emph{accurate} models, overlooking the crucial aspect of adversarial robustness. However, ensuring robustness is vital, especially when applying transfer learning in safety-critical domains. We compare the robustness of models obtained by 11 training procedures on source domains and 3 retraining schemes on target domains, including normal, adversarial, contrastive, and Lipschitz-constrained training variants. Robustness is analyzed via adversarial attacks with respect to two different transfer learning model outputs: (i) the latent representations and (ii) the predictions. Studying latent representations in correlation with predictions is crucial for the robustness of transfer learning models, since the representations are learned solely on the source domain. Besides adversarial attacks that aim to change the prediction, we also analyze the effect of attacking the representations directly. Our results show that adversarial robustness can transfer across domains, but effective robust transfer learning requires techniques that ensure robustness independently of the training data, so that it is preserved during the transfer. Retraining on the target domain has only a minor impact on the robustness of the target model. Representations exhibit greater robustness than predictions across both the source and target domains.
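The abstract does not specify the attack procedure, so the following is only a minimal illustrative sketch of the two attack targets it distinguishes: attacking the predictions versus attacking the latent representations directly. It assumes PyTorch, a hypothetical `encoder`/`classifier` split of the transfer model, inputs scaled to [0, 1], and a generic L-infinity PGD loop; none of these choices are taken from the paper.

```python
import torch
import torch.nn.functional as F

def pgd_attack(model_fn, loss_fn, x, eps=8/255, alpha=2/255, steps=10):
    """Generic L-infinity PGD: maximize loss_fn(model_fn(x_adv)) over an eps-ball around x."""
    # Random start inside the eps-ball, clipped to the valid input range.
    x_adv = (x + torch.empty_like(x).uniform_(-eps, eps)).clamp(0, 1).detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = loss_fn(model_fn(x_adv))
        grad = torch.autograd.grad(loss, x_adv)[0]
        # Ascend the loss, then project back into the eps-ball and input range.
        x_adv = x_adv.detach() + alpha * grad.sign()
        x_adv = x + (x_adv - x).clamp(-eps, eps)
        x_adv = x_adv.clamp(0, 1)
    return x_adv.detach()

# (i) Attack the predictions: maximize the classification loss of the full model.
# x_adv_pred = pgd_attack(lambda z: classifier(encoder(z)),
#                         lambda logits: F.cross_entropy(logits, y), x)

# (ii) Attack the representations directly: push the encoder output away from
# the clean representation (an L2 distance is one possible choice of objective).
# with torch.no_grad():
#     h_clean = encoder(x)
# x_adv_repr = pgd_attack(encoder, lambda h: F.mse_loss(h, h_clean), x)
```

The two variants differ only in which output the loss is computed on: variant (ii) needs no labels and measures robustness of the representations alone, which is what makes it applicable to the source-domain encoder before any target-domain head exists.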