Adversarial Sample Generation Method for Scene Text Images Based on Up-sampling

Zihao Zeng
Chenxiao Wang
Xiao Yang
Yong Chen

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Scene Text Recognition (STR) has significantly enhanced the efficiency of information acquisition and interaction in natural environments. However, it also introduces potential security risks, such as the unauthorized recognition and extraction of sensitive textual information from the environment, including personal identification numbers, license plates, and other confidential data. To address privacy protection in scene text, recent research has proposed using minimal pixel perturbations to safeguard textual information, making the content observable but difficult to extract accurately. However, such perturbation attacks are easily noticeable to the human eye, allowing adversaries to counteract them with defensive measures such as filtering small perturbations. Existing methods fail to simultaneously ensure high visual quality and make the perturbation attacks imperceptible. In this study, we propose a novel scene text adversarial sample generation method incorporating up-sampling. This method achieves a high attack success rate while increasing the payload applied to the image, preserving the perturbed image quality, and improving the stealthiness of the adversarial samples. To further enhance the quality of the perturbed images, we introduce the Adaptive Local Search Attack (ALSA), which utilizes adaptive perturbation based on visual quality and perceptual loss to ensure that the perturbed image remains as similar as possible to the original image in human vision, which can further enhance the stealthiness of adversarial samples and make perturbation attacks difficult to detect. Our experimental results show that the proposed method maintains high visual quality while achieving a better protection success rate across various text recognition models compared to existing methods.

Version published to 10.21203/rs.3.rs-5287761/v1 on Research Square
Oct 30, 2024

DR-SigAttack: Distribution-Relevant Signature Attack Withstands Defense Mechanisms for Offline Signature Verification

This article has 6 authors:
1. Wei Jia
2. Lidong Zheng
3. Jiaen Chen
4. Mingjian Zhang
5. Wu Da
6. Yuchen Zheng
This article has no evaluationsLatest version Dec 10, 2025
Improving Adversarial Robustness of DNNs via Margin-Based Label Encoding

This article has 3 authors:
1. Keji Han
2. Yun Li
3. Deqiang Li
This article has no evaluationsLatest version Dec 29, 2025
AdverFuse: Robust fusion of multimodal images based on dynamic attention and adversarial learning

This article has 5 authors:
1. Fangyan Zhang
2. Fan Zhang
3. Yingbing Liu
4. Fei Ma
5. Chunsheng Hu
This article has no evaluationsLatest version Dec 24, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

DR-SigAttack: Distribution-Relevant Signature Attack Withstands Defense Mechanisms for Offline Signature Verification

Improving Adversarial Robustness of DNNs via Margin-Based Label Encoding

AdverFuse: Robust fusion of multimodal images based on dynamic attention and adversarial learning