Robust Deep Active Learning via Distance-Measured Data Mixing and Adversarial Training

Shinan Song
Xing Wang
Shike Dong
Jingyan Jiang

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Accurate uncertainty estimation in unlabeled data represents a fundamental challenge in active learning. Traditional deep active learning approaches suffer from a critical limitation: uncertainty-based selection strategies tend to concentrate excessively around noisy decision boundaries, while diversity-based methods may miss samples that are crucial for decision-making. This over-reliance on confidence metrics when employing deep neural networks as backbone architectures often results in suboptimal data selection. We introduce Distance-Measured Data Mixing (DM2), a novel framework that estimates sample uncertainty through distance-weighted data mixing to capture inter-sample relationships and the underlying data manifold structure. This approach enables informative sample selection across the entire data distribution while maintaining focus on near-boundary regions without overfitting to the most ambiguous instances. To address noise and instability issues inherent in boundary regions, we propose a boundary-aware feature fusion mechanism integrated with fast gradient adversarial training. This technique generates adversarial counterparts of selected near-boundary samples and trains them jointly with the original instances, thereby enhancing model robustness and generalization capabilities under complex or imbalanced data conditions. Comprehensive experiments across diverse tasks, model architectures, and data modalities demonstrate that our approach consistently surpasses strong uncertainty-based and diversity-based baselines while significantly reducing the number of labeled samples required for effective learning.

Version published to 10.3390/e27111159
Nov 14, 2025
Version published to 10.20944/preprints202510.0404.v1
Oct 7, 2025

Improving Adversarial Robustness of DNNs via Margin-Based Label Encoding

This article has 3 authors:
1. Keji Han
2. Yun Li
3. Deqiang Li
This article has no evaluationsLatest version Dec 29, 2025
Hybrid Neural Tangent Kernel–SGD Optimization for Robust and Scalable Deep Learning Across Medical, Sensor, and Image Domains

This article has 1 author:
1. Ahmed Mubaraki
This article has no evaluationsLatest version Dec 31, 2025
Probabilistic von Mises–Fisher Representation Learning forFew-Shot Remote Sensing Scene Classification

This article has 5 authors:
1. Zhong Ji
2. Ci Liu
3. Hongsheng Zhang
4. Chen Tang
5. Yanwei Pang
This article has no evaluationsLatest version Jan 7, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Improving Adversarial Robustness of DNNs via Margin-Based Label Encoding

Hybrid Neural Tangent Kernel–SGD Optimization for Robust and Scalable Deep Learning Across Medical, Sensor, and Image Domains

Probabilistic von Mises–Fisher Representation Learning forFew-Shot Remote Sensing Scene Classification