AIR: Activation based Isotropic Regularisation

Abstract

In deep learning, regularisation plays a significant role in preventing overfitting and improving model generalisation by controlling model complexity. Traditional approaches such as weight decay (L2 regularisation) primarily constrain the magnitude of the model parameters (weights), whereas more recent methods such as gradient variance regularisation (GVR) take gradients into account to stabilise the optimisation process. In this paper, we introduce Activation based Isotropic Regularisation (AIR), a novel regularisation approach that explicitly minimises the variance of activations, using the concept of subspaces corresponding to different training samples. AIR complements weight-based and gradient-based methods by promoting more stable feature representations, which in turn enhance generalisation. Furthermore, we propose a hybrid variant, AIR+L2, that combines AIR with traditional L2 weight decay. This combination leverages the strengths of both methods: AIR reduces feature-level fluctuations, while L2 prevents over-parameterisation. Extensive experiments on benchmark datasets using both MLP and CNN architectures demonstrate that AIR consistently improves model robustness and convergence, and that AIR+L2 achieves superior performance compared to either method in isolation.
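The abstract does not give the exact formulation of the AIR penalty, but the core idea of penalising the variance of activations across training samples can be sketched as follows. This is a minimal illustration, assuming the penalty is the mean per-feature variance of a hidden layer's activations over a mini-batch; the function name `air_penalty` and the coefficients `lam_air` and `lam_l2` are hypothetical, not taken from the paper.

```python
import numpy as np

def air_penalty(activations: np.ndarray) -> float:
    """Sketch of an activation-variance penalty (assumed form, not the
    paper's exact objective).

    activations: array of shape (batch, features) holding one hidden
    layer's outputs for a mini-batch. Returns the per-feature variance
    across the batch, averaged over features.
    """
    # Variance of each feature across training samples, then the mean.
    return float(np.mean(np.var(activations, axis=0)))

# Hypothetical usage: combine a task loss with AIR and L2 terms, as in
# the AIR+L2 variant described in the abstract.
rng = np.random.default_rng(0)
acts = rng.normal(size=(32, 64))      # stand-in hidden activations
weights = rng.normal(size=(64, 10))   # stand-in layer weights
task_loss = 1.0                       # placeholder task loss value
lam_air, lam_l2 = 0.1, 1e-4           # hypothetical coefficients
total_loss = (task_loss
              + lam_air * air_penalty(acts)
              + lam_l2 * float(np.sum(weights ** 2)))
```

In a training loop, the activations would typically be captured from intermediate layers during the forward pass, and `total_loss` backpropagated as usual.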
