Optimizing Explainability-Accuracy Trade-offs in Deep Neural Networks via Constrained Information Bottleneck Regularization
Abstract
The increasing complexity of deep neural networks (DNNs) has created a pressing need to manage the trade-off between model accuracy and explainability. In this paper, we introduce a novel framework employing constrained information bottleneck regularization to explicitly balance these two competing objectives. Our methodology formalizes the relationship between accuracy and explainability as a constrained optimization problem, enabling the development of interpretable models without sacrificing predictive power. We develop the mathematical underpinnings of the approach, detailing the use of dual decomposition techniques and differentiable surrogate objectives for efficient implementation. Comprehensive empirical evaluations on vision and language benchmarks demonstrate significant improvements in the explainability-accuracy trade-off over state-of-the-art methods. Our findings show that the framework can produce models that are high-performing while adhering to stringent explainability constraints. Ultimately, this work aims to catalyze a shift within the AI community toward reliable, transparent, and interpretable AI systems, supporting their responsible deployment in high-stakes applications.
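To make the abstract's description concrete, the sketch below shows one plausible instantiation of a constrained information bottleneck loss: the standard Gaussian KL divergence serves as a differentiable upper bound on the bottleneck term I(X; Z), and the constraint is enforced with a Lagrange multiplier updated by projected dual ascent. This is a minimal illustration, not the paper's implementation; the class and parameter names (ConstrainedIBLoss, kl_budget, dual_lr) are hypothetical, and the paper's actual explainability surrogate and decomposition scheme may differ.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class ConstrainedIBLoss(nn.Module):
    """Sketch of a constrained information-bottleneck objective.

    The Gaussian KL term KL(q(z|x) || N(0, I)) is a standard differentiable
    upper bound on I(X; Z); the constraint I(X; Z) <= kl_budget is enforced
    via a Lagrange multiplier updated by projected dual ascent.
    """

    def __init__(self, kl_budget: float = 0.1, dual_lr: float = 1e-2):
        super().__init__()
        self.kl_budget = kl_budget                    # constraint level C (assumed)
        self.dual_lr = dual_lr                        # dual-ascent step size (assumed)
        self.register_buffer("lam", torch.zeros(()))  # multiplier, kept >= 0

    def forward(self, logits, targets, mu, logvar):
        # KL(N(mu, diag(exp(logvar))) || N(0, I)), averaged over the batch
        kl = 0.5 * (mu.pow(2) + logvar.exp() - logvar - 1.0).sum(dim=1).mean()
        ce = F.cross_entropy(logits, targets)
        # Lagrangian: task loss + multiplier * (constraint violation)
        loss = ce + self.lam * (kl - self.kl_budget)
        # Dual ascent with projection onto lam >= 0: the multiplier grows
        # only while the KL term exceeds its budget, and decays otherwise.
        with torch.no_grad():
            new_lam = self.lam + self.dual_lr * (kl - self.kl_budget)
            self.lam.copy_(new_lam.clamp_min(0.0))
        return loss
```

In this setup, a stochastic encoder would output (mu, logvar), the representation z would be sampled via the reparameterization trick, and a classifier head would map z to logits. Freezing the multiplier at a fixed value beta recovers the familiar unconstrained IB Lagrangian as a special case, which is what makes the constrained formulation a strict generalization.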