GradLIME: A CNN Local Interpretation Model Based on Feature Gradient Activation

Abstract

As deep learning technologies advance rapidly, there is a growing demand for greater transparency and reliability in neural network decision-making. This demand has spurred progress in the explainability of Convolutional Neural Networks (CNNs) in recent years, though significant challenges persist. Current explanation methods typically fall into two categories: those that rely entirely on the internal feature information of neural networks to construct explanations, and model-agnostic approaches based on visual concepts. The first category is limited by the highly abstract nature of the features embedded within neural networks and their fundamental differences from human reasoning, leading to inevitable deviations from human cognition. Model-agnostic methods, on the other hand, can explore a CNN's computational logic from a human-centric perspective, but their independence from any specific model makes it difficult to produce explanations directly linked to the network's computational structure; in some cases, these explanations may even deviate from the model's true underlying mechanisms. To address these issues, this paper proposes GradLIME, a local explanation model for CNNs based on feature gradient activation, built upon the Local Interpretable Model-agnostic Explanations (LIME) method. When constructing the local linear explanation model, GradLIME incorporates feature gradient activation data from multiple layers of the CNN, yielding a comprehensible local linear explanation that also fully utilises the embedded features pertaining to the network's computational structure. Finally, experiments on standard datasets provide qualitative and quantitative evaluations of the local explanations generated by GradLIME. The results demonstrate that, compared with numerous state-of-the-art methods that provide visual explanations, GradLIME is more effective at distinguishing important from unimportant features and at extracting accurate local explanations of CNN reasoning that are easier for humans to understand.
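
The abstract describes the method only at a high level; the sketch below is one possible reading of it, assuming that per-superpixel scores derived from gradient-times-activation maps of several CNN layers are folded into a LIME-style weighted linear surrogate. The layer selection, locality kernel, and the way the scores scale the interpretable features are illustrative assumptions, not the authors' reference implementation.

```python
# Hypothetical GradLIME-style sketch: LIME surrogate informed by gradient-activation maps.
import numpy as np
import torch
import torch.nn.functional as F
from sklearn.linear_model import Ridge
from skimage.segmentation import slic


def grad_activation_map(model, layers, image, target_class):
    """Average gradient*activation over the given layers, upsampled to input size."""
    acts = {}

    def make_hook(name):
        def hook(module, inputs, output):
            output.retain_grad()          # keep gradients of intermediate activations
            acts[name] = output
        return hook

    handles = [layer.register_forward_hook(make_hook(name))
               for name, layer in layers.items()]
    model(image.unsqueeze(0))[0, target_class].backward()
    for h in handles:
        h.remove()

    maps = []
    for a in acts.values():
        ga = (a.grad * a).relu().sum(dim=1, keepdim=True)           # (1, 1, h, w)
        maps.append(F.interpolate(ga, size=image.shape[1:], mode="bilinear"))
    return torch.cat(maps).mean(dim=0).squeeze(0).detach().numpy()  # (H, W)


def gradlime_explain(model, layers, image, target_class, n_samples=500):
    """LIME-style local surrogate whose interpretable features are scaled by
    per-superpixel gradient-activation scores (an assumed combination)."""
    # image: CPU float tensor of shape (C, H, W), values in [0, 1], no grad required.
    segments = slic(image.permute(1, 2, 0).numpy(), n_segments=50, start_label=0)
    n_seg = segments.max() + 1

    heat = grad_activation_map(model, layers, image, target_class)
    seg_score = np.array([heat[segments == s].mean() for s in range(n_seg)])
    seg_score = seg_score / (np.abs(seg_score).max() + 1e-8)

    masks = np.random.randint(0, 2, size=(n_samples, n_seg))
    masks[0] = 1                                     # include the unperturbed image
    preds = []
    with torch.no_grad():
        for m in masks:
            keep = torch.from_numpy(m[segments]).float()
            probs = torch.softmax(model((image * keep).unsqueeze(0)), dim=1)
            preds.append(probs[0, target_class].item())

    dist = np.linalg.norm(1 - masks, axis=1) / np.sqrt(n_seg)       # fraction removed
    sample_w = np.exp(-(dist ** 2) / 0.25)                          # locality kernel
    X = masks * seg_score                                           # gradient-activation scaling
    surrogate = Ridge(alpha=1.0).fit(X, preds, sample_weight=sample_w)
    return segments, surrogate.coef_                                # per-superpixel importance
```

For example, with a torchvision ResNet one might call `gradlime_explain(resnet, {"layer3": resnet.layer3, "layer4": resnet.layer4}, img, cls)`; the returned coefficients rank superpixels by their estimated contribution to the target class. Again, this is only an interpretation of the abstract, not the published algorithm.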
