Interpreting BERT Using LIME and SHAP
Abstract
Transformer-based language models such as BERT have achieved state-of-the-art performance on diverse natural language processing tasks, yet their decision processes remain opaque. This paper presents a comprehensive framework for interpreting BERT’s predictions in multi-label text classification using two leading model-agnostic explainability techniques—Local Interpretable Model-Agnostic Explanations (LIME) and SHapley Additive exPlanations (SHAP). An end-to-end pipeline for fine-tuning BERT and producing token-level attributions is introduced. We systematically compare the explainers with respect to local fidelity, global consistency, stability and computational cost. Experimental results suggest that LIME generates intuitive, case-specific explanations while SHAP provides theoretically grounded and globally consistent attributions. By integrating the complementary strengths of both methods, we propose a hybrid interpretation strategy that balances interpretability, scalability and accuracy. The methodology is illustrated through a case study on multi-label genre classification from movie plot summaries. Detailed guidelines and synthetic visualisations are provided to enable practitioners to apply these techniques effectively and responsibly.
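To make the described pipeline concrete, the sketch below shows one plausible way to wrap a fine-tuned multi-label BERT classifier in a probability function and pass it to both LIME and SHAP. It is a minimal illustration, not the paper's exact implementation: the checkpoint name "bert-genre-multilabel", the genre label set, and the example plot summary are all hypothetical placeholders.

```python
# Minimal sketch (assumptions: a BERT checkpoint fine-tuned for multi-label genre
# classification is available as "bert-genre-multilabel"; labels and the example
# plot summary below are illustrative, not taken from the paper).
import torch
import shap
from lime.lime_text import LimeTextExplainer
from transformers import AutoTokenizer, AutoModelForSequenceClassification

GENRES = ["action", "comedy", "drama", "horror", "romance"]  # hypothetical label set

tokenizer = AutoTokenizer.from_pretrained("bert-genre-multilabel")
model = AutoModelForSequenceClassification.from_pretrained("bert-genre-multilabel")
model.eval()

def predict_proba(texts):
    """Return per-label sigmoid probabilities for a batch of raw strings."""
    enc = tokenizer(list(texts), padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        logits = model(**enc).logits
    return torch.sigmoid(logits).numpy()  # shape: (n_texts, n_labels)

plot = "A retired detective is pulled back in for one last heist that goes wrong."

# LIME: perturb the input locally and fit a sparse linear surrogate per label.
lime_explainer = LimeTextExplainer(class_names=GENRES)
lime_exp = lime_explainer.explain_instance(
    plot, predict_proba, num_features=10, labels=list(range(len(GENRES)))
)
print(lime_exp.as_list(label=0))  # (token, weight) pairs for the first genre

# SHAP: Shapley-value attributions over a text masker built from the tokenizer.
shap_explainer = shap.Explainer(predict_proba, shap.maskers.Text(tokenizer))
shap_values = shap_explainer([plot])
print(shap_values.values.shape)  # (1, n_tokens, n_labels)
```

In this setup the same `predict_proba` wrapper serves both explainers, which keeps the LIME and SHAP attributions comparable: any difference between them reflects the explanation method rather than a difference in how the model was queried.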