Multi-Agent based Dynamic Anchors for Interpretation of Deep Learning Classifiers
Abstract
Explainable Artificial Intelligence (XAI) provides insights into how black-box models make decisions. Among existing approaches, anchors provide high-precision, human-interpretable rules in the form of simple if-then conditions over input features. Classical anchors compute discrete instance-wise rules using a bandit-guided beam search, without learning across instances or coordinating rules across classes. Consequently, they are fundamentally local and do not yield a coherent picture of the model's decision regions. We propose Reinforcement Learning Dynamic Anchors (RLDA), a reinforcement learning (RL) formulation of anchor discovery in which a policy learns to refine an axis-aligned box around an instance through a sequence of continuous actions, directly optimizing interpretable quantities such as precision and coverage. We then extend this framework to Multi-Agent Dynamic Anchors (MADA), a cooperative game with one or more agents per class, in which agents jointly learn class-wise anchor regions under shared rewards that encourage both local fidelity and global structure, operating under defined equilibrium conditions. The trained policies were applied to data samples to generate both instance- and class-level rules, which were then evaluated globally across all classes. Experiments on standard tabular datasets showed that, first, RLDA produces more precise rules with performance comparable to classical anchors while yielding reusable policies; and second, MADA yields class-wise rules with high precision, useful coverage, and reduced cross-class overlap, thereby providing a more global and structured explanation of the classifier.
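To make the optimized quantities concrete, the sketch below shows how the precision and coverage of an axis-aligned anchor box can be computed on tabular data. This is an illustrative sketch only, not the paper's implementation; the function name, the toy threshold classifier, and the specific box bounds are assumptions introduced here for demonstration.

```python
import numpy as np

def anchor_precision_coverage(X, model_predict, box_lo, box_hi, target_class):
    """Precision and coverage of an axis-aligned box rule (illustrative).

    A sample satisfies the anchor when every feature lies in [box_lo, box_hi].
    Precision is the fraction of covered samples the classifier assigns to
    `target_class`; coverage is the fraction of all samples the box covers.
    """
    inside = np.all((X >= box_lo) & (X <= box_hi), axis=1)
    coverage = float(inside.mean())
    if not inside.any():
        return 0.0, 0.0
    preds = model_predict(X[inside])
    precision = float(np.mean(preds == target_class))
    return precision, coverage

# Toy example: 2-D uniform data with a hypothetical threshold classifier
# that predicts class 1 iff feature 0 exceeds 0.5.
rng = np.random.default_rng(0)
X = rng.uniform(0.0, 1.0, size=(1000, 2))
model = lambda x: (x[:, 0] > 0.5).astype(int)

# A box requiring feature 0 >= 0.6 lies entirely inside the class-1 region,
# so its precision is 1.0 and its coverage is roughly 0.4.
prec, cov = anchor_precision_coverage(
    X, model,
    box_lo=np.array([0.6, 0.0]),
    box_hi=np.array([1.0, 1.0]),
    target_class=1,
)
```

An RL agent as described above would adjust `box_lo` and `box_hi` through continuous actions, with a reward built from these two quantities.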