From Individual Decisions to Team Emergence: A Survey on Explainable Cooperative Multi-Agent Reinforcement Learning
Abstract
Multi-Agent Reinforcement Learning (MARL) holds significant promise for cooperative decision-making, yet its reliance on deep neural networks (DNNs) creates "black-box" characteristics that impede trustworthy deployment in high-stakes scenarios. This lack of transparency complicates tracing decision logic and raises concerns about safety and accountability. This survey systematically reviews Explainable MARL (XMARL) for cooperative settings, deconstructing the decision-making chain from individual agent policies to collective team behavior. To address the absence of a unified framework, we introduce a novel multi-level taxonomy encompassing microscopic individual behavior, interaction mechanisms, team strategy emergence, and system-level performance. We organize core explanatory questions and technical approaches within this structure, summarize the principles and limitations of representative methods, and critically discuss key challenges such as evaluation standards, causal reasoning integration, and deployment adaptability. Our goal is to provide both a theoretical foundation and technical guidance for building transparent, trustworthy, and verifiably cooperative multi-agent systems (MASs).