Exploring dominant strategies in iterated and evolutionary games: a multi-agent reinforcement learning approach

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Exploring dominant strategies in iterated games holds profound theoretical and practical significance across diverse domains. Previous studies, through mathematical analysis of limited cases, have unveiled classic strategies such as tit-for-tat, generous-tit-for-tat, win-stay-lose-shift, and zero-determinant strategies. While these strategies offer valuable insights into human decision-making, they represent only a small subset of possible strategies, constrained by limited mathematical and computational tools available to explore larger strategy spaces. To bridge this gap, we propose a novel approach using multi-agent reinforcement learning to delve into complex decision-making processes that go beyond human intuition. Our approach has led to the discovery of a new strategy, i.e., memory-two bilateral reciprocity (MTBR) strategy. MTBR consistently outperforms a wide range of strategies in pairwise interactions while achieving high payoffs. When introduced into an evolving population with diverse strategies, MTBR demonstrates dominance and fosters higher levels of cooperation and social welfare in both homogeneous and heterogeneous structures, as well as across various game types. This high performance is verified by simulations and mathematical analysis. Our work highlights the potential of multi-agent reinforcement learning in uncovering dominant strategies in complex environments, offering a new perspective on strategy exploration in iterated games.

Article activity feed