Learning from rewards and social information in naturalistic strategic behavior

Ionatan Kuperwajs
Bas van Opheusden
Evan Russek
Tom Griffiths

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Acting intelligently in complex environments poses a challenging learning problem: faced with many different situations and possible actions, how do people learn which action to take in each situation? While traditional laboratory-based experiments have been used to study specific learning mechanisms, these experiments often employ relatively simple tasks conducted over a short period of time. Thus, it is unclear to what extent these mechanisms are used in the significantly more complex and temporally extended environments people encounter in their everyday lives. To understand the processes by which people learn policies to guide their decisions, we investigate the opening strategies of novice online chess players over their first months of play. We use a large online data set consisting of 2,499,783 games, providing us with the necessary scale to explore learning mechanisms in a complex setting. In particular, we focus on two types of learning: reinforcement learning, or learning from rewards given repeated experiences, and social learning, or learning from the actions of others. We show that players’ choices are modulated by both game outcomes and observing their opponents’ actions, and that they exhibit important hallmarks of adaptive decision-making such as exploration and expertise. Our results provide evidence that people use sophisticated learning algorithms in naturalistic strategic behavior.

Version published to 10.31234/osf.io/d8zje on OSF Preprints
Aug 14, 2024

Decision rule inference limits social escape from learning traps

This article has 3 authors:
1. Rheza Budiono
2. Catherine A. Hartley
3. Todd Matthew Gureckis
This article has no evaluationsLatest version Sep 17, 2025
Decision rule inference limits social escape from learning traps

This article has 3 authors:
1. Rheza Budiono
2. Catherine A. Hartley
3. Todd Matthew Gureckis
This article has no evaluationsLatest version Sep 17, 2025
Decision rule inference limits social escape from learning traps

This article has 3 authors:
1. Rheza Budiono
2. Catherine A. Hartley
3. Todd Matthew Gureckis
This article has no evaluationsLatest version Sep 17, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Decision rule inference limits social escape from learning traps

Decision rule inference limits social escape from learning traps

Decision rule inference limits social escape from learning traps