Strategic variability in humans, pigeons, and rats.

Janne Reynders
Tom Verguts
Senne Braem

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Strategically variable behavior can be advantageous in various fields such as sports (unpredictability), art (creativity), science (innovation), and problem-solving (thinking outside the box). Although previous studies identified experimental conditions under which humans and non-human animals show increased variable decision-making, we have only a limited understanding of its underlying cognitive mechanisms. Using a reinforcement learning model, we simulate the use of three different theorized strategies in an adversarial reward learning environment that requires very high variability. Model simulations with a policy-gradient meta-learning algorithm show that agents could respond more optimally in such environments by (1) relying on a stochastic generator, (2) increasing one’s learning rate to allow for faster interactions between reinforcement learning and extinction, or (3) strategically upvalue unchosen actions using a frequency-based memory. After demonstrating the theoretical benefit of each of these strategies, we fitted our model on existing datasets of human-, pigeons- and rat behavior in adversarial environments. We show that, while all three species can engage in highly variable behavior, only humans strategically upvalue unchosen actions as a strategy to achieve variability.

Version published to 10.31234/osf.io/4fnjz_v2 on OSF Preprints
Mar 14, 2025
Version published to 10.31234/osf.io/4fnjz on OSF Preprints
Jan 6, 2025

Prompt Carefully! ChatGPT Displays Rule-Based Insensitivity to Contingencies

This article has 2 authors:
1. Francisco J. Ruiz
2. Verónica Cardona-Betancourt
This article has no evaluationsLatest version Jan 17, 2026
Action framing modulates the Pavlovian bias in go and no-go learning

This article has 2 authors:
1. Zhang Chen
2. Senne Braem
This article has no evaluationsLatest version Dec 2, 2025
Flexible selection of working memory representations to reduce cognitive cost

This article has 7 authors:
1. Jingjie Li
2. Ariel Ziqian Xu
3. Chaofei Bao
4. Albert Albesa-González
5. Liujunli Li
6. Claudia Clopath
7. Jeffrey C. Erlich
This article has no evaluationsLatest version Nov 29, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Prompt Carefully! ChatGPT Displays Rule-Based Insensitivity to Contingencies

Action framing modulates the Pavlovian bias in go and no-go learning

Flexible selection of working memory representations to reduce cognitive cost