Delayed reward information is underweighted in reinforcement learning with dispersed feedback
Abstract
Learning is fundamental to adaptive behavior. In a typical learning task, each action is associated with only one outcome, which may be immediate or delayed. However, actions often have multiple consequences that unfold over time. Here, we used behavioral and eye-tracking experiments to study how people learn when their choices yield both immediate and delayed reward information. Importantly, the rewards themselves were all delivered at the end of the study, so there was no reason to weight immediate and delayed reward information differently. Nevertheless, we found that our subjects overweighted immediate reward information. Moreover, this bias increased over the course of the experiment and was still present when learning from others’ choices. The gaze data provide mixed evidence that subjects looked more at immediate than at delayed feedback, and across subjects, the relative dwell proportion did not predict the behavioral bias. Our results indicate that people prioritize not just immediate rewards, but immediate reward information. Unlike temporal discounting, this form of impatience is a clear mistake and leads to objectively worse outcomes.