Policy complexity suppresses dopamine responses

Samuel J. Gershman
Armin Lak

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Limits on information processing capacity impose limits on task performance. We show that animals achieve performance on a perceptual decision task that is near-optimal given their capacity limits, as measured by policy complexity (the mutual information between states and actions). This behavioral profile could be achieved by reinforcement learning with a penalty on high complexity policies, realized through modulation of dopaminergic learning signals. In support of this hypothesis, we find that policy complexity suppresses midbrain dopamine responses to reward outcomes, thereby reducing behavioral sensitivity to these outcomes. Our results suggest that policy compression shapes basic mechanisms of reinforcement learning in the brain.

Version published to 10.1101/2024.09.15.613150v1 on bioRxiv
Sep 16, 2024

Striatal indirect pathway mediates hesitation

This article has 3 authors:
1. Matthew A Geramita
2. Susanne E Ahmari
3. Eric A Yttri
This article has no evaluationsLatest version Sep 16, 2024
Individual differences in decision-making shape how mesolimbic dopamine regulates choice confidence and change-of-mind

This article has 3 authors:
1. Adrina Kocharian
2. A. David Redish
3. Patrick E. Rothwell
This article has no evaluationsLatest version Sep 16, 2024
Policy optimization emerges from noisy representation learning

This article has 3 authors:
1. Jonah W. Brenner
2. Chenguang Li
3. Gabriel Kreiman
This article has no evaluationsLatest version Nov 3, 2024

Listed in

Abstract

Article activity feed

Related articles

Striatal indirect pathway mediates hesitation

Individual differences in decision-making shape how mesolimbic dopamine regulates choice confidence and change-of-mind

Policy optimization emerges from noisy representation learning