Error-driven representation learning in the mesolimbic system

George Cai
Max F. Scheller
Wolfgang Kelsch
Samuel J. Gershman

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

In reinforcement learning, an agent learns to map representations of the environment state to predictions of future reward. Most prior work in neuroscience has assumed a fixed representation and studied how reward prediction errors (thought to be conveyed by phasic dopamine signals) are used to update the mapping from representations to predictions. However, work in machine learning has demonstrated that much more powerful predictive systems can be learned by using the errors to update the representations themselves. We study whether the brain does something similar by leveraging simultaneous recordings of striatal projection neurons in the olfactory tubercle (putatively representing state features) and dopamine neurons in the ventral tegmental area. We show that trial-by-trial changes in striatal activity are more consistent with dopamine-driven representation learning than a variety of alternative updating schemes. This result suggests a convergence of representation learning principles in biological and artificial systems.

Version published to 10.64898/2026.05.18.725950 on bioRxiv
May 19, 2026

Phasic dopamine drives conditioned responding beyond its role in learning

This article has 4 authors:
1. Jay A. Hennig
2. Mark Burrell
3. Naoshige Uchida
4. Samuel J. Gershman
This article has no evaluationsLatest version Mar 25, 2026
Learning to select computations in recurrent neural circuits

This article has 8 authors:
1. Sixing Chen
2. Frederick Callaway
3. Sreejan Kumar
4. Shira M. Lupkin
5. Joni D. Wallis
6. Vincent B. McGinty
7. Erin L. Rich
8. Marcelo G. Mattar
This article has no evaluationsLatest version Apr 16, 2026
Dopamine dynamics in human anterior cingulate cortex during Pavlovian-instrumental conflict

This article has 19 authors:
1. Azadeh Nazemorroaya
2. Seth R. Batten
3. Itamar Grunfeld
4. Alexis Torres
5. Xavier Celaya
6. Olivia Moreland
7. Chloe Lattuca
8. Ava Wagle
9. Devin Nikjou
10. Leonardo S. Barbosa
11. Terry Lohrenz
12. Pearl Chiu
13. Gene A. Brewer
14. Samuel M. McClure
15. Mark R. Witcher
16. Robert W. Bina
17. P. Read Montague
18. Peter Dayan
19. Dan Bang
This article has no evaluationsLatest version Apr 6, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Phasic dopamine drives conditioned responding beyond its role in learning

Learning to select computations in recurrent neural circuits

Dopamine dynamics in human anterior cingulate cortex during Pavlovian-instrumental conflict