VTA dopamine neuron activity produces spatially organized value representations
Abstract
How does the activity of midbrain dopamine (DA) neurons reinforce actions? A prominent hypothesis is that the activity of ventral tegmental area (VTA) DA neurons instructs representations of predicted reward, or value, in downstream neurons1. To directly test this model, we performed comprehensive striatal recordings in mice engaged in a trial-and-error probabilistic learning task in which they continuously adapted their choices to obtain a reward consisting of optogenetic stimulation of VTA DA neurons (paired with an auditory cue). We then assessed neural representations of action values (estimated from a behavioral model), revealing for the first time that VTA DA stimulation is sufficient to generate downstream neural correlates of action value. Surprisingly, these value correlates were strongest in the intermediate caudoputamen (CP) and weakest in the nucleus accumbens (NAc), despite the NAc being the major projection target of VTA DA neurons2,3. This was true not only for the value of each choice, but also for state value (reward expectation) and relative value (the decision variable). However, value representations were differentially organized within the intermediate CP, with ventromedial domains (which receive inputs from orbitofrontal cortex) preferentially encoding state value and dorsolateral domains (which receive inputs from motor cortex) preferentially encoding relative value. A difference in learning rate for the value computation between the NAc and CP did not explain the relatively weak value correlates in the NAc. Instead, we found that VTA DA stimulation was sufficient to produce learned neural responses to the stimulation-paired auditory cue throughout the striatum, including in the NAc, and that animals worked for this cue rather than for VTA DA stimulation itself.
Overall, this suggests that VTA DA neurons support trial-and-error learning indirectly, by making stimuli valuable ('conditioned reinforcers'), which in turn support the generation of action value representations in the CP.
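The abstract refers to a behavioral model that estimates action values, together with a learning rate, a relative value serving as the decision variable, and a state value (reward expectation). The preprint does not specify the model's form here, but a standard delta-rule (Q-learning-style) formulation is the conventional choice for this kind of two-choice probabilistic task; the sketch below is an illustration under that assumption, not the authors' actual model, and all function names and parameter values are hypothetical.

```python
import math

def update_action_value(q, action, reward, alpha=0.1):
    """Delta-rule update: nudge the chosen action's value toward the
    received outcome by a fraction alpha (the learning rate)."""
    q = dict(q)  # copy so each trial's values are preserved if needed
    q[action] += alpha * (reward - q[action])
    return q

def prob_choose_left(q, beta=3.0):
    """Softmax choice rule driven by the relative value
    (q_left - q_right), i.e. the decision variable."""
    relative_value = q["left"] - q["right"]
    return 1.0 / (1.0 + math.exp(-beta * relative_value))

# Simulated block in which only 'left' is rewarded: the left action
# value climbs toward 1, and choice probability follows.
q = {"left": 0.0, "right": 0.0}
for _ in range(20):
    q = update_action_value(q, "left", reward=1.0)

# One common proxy for state value (reward expectation) is the
# value of the better available action.
state_value = max(q.values())
```

After 20 rewarded left choices with alpha = 0.1, the left action value reaches 1 - 0.9**20 (about 0.88), so the softmax choice probability is strongly biased toward left. Differences in alpha between regions correspond to the "difference in learning rate for the value computation" the abstract rules out as an explanation.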