Topological basal ganglia model with dopamine-modulated spike-timing-dependent plasticity reproduces reinforcement learning, discriminatory learning, and neuropsychiatric disorders

Carlos Enrique Gutierrez
Jean Lienard
Benoit Girard
Hidetoshi Urakubo
Kenji Doya

Abstract

The basal ganglia (BG) are central to action selection and reinforcement learning, yet how the topological organization of the BG circuit with dopamine (DA) D1 and D2 receptors shapes learning remains unclear. We present a topologically organized spiking model of the macaque BG with cortico-striatal inputs organized into competing channels, D1/D2 medium spiny neurons (MSNs), three-factor DA-modulated spike-timing-dependent plasticity (STDP) at cortical synapses, asymmetric intra-striatal collaterals, and partially overlapping direct/indirect pathways. We validate resting activity and action selection, then study conditioning and generalization–discrimination learning using DA bursts (CS+) and dips (CS−).
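For readers unfamiliar with three-factor rules, the following is a minimal sketch of a DA-modulated STDP update, not the authors' implementation. It assumes a pair-based exponential STDP window, a DA signal measured relative to a baseline, and an opposite sign of DA modulation for D1 and D2 MSNs. All function names, parameters, and constants are hypothetical and chosen only for illustration.

```python
import math


def stdp_kernel(dt_ms, a_plus=1.0, a_minus=1.0, tau_ms=20.0):
    """Pair-based STDP window (assumed form); dt_ms = t_post - t_pre."""
    if dt_ms >= 0:
        # Pre-before-post pairing gives a positive eligibility contribution.
        return a_plus * math.exp(-dt_ms / tau_ms)
    # Post-before-pre pairing gives a negative contribution.
    return -a_minus * math.exp(dt_ms / tau_ms)


def three_factor_update(weight, spike_pairs, da_level, receptor,
                        da_baseline=1.0, lr=0.01, w_min=0.0, w_max=1.0):
    """Return the updated cortico-striatal weight.

    Factors 1 and 2 (pre and post spike timing) are summarized in an
    eligibility value; factor 3 is the DA deviation from baseline.
    Assumption: D1 synapses follow the sign of the DA deviation and
    D2 synapses follow the opposite sign.
    """
    if receptor not in ("D1", "D2"):
        raise ValueError("receptor must be 'D1' or 'D2'")
    eligibility = sum(stdp_kernel(t_post - t_pre)
                      for t_pre, t_post in spike_pairs)
    da_signal = da_level - da_baseline  # > 0 for a burst, < 0 for a dip
    sign = 1.0 if receptor == "D1" else -1.0
    new_weight = weight + lr * sign * da_signal * eligibility
    # Keep the weight inside hard bounds.
    return min(w_max, max(w_min, new_weight))


# One pre-before-post pairing (pre at 0 ms, post at 5 ms).
pairs = [(0.0, 5.0)]
w_d1_burst = three_factor_update(0.5, pairs, da_level=2.0, receptor="D1")
w_d2_burst = three_factor_update(0.5, pairs, da_level=2.0, receptor="D2")
w_d2_dip = three_factor_update(0.5, pairs, da_level=0.0, receptor="D2")
```

Under these assumptions, a DA burst paired with causal spiking potentiates the D1 synapse and depresses the D2 synapse, while a DA dip potentiates the D2 synapse. At baseline DA the weight is unchanged.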

Two structural determinants emerged. First, pathway overlap (λ) trades off selection efficiency and learning speed: higher overlap degrades GPi-based selection efficiency during conditioning, yet accelerates convergence during discrimination by strengthening D2 influence on GPi. Second, lateral inhibition from MSN-D2 to MSNs (κ) helps constrain competing actions but is not sufficient alone; robust discrimination requires DA-dip–dependent up-modulation of D2 collateral efficacy (η), which speeds and, at low overlap, enables convergence.

Simulations under Parkinsonian and schizophrenia-like settings produced distinct deficits. A hypodopaminergic “Parkinsonian” STDP regime (D1 LTP loss, D2 LTD loss) impaired conditioning and failed to enhance discrimination. In contrast, attenuated D2 plasticity during DA dips (modeling methamphetamine-induced changes or schizophrenia-related dysregulation) selectively disrupted discrimination while sparing conditioning.
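One way to picture the two perturbations is as filters on a DA-gated weight change. The sketch below is an illustrative assumption, not the authors' code: the regime names, the `dip_attenuation` factor, and the function itself are hypothetical. It only encodes what the text states, namely loss of D1 LTP and D2 LTD in the Parkinsonian regime, and attenuated D2 plasticity during DA dips in the schizophrenia-like regime.

```python
def apply_regime(delta_w, receptor, da_signal, regime="healthy",
                 dip_attenuation=0.2):
    """Filter a proposed weight change according to a plasticity regime.

    delta_w:   proposed weight change (> 0 is LTP, < 0 is LTD)
    receptor:  "D1" or "D2"
    da_signal: DA deviation from baseline (< 0 is a dip)
    """
    if receptor not in ("D1", "D2"):
        raise ValueError("receptor must be 'D1' or 'D2'")
    if regime == "healthy":
        return delta_w
    if regime == "parkinsonian":
        # D1 LTP loss: potentiation at D1 synapses is removed.
        if receptor == "D1" and delta_w > 0:
            return 0.0
        # D2 LTD loss: depression at D2 synapses is removed.
        if receptor == "D2" and delta_w < 0:
            return 0.0
        return delta_w
    if regime == "d2_dip_attenuated":
        # D2 plasticity is scaled down only while DA is below baseline.
        if receptor == "D2" and da_signal < 0:
            return dip_attenuation * delta_w
        return delta_w
    raise ValueError("unknown regime: " + regime)


# D1 potentiation after a burst is abolished in the Parkinsonian regime.
pd_d1 = apply_regime(0.01, "D1", da_signal=1.0, regime="parkinsonian")
# D2 potentiation after a dip is weakened in the dip-attenuated regime.
sz_d2 = apply_regime(0.01, "D2", da_signal=-1.0, regime="d2_dip_attenuated")
# D1 learning after a burst is untouched in the dip-attenuated regime.
sz_d1 = apply_regime(0.01, "D1", da_signal=1.0, regime="d2_dip_attenuated")
```

In this toy form, the dip-attenuated regime leaves burst-driven (CS+) learning intact and weakens only dip-driven D2 changes, which matches the reported pattern of spared conditioning and disrupted discrimination.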

Finally, we demonstrate efficient scaling on the Fugaku supercomputer to rodent and non-human primate–relevant sizes, supporting large-scale, biologically grounded BG simulations. Together, the results highlight how pathway overlap and D2 collateral dynamics jointly regulate the speed and reliability of discrimination learning, and how specific DA perturbations map to distinct learning impairments.

Version published to 10.1101/2025.11.10.687760 on bioRxiv
Nov 12, 2025

