Impaired Reward-Based Learning but Preserved Motor Invigoration in Chronic Stroke

S. Sporn
M. Herrojo Ruiz
R. Fathana
C. Zich
S. Bestmann
N. S. Ward

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Reward provides a feedback signal that modulates behaviour through several mechanisms, including invigorating performance and learning of action–outcome associations to guide future choices. After stroke, the ability to utilise reward feedback can be impaired, which may limit the benefits of rehabilitation approaches that use reinforcement. One possibility is that stroke causes a global impairment of reward processing, leading to both reduced invigoration and diminished learning from feedback. Alternatively, reward processing may be selectively disrupted, such that either invigoration or the ability to update beliefs from reward feedback is disproportionately affected.

To test these competing hypotheses, we recruited forty chronic stroke survivors and thirty age-matched healthy controls to complete a probabilistic reversal learning task with both their strong (non-paretic/dominant) and weak (paretic/non-dominant) limb. On each trial, participants reached to one of two targets associated with different reward probabilities that changed unpredictably over time, requiring continued monitoring of outcomes and adaptation of choice behaviour.

Stroke survivors showed reduced reward-based learning compared to controls, expressed as lower overall choice accuracy and a greater tendency to switch responses after rewarded trials (i.e., lower win–stay rates), particularly when using the weak upper limb. Control analyses confirmed that these selective impairments were not explained by general motor impairment or cognitive deficits. To identify the putative computations underlying these behavioural differences in reward-based learning we used an established model of hierarchical Bayesian inference, the Hierarchical Gaussian Filter (HGF). The HGF characterises learning dynamics as trial-by-trial updating of an agent’s beliefs about action–outcome probabilities and their change over time (environmental volatility). Compared to healthy controls, stroke survivors were slower to update their beliefs about action–reward contingencies, an effect most pronounced for the weak upper limb, whereas updating beliefs about environmental volatility remained intact. Reward-based invigoration was also preserved: strong trial-by-trial predictions about action–reward contingencies were associated with faster movement times, with comparable slopes of this association across groups, indicating that motivational drive was maintained in patients despite overall slower performance.

This behavioural dissociation between preserved motivational invigoration but impaired probabilistic reward-based learning highlights a key translational opportunity: to leverage intact motivational pathways to enhance rehabilitation intensity and compliance, and to develop adaptive feedback strategies that compensate for impaired reward learning. Harnessing these complementary approaches could strengthen recovery outcomes and support greater long-term independence after stroke.

Version published to 10.1101/2025.10.31.25338954 on medRxiv
Nov 4, 2025

Neurocomputational mechanisms underlying the distinct motivational influences of reward and punishment on cognitive control

This article has 9 authors:
1. Debbie M Yee
2. Mahalia Prater Fahey
3. Ziwei M Cheng
4. Xiamin Leng
5. Joonhwa Kim
6. Maisy Tarlow
7. Kaitlyn Mundy
8. Samuel Nevins
9. Amitai Shenhav
This article has no evaluationsLatest version Oct 17, 2025
Reward-induced endogenous pain inhibition scales with action-outcome certainty in humans

This article has 4 authors:
1. Fabrice Hubschmid
2. Simon Desch
3. Esther Florin
4. Susanne Becker
This article has no evaluationsLatest version Nov 1, 2025
Sex differences in task engagement and lapse rate during reward learning plateaus

This article has 9 authors:
1. C.G. Aguirre
2. J.H. Woo
3. L. Alhabbal
4. T. Fujioka
5. R. Moore
6. T. Ye
7. J.J. Castrellon
8. A. Soltani
9. A. Izquierdo
This article has no evaluationsLatest version Oct 31, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Neurocomputational mechanisms underlying the distinct motivational influences of reward and punishment on cognitive control

Reward-induced endogenous pain inhibition scales with action-outcome certainty in humans

Sex differences in task engagement and lapse rate during reward learning plateaus