Policy-Gradient Reinforcement Learning as a General Theory of Practice-Based Motor Skill Learning

Adrian M. Haith

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Mastering any new skill requires extensive practice, but the computational principles underlying this learning are not clearly understood. Existing theories of motor learning can explain short-term adaptation to perturbations, but offer little insight into the processes that drive gradual skill improvement through practice. Here, we propose that practice-based motor skill learning can be understood as a form of reinforcement learning (RL), specifically, policy-gradient RL, a simple, model-free method that is widely used in robotics and other continuous control settings. Here, we show that models based on policy-gradient learning rules capture key properties of human skill learning across a diverse range of learning tasks that have previously lacked any computational theory. We suggest that policy-gradient RL can provide a general theoretical framework and foundation for understanding how humans hone skills through practice.

Version published to 10.1101/2025.10.17.682587 on bioRxiv
Oct 20, 2025

A Brief Tutorial on Reinforcement Learning: From MDP to DDPG

This article has 1 author:
1. Tian Zhang
This article has no evaluationsLatest version Jan 6, 2026
Prediction in action: toward an empirical science of active inference

This article has 3 authors:
1. Jet Lageman
2. Johannes Jacobus Fahrenfort
3. Heleen A Slagter
This article has no evaluationsLatest version Jan 26, 2026
Spaced Practice and Reactive Inhibition Have Limited or No effects on Motor Sequence Learning

This article has 2 authors:
1. Mohan W. Gupta
2. Timothy C. Rickard
This article has no evaluationsLatest version Feb 2, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Brief Tutorial on Reinforcement Learning: From MDP to DDPG

Prediction in action: toward an empirical science of active inference

Spaced Practice and Reactive Inhibition Have Limited or No effects on Motor Sequence Learning