Reinforcement Learning versus Natural Language Programs: Where is Flexible Planning and Problem Solving in Natural Intelligence Coming From?

Hokin Deng

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Flexible planning and problem solving are hallmarks of intelligence. Cognitive computational neuroscientists propose that reinforcement learning (RL) is the most-likely candidate as the computational substrate of flexible behaviors. Considerable experimental effort has been dedicated in identifying the biophysical correlate of RL algorithms in animals and humans. An important assumption in these endeavors is that animals and humans solve complex tasks in the shared computational way. We argue that model-based reinforcement learning is not a very good framework for intelligent behaviors in both animals and humans. Our argument is that while animals might solve complex problems in non-linguistic algorithmized form, such as model-free RL, humans utilize language as expressible heuristics to reason and solve problems. Recently, roboticists show that using natural language programs as the computational substrate is the secret weapon for solving difficult long-horizon planning problems in robots, such as making a cup of tea. In this paper, we argue 1. natural language program is the computational substrate for humans and artificial agents, such as Large Language Models, solving complex problems and tasks; 2. model-free reinforcement learning (RL) and its variants is the most probable computational substrate for complex flexible behaviors in animals.

Version published to 10.31219/osf.io/dhm67_v1 on OSF Preprints
Jul 17, 2025

The role of reinforcement learning in pragmatic reasoning tasks: Modeling and validating the sources of individual differences

This article has 3 authors:
1. John Duff
2. Alexandra Mayn
3. Vera Demberg
This article has no evaluationsLatest version Aug 5, 2025
WITHDRAWN

This article has no evaluationsLatest version Jul 26, 2025
Towards Robots that Learn from Humans

This article has 1 author:
1. Dylan Losey
This article has no evaluationsLatest version Jul 21, 2025

Listed in

Abstract

Article activity feed

Related articles

The role of reinforcement learning in pragmatic reasoning tasks: Modeling and validating the sources of individual differences

WITHDRAWN

Towards Robots that Learn from Humans