Emergence of belief-like representations through reinforcement learning

Jay A. Hennig
Sandra A. Romero Pinto
Takahiro Yamaguchi
Scott W. Linderman
Naoshige Uchida
Samuel J. Gershman

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

[Demo list ] SCGB Investigators and Others June 2023 (mark2d2)

Abstract

To behave adaptively, animals must learn to predict future reward, or value. To do this, animals are thought to learn reward predictions using reinforcement learning. However, in contrast to classical models, animals must learn to estimate value using only incomplete state information. Previous work suggests that animals estimate value in partially observable tasks by first forming “beliefs”—optimal Bayesian estimates of the hidden states in the task. Although this is one way to solve the problem of partial observability, it is not the only way, nor is it the most computationally scalable solution in complex, real-world environments. Here we show that a recurrent neural network (RNN) can learn to estimate value directly from observations, generating reward prediction errors that resemble those observed experimentally, without any explicit objective of estimating beliefs. We integrate statistical, functional, and dynamical systems perspectives on beliefs to show that the RNN’s learned representation encodes belief information, but only when the RNN’s capacity is sufficiently large. These results illustrate how animals can estimate value in tasks without explicitly estimating beliefs, yielding a representation useful for systems with limited capacity.

Author Summary

Natural environments are full of uncertainty. For example, just because my fridge had food in it yesterday does not mean it will have food today. Despite such uncertainty, animals can estimate which states and actions are the most valuable. Previous work suggests that animals estimate value using a brain area called the basal ganglia, using a process resembling a reinforcement learning algorithm called TD learning. However, traditional reinforcement learning algorithms cannot accurately estimate value in environments with state uncertainty (e.g., when my fridge’s contents are unknown). One way around this problem is if agents form “beliefs,” a probabilistic estimate of how likely each state is, given any observations so far. However, estimating beliefs is a demanding process that may not be possible for animals in more complex environments. Here we show that an artificial recurrent neural network (RNN) trained with TD learning can estimate value from observations, without explicitly estimating beliefs. The trained RNN’s error signals resembled the neural activity of dopamine neurons measured during the same task. Importantly, the RNN’s activity resembled beliefs, but only when the RNN had enough capacity. This work illustrates how animals could estimate value in uncertain environments without needing to first form beliefs, which may be useful in environments where computing the true beliefs is too costly.

Version published to 10.1101/2023.04.04.535512 on bioRxiv
Apr 7, 2023

Confidence phenotypes: a unified computational account of value and decision certainty in reinforcement learning

This article has 3 authors:
1. Nicolás A. Comay
2. Guillermo Solovey
3. Pablo Barttfeld
This article has no evaluationsLatest version Feb 20, 2026
Learning to Model the World: A Survey of World Models in Artificial Intelligence

This article has 19 authors:
1. Jiahua Dong
2. Qi Lyu
3. Baichen Liu
4. Xudong Wang
5. Wenqi Liang
6. Duzhen Zhang
7. Jiahang Tu
8. Hongliu Li
9. Hanbin Zhao
10. Henghui Ding
11. Yulun Zhang
12. Zhi Han
13. Nicu Sebe
14. Fahad Shahbaz Khan
15. Salman Khan
16. Mubarak Shan
17. Philip Torr
18. Ming-Hsuan Yang
19. Dacheng Tao
This article has no evaluationsLatest version Mar 10, 2026
State Estimation as a Feasibility Condition for Cognition under Partial Observability

This article has 1 author:
1. Ian S. Howard
This article has no evaluationsLatest version Mar 20, 2026

Discuss this preprint

Listed in

Abstract

Author Summary

Article activity feed

Related articles

Confidence phenotypes: a unified computational account of value and decision certainty in reinforcement learning

Learning to Model the World: A Survey of World Models in Artificial Intelligence

State Estimation as a Feasibility Condition for Cognition under Partial Observability