Predictive learning enables compositional representations
Abstract
The brain builds predictive models to plan future actions. These models generalize remarkably well to new environments, but it is unclear how neural circuits acquire this flexibility. Here, we show that compositional representations emerge in recurrent neural networks (RNNs) trained solely to predict future sensory inputs. Such representations have been observed in several brain areas; in the motor cortex of monkeys, for example, movement primitives are reused across sequences. They enable compositional generalization, a mechanism that could explain the brain's adaptability: independent modules representing different parts of the environment can be selected according to context. We trained an RNN to predict future frames in a visual environment defined by independent latent factors and their corresponding dynamics. We found that the network learned to solve this task by developing a compositional internal model. Specifically, it formed disentangled representations of the static latent factors and distinct, modular clusters, each selectively implementing a single dynamic. This modular and disentangled architecture enabled the network to generalize compositionally, accurately predicting outcomes in novel contexts composed of unseen combinations of dynamics. Our findings present a powerful, unsupervised mechanism for learning the causal structure of an environment, suggesting that predicting the future can be sufficient to develop generalizable world models.
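The training setup described above can be illustrated with a minimal sketch. The code below is not the authors' implementation; it is a toy stand-in under several stated assumptions: the environment is reduced to a low-dimensional observation built from two independent latent factors, each with its own dynamic (a steady rotation and a slow oscillation), and the RNN is an echo-state-style network in which only a linear readout is trained (by ridge regression) on the purely predictive objective of mapping the current hidden state to the next frame, rather than a fully trained RNN as in the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy environment: two independent latent factors, each with its own dynamic.
# (Hypothetical stand-in for the paper's visual environment.)
def generate(T, w_a=0.1, w_b=0.05):
    t = np.arange(T)
    a = w_a * t               # latent 1: steady rotation
    b = np.sin(w_b * t)       # latent 2: slow oscillation
    # Observation: a simple "frame" mixing both latents.
    return np.stack([np.cos(a), np.sin(a), b, b * np.cos(a)], axis=1)

X = generate(2000)

# Echo-state-style RNN: fixed random recurrence, trained linear readout only.
N = 200
W_in = rng.normal(0.0, 0.5, (N, X.shape[1]))
W = rng.normal(0.0, 1.0, (N, N))
W *= 0.9 / np.max(np.abs(np.linalg.eigvals(W)))  # spectral radius < 1

h = np.zeros(N)
H = np.zeros((len(X), N))
for t in range(len(X)):
    h = np.tanh(W @ h + W_in @ X[t])
    H[t] = h

# Predictive objective: read out the NEXT frame from the current hidden state.
train = slice(100, 1500)       # drop the transient, hold out the tail
ridge = 1e-4
A = H[train]
Y = X[train.start + 1 : train.stop + 1]
W_out = np.linalg.solve(A.T @ A + ridge * np.eye(N), A.T @ Y)

# One-step prediction error on held-out timesteps.
pred = H[1500:-1] @ W_out
err = np.mean((pred - X[1501:]) ** 2)
print(f"held-out one-step MSE: {err:.5f}")
```

A low held-out error here only shows that the predictive objective is learnable in this toy setting; probing for disentangled latent codes and dynamic-specific clusters, as in the paper, would require inspecting the hidden-state structure of a fully trained network.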
Significance
The brain can function in environments it has never encountered before, an ability called generalization: for example, it can predict the future in environments whose dynamics it has never seen. In our study, we show that when a neural network model is trained to predict future observations, it develops a modular structure that supports one type of generalization, called compositional generalization, in which known dynamics can be recombined in new ways. Indeed, the network develops clusters that each perform a computation representing one dynamic. Importantly, it can recombine these clusters according to the input it receives, with no need for additional cues. We also show that the network forms distinct clusters of neurons coding for each latent factor of the environment.