Policy optimization emerges from noisy representation learning

Jonah W. Brenner
Chenguang Li
Gabriel Kreiman

Abstract

Nervous systems learn representations of the world and policies to act within it. We present a framework that uses reward-dependent noise to facilitate policy optimization in representation learning networks. These networks balance extracting normative features and task-relevant information to solve tasks. Moreover, their representation changes reproduce several experimentally observed shifts in the neural code during task learning. Our framework presents a biologically plausible mechanism for emergent policy optimization amid evidence that representation learning plays a vital role in governing neural dynamics. Code is available at: NeuralThermalOptimization.

Version published to 10.1101/2024.11.01.621621v1 on bioRxiv
Nov 3, 2024

Related articles

A Neural Circuit Framework for Economic Choice: From Building Blocks of Valuation to Compositionality in Multitasking

This article has 3 authors:
1. Aldo Battista
2. Camillo Padoa-Schioppa
3. Xiao-Jing Wang
This article has no evaluations. Latest version: Mar 13, 2025

Recurrent Neural Network Exploration Strategies During Reinforcement Learning Depend on Network Capacity

This article has 4 authors:
1. H. Flimm
2. D. Tuzsus
3. I. Pappas
4. J. Peters
This article has no evaluations. Latest version: Mar 13, 2025

Revealing the mechanisms underlying latent learning with successor representations

This article has 3 authors:
1. Matheus Menezes
2. Xiangshuai Zeng
3. Sen Cheng
This article has no evaluations. Latest version: Mar 2, 2025
