A computational principle of habit formation
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Actions are influenced by multiple decision-making systems – including a goal-directed system that favors rewarded actions and a habit system that repeats past actions – but precisely when one system prevails is not known. We show that when competition between these systems is resolved by a winner-take-all mechanism, the precise condition for the emergence of habits can be cast in terms of the well-known probability matching principle. The theory embodies a trade-off in which exploitation, or overmatching, maximizes reward but strengthens habits, while paradoxically, exploration preserves goal-directed behavior by sacrificing rewards. This tradeoff can be averted if learning operates on abstract latent state representations whereby knowing the broader context allows for switching between two habits instead of avoiding one, thus maximizing rewards without forfeiting flexibility. The theory explains a range of animal behaviors as well as task-dependent effects of striatal manipulation, and suggests that neural mechanisms governing exploration implicitly control arbitration between decision-making systems.