The role of reinforcement learning in pragmatic reasoning tasks: Modeling and validating the sources of individual differences

John Duff
Alexandra Mayn
Vera Demberg

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

In Gricean pragmatics, inference during communication is regarded as a form of rational, domain-general reasoning about the intentions of other agents.Studies using the pictorial communication "reference game" task are sometimes used in support of this hypothesis.Yet, measures of pragmatic reasoning in this task sometimes reveal poor performance, with participants requiring many rounds of play before they exhibit theoretically-expected patterns, and demonstrating substantial individual differences in behavior.Do these results challenge the idea of widespread inferencing via fundamental social competence?We advance an alternative proposal here, which posits that these patterns emerge as a factor of the way participants perform pragmatic reasoning in a task: namely, they prefer to use simpler interpretation strategies until experience motivates the use of additional resources.Building off of work modeling task adaptation as reinforcement learning, we use the cognitive architecture ACT-R to simulate the expected behavior of individuals with this kind of resource-rational performance algorithm, subject to individualized parameters for reinforcement learning.These simulations provide a proof-of-concept for our adaptation proposal, recreating known patterns and generating new concrete predictions for the particular domain-general sources of individual variance in reference game tasks.We then go on to validate some of these new predictions in a pre-registered experiment, and find that pragmatic response behavior is indeed related to a participant's general persistence in self-directed exploration of strategies for task completion.Our results offer a path to reconcile difficult empirical data within idealistic models of pragmatic competence.From a broader perspective, we see this as a first step towards a theory of performance factors in pragmatic reasoning, and ultimately, a case study in the value of process-level computational modeling.

Version published to 10.31234/osf.io/9shv3_v1 on OSF Preprints
Aug 5, 2025

An ACT-R model of resource-rational performance in a pragmatic signaling game

This article has 3 authors:
1. John Duff
2. Alexandra Mayn
3. Vera Demberg
This article has no evaluationsLatest version Aug 6, 2025
The Dynamic Model of Cognitive Dominance Framework

This article has 1 author:
1. Chris Walsh
This article has no evaluationsLatest version Jul 26, 2025
A Dharmakīrtian Model of Relevance Realization in Cognitive Agents

This article has 2 authors:
1. Nadav Amir
2. John Dunne
This article has no evaluationsLatest version Aug 31, 2025

Listed in

Abstract

Article activity feed

Related articles

An ACT-R model of resource-rational performance in a pragmatic signaling game

The Dynamic Model of Cognitive Dominance Framework

A Dharmakīrtian Model of Relevance Realization in Cognitive Agents