Can Centaur Truly Simulate Human Cognition? The Fundamental Limitation of Instruction Understanding

Abstract

Recent advances in cognitive modeling have demonstrated the potential of large language models (LLMs) to unify diverse aspects of human cognition. The Centaur model, an LLM fine-tuned on cognitive tasks, achieves high performance across 160 psychological experiments, suggesting that a single model may capture multiple cognitive processes. However, it remains unclear whether this success stems from genuine task understanding or from the exploitation of superficial statistical cues. To test this, we systematically manipulated Centaur’s input by (1) removing task instructions, (2) removing all contextual information, and (3) providing misleading instructions. All three manipulations remove information that humans need to perform the tasks. Results show that Centaur often maintains high performance under these manipulations, outperforming both baseline cognitive models and the un-fine-tuned base LLM (Llama) given correct instructions. These findings indicate that Centaur’s success likely relies on superficial statistical cues rather than true instruction comprehension. Our study highlights the need for more diverse out-of-distribution tests of LLM-based cognitive models.