Worth the Weight: Modern LLMs Demonstrate Accurate Metacognitive Knowledge of Decision Weights in Multi-Attribute Choice

Trent N. Cash
Daniel Oppenheimer

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Users have become increasingly reliant on Large Language Models (LLMs), like ChatGPT, to complete a wide range of reasoning tasks, from managing workplace projects to giving personal advice. However, LLMs function as black boxes, leaving users with minimal insight into how they generate the responses that they do. One way that users can attempt to peek into these black boxes is by asking LLMs to explain their reasoning processes. The nascent literature on LLM faithfulness suggests that LLMs – like humans – often fail to accurately identify the information they use to inform their reasoning, suggesting a lack of metacognitive knowledge. Across two studies, we extend this research to the context of multi-attribute choice, tasking both LLMs and human participants (n = 436) with completing the Knowledge of Weights paradigm. Participants first completed a series of choice tasks in which they picked between homes that varied on six attributes, then self-reported the decision weight they believed they placed on each attribute in two different formats. In Study 1, we found that ChatGPT4o self-reported weights that were significantly less reflective of their choice behavior than humans. In Study 2, we found that three more-advanced LLMs (ChatGPT5, Sonnet 4, and Gemini 2.5 Flash) self-reported weights that were as accurate or more accurate than those provided by humans. These results suggest that LLMs can generate and maintain accurate metacognitive knowledge of their own decision-making processes as well or better than humans, but that this is a relatively new ability. Practical and theoretical implications are discussed.

Version published to 10.31234/osf.io/zq6md_v1 on OSF Preprints
Nov 12, 2025

A trade-off between reasoning ability and metacognitive sensitivity in large language models

This article has 5 authors:
1. Ruixin Sha
2. Conghui Sun
3. Chunliang Yang
4. Liang Luo
5. Xiao Hu
This article has no evaluationsLatest version Jan 6, 2026
Using Large Language Models to Explore and Predict Human Choice from Verbal Description

This article has 1 author:
1. Eyal Marantz
This article has no evaluationsLatest version Dec 17, 2025
TrAIngles: an LLM-Based Automatic Scoring Tool for Theory of Mind Assessments Across the Life Span

This article has 9 authors:
1. Serena Maria Stagnitto
2. Daniele Gatti
3. Irene Ceccato
4. Fulvia Castelli
5. Gabriele Chierchia
6. Luca Rinaldi
7. Rory Thomas Devine
8. Elena Cavallini
9. Serena Lecce
This article has no evaluationsLatest version Jan 17, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A trade-off between reasoning ability and metacognitive sensitivity in large language models

Using Large Language Models to Explore and Predict Human Choice from Verbal Description

TrAIngles: an LLM-Based Automatic Scoring Tool for Theory of Mind Assessments Across the Life Span