Machine Theory of Mind and How Large Language Models Mimic Human Mind Perception but Mask Representational Divergence
Abstract
As LLMs increasingly mediate decisions, recommendations, and persuasive communication, a central question is whether their mind-attribution structure in output space is human-like or only behaviorally similar. This raises a behavior–geometry paradox: systems can produce human-like responses while relying on a latent structure that differs from human social cognition. We address this question with a large-scale repeated-elicitation design spanning 16 agents, 65 mental capacities, and 178 replications (185,120 judgments), followed by dimensional analyses and benchmark comparisons against established human mind-perception frameworks. We report three findings. First, model judgments recover a recognizable human-like backbone of mind ascription. Second, this backbone is systematically compressed, with weaker separation among capacities and reduced structural contrast. Third, key dimensions are reconfigured under broader probes, including shifts in how affective, moral-mental, and reality-interaction capacities are partitioned. Together, these results show that behavioral alignment can coexist with representational divergence. The findings support extending safety and alignment evaluation beyond response-level performance to include audits of output-inferred representational geometry, with implications for governance, deployment in persuasive contexts, and theory-building at the intersection of AI and social cognition.
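The reported judgment count follows from the factorial structure of the design; a minimal arithmetic check, assuming every agent–capacity pair is elicited once per replication:

\[
16 \ \text{agents} \times 65 \ \text{capacities} \times 178 \ \text{replications} = 185{,}120 \ \text{judgments}
\]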