Six fallacies in substituting large language models for human participants


Abstract

Can AI systems like large language models (LLMs) replace human participants in behavioral and psychological research? Here I critically evaluate the “replacement” perspective and identify six interpretive fallacies that undermine its validity. These fallacies are: (1) equating token prediction with human intelligence, (2) treating LLMs as the average human, (3) interpreting alignment as explanation, (4) anthropomorphizing AI systems, (5) essentializing identities, and (6) substituting model data for human evidence. Each fallacy represents a potential misunderstanding about what LLMs are and what they can tell us about human cognition. The analysis distinguishes levels of similarity between LLMs and humans, particularly functional equivalence (outputs) versus mechanistic equivalence (processes), while highlighting both technical limitations (addressable through engineering) and conceptual limitations (arising from fundamental differences between statistical and biological intelligence). For each fallacy, specific safeguards are provided to guide responsible research practices. Ultimately, the analysis supports conceptualizing LLMs as pragmatic simulation tools—useful for role-play, rapid hypothesis testing, and computational modeling (provided their outputs are validated against human data)—rather than as replacements for human participants. This framework enables researchers to leverage language models productively while respecting the fundamental differences between machine intelligence and human thought.
