Can Machines Think Like Humans? A Behavioral Evaluation of LLM-Agents in Dictator Games
Abstract
As Large Language Model (LLM)-based agents increasingly undertake real-world tasks and engage with human society, how well do we understand their behaviors? This study (1) investigates how LLM agents' prosocial behaviors, which reflect a fundamental social norm, can be induced by different personas and benchmarked against human behaviors; and (2) introduces a behavioral approach to evaluating the performance of LLM agents in complex decision-making scenarios. We explored how different personas and experimental framings affect these AI agents' altruistic behavior in dictator games and compared their behaviors within the same LLM family, across different families, and with human behaviors. Our findings reveal substantial variation and inconsistency among LLMs and notable differences from human behaviors. Merely assigning a human-like identity to LLMs does not produce human-like behaviors. Despite being trained on extensive human-generated data, these AI agents cannot accurately predict human decisions. LLM agents fail to capture the internal processes of human decision-making, and their alignment with human behavior is highly variable, depending on specific model architectures and prompt formulations; worse still, this dependence does not follow a clear pattern.
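To make the experimental setup concrete, the sketch below shows one way a persona-conditioned dictator-game trial could be assembled and scored. It is a minimal illustration, not the paper's actual protocol: the persona texts, the endowment of 100 units, and the query_llm stub (standing in for a real chat-completion API call) are all hypothetical.

```python
import re

# Hypothetical persona texts; the study's actual persona prompts are not reproduced here.
PERSONAS = {
    "neutral": "You are a participant in an economic experiment.",
    "human_identity": "You are a 35-year-old teacher who values fairness.",
}

ENDOWMENT = 100  # hypothetical endowment for the dictator game


def build_dictator_prompt(persona_key: str) -> list[dict]:
    """Compose a persona-conditioned dictator-game prompt as chat messages."""
    return [
        {"role": "system", "content": PERSONAS[persona_key]},
        {
            "role": "user",
            "content": (
                f"You have been given {ENDOWMENT} dollars. You may give any amount, "
                "from 0 to the full sum, to an anonymous stranger and keep the rest. "
                "Reply with only the amount you give."
            ),
        },
    ]


def parse_allocation(reply: str) -> int | None:
    """Extract the first integer amount from the model's reply."""
    match = re.search(r"\d+", reply)
    return int(match.group()) if match else None


def query_llm(messages: list[dict]) -> str:
    """Placeholder for a real LLM API call (e.g., a chat-completion endpoint)."""
    return "I would give 20 dollars."  # canned reply, for illustration only


if __name__ == "__main__":
    messages = build_dictator_prompt("human_identity")
    allocation = parse_allocation(query_llm(messages))
    print(f"Amount given to the stranger: {allocation} of {ENDOWMENT}")
```

Repeating such trials across personas, framings, and model families, and comparing the resulting allocation distributions to human baselines, is the kind of behavioral benchmarking the abstract describes.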