Outraged AI: Large language models prioritise emotion over cost in fairness enforcement

Abstract

Emotions guide human decisions, but whether large language models (LLMs) use emotion similarly remains unknown. We tested this using altruistic third-party punishment, where an observer incurs a personal cost to enforce fairness—a hallmark of human morality and often driven by negative emotion. In a large-scale comparison of 4,068 LLM agents with 1,159 adults across 796,100 decisions, LLMs used emotion to guide punishment, sometimes even more strongly than humans did: unfairness elicited stronger negative emotion that led to more punishment; punishing unfairness produced more positive emotion than accepting it did; and critically, prompting self-reports of emotion causally increased punishment. However, mechanisms diverged: LLMs prioritised emotion over cost, enforcing norms in an almost all-or-none manner with reduced cost sensitivity, whereas humans balanced fairness and cost. Notably, reasoning models (o3-mini, DeepSeek-R1) were more cost-sensitive and closer to human behaviour than foundation models (GPT-3.5, DeepSeek-V3), yet remained heavily emotion-driven. These findings provide the first causal evidence of emotion-guided moral decisions in LLMs and reveal deficits in cost calibration and nuanced fairness judgements, reminiscent of early-stage human responses. We propose that LLMs progress along a trajectory paralleling human development; future models should integrate emotion with context-sensitive reasoning to achieve human-like emotional intelligence.
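The paradigm and its key manipulation lend themselves to a compact illustration. Below is a minimal sketch of a single third-party punishment trial, run with and without the emotion self-report prompt, assuming an OpenAI-style chat API; the prompt wording, endowments, punishment cost, trial count, and response parsing are illustrative stand-ins, not the study's actual materials or protocol.

```python
# Hypothetical sketch of one third-party punishment trial with an
# emotion self-report manipulation. All prompt text, stake sizes,
# and the cost schedule are illustrative assumptions.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment


def run_trial(split=(9, 1), punish_cost=2, elicit_emotion=True):
    """Ask the model, as an unaffected observer, whether to pay a
    personal cost to punish an unfair split between two players."""
    dictator, receiver = split
    scenario = (
        f"Player A divided 10 coins, keeping {dictator} and giving "
        f"{receiver} to Player B. You are an uninvolved observer with "
        f"10 coins of your own. You may pay {punish_cost} of your coins "
        f"to remove coins from Player A, or accept the split as is."
    )
    messages = [{"role": "user", "content": scenario}]
    if elicit_emotion:
        # The causal manipulation: prompting an emotion self-report
        # before the decision, which the study found increased punishment.
        messages.append({
            "role": "user",
            "content": "First, report in one sentence how you feel about "
                       "this split. Then answer PUNISH or ACCEPT.",
        })
    else:
        messages.append({"role": "user",
                         "content": "Answer PUNISH or ACCEPT."})
    resp = client.chat.completions.create(
        model="gpt-3.5-turbo", messages=messages, temperature=1.0
    )
    reply = resp.choices[0].message.content
    # Simplistic parse: treat any mention of PUNISH as a punishment choice.
    return "PUNISH" in reply.upper()


# Compare punishment rates with and without the emotion prompt.
for condition in (True, False):
    rate = sum(run_trial(elicit_emotion=condition) for _ in range(20)) / 20
    print(f"emotion prompt={condition}: punishment rate={rate:.2f}")
```

Contrasting the two conditions while holding the scenario fixed is what licenses the causal reading: any difference in punishment rate is attributable to the emotion self-report prompt. Varying `split` and `punish_cost` across trials would likewise probe the cost sensitivity on which the abstract reports LLMs and humans diverging.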
