A computational theory of learning moral weights
Abstract
What determines whose welfare people consider in their moral decisions? We propose that people learn entity-specific moral weights through reinforcement learning (RL), where decision outcomes provide the learning signal. We formalize this in a computational model in which agents update their moral weights for different stakeholders based on whether considering those stakeholders’ welfare led to better- or worse-than-expected outcomes. To test this model, we simulate agents learning in market environments (which reward cooperation with strangers) versus non-market environments (which reward exploitation). We show that this mechanism is sufficient to explain two empirical phenomena linking market integration to prosociality: (1) cross-cultural variation in dictator game offers across small-scale societies, and (2) within-cultural variation in lost-letter return rates across 188 Italian municipalities. Together, our simulation results suggest that updating moral weights via RL may be an important mechanism of moral change at both the individual and the societal level.
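The abstract does not spell out the model's equations, so the following is a minimal sketch of the kind of mechanism it describes: an agent that maintains one moral weight per stakeholder and nudges that weight when weighting the stakeholder's welfare produced a better- or worse-than-expected outcome. The class name, the exploration policy, the REINFORCE-with-baseline-style update, and the two stylized payoff functions are all illustrative assumptions, not the paper's actual implementation.

```python
import random

class MoralWeightLearner:
    """Learns a per-stakeholder moral weight from decision outcomes.
    Update rule, policy, and parameters are illustrative assumptions."""

    def __init__(self, stakeholders, learning_rate=0.05, explore_sd=0.1):
        self.alpha = learning_rate
        self.explore_sd = explore_sd
        # Moral weight per stakeholder: how strongly that party's
        # welfare figures in the agent's choices (assumed to start at 0).
        self.weights = {s: 0.0 for s in stakeholders}
        self.expected_outcome = 0.0   # running baseline for outcomes
        self._initialized = False

    def choose_share(self, stakeholder):
        # Share offered tracks the current weight plus exploration
        # noise, clipped to the feasible range [0, 1].
        share = self.weights[stakeholder] + random.gauss(0.0, self.explore_sd)
        return min(max(share, 0.0), 1.0)

    def update(self, stakeholder, share_given, outcome):
        if not self._initialized:
            # Warm-start the baseline at the first observed outcome.
            self.expected_outcome = outcome
            self._initialized = True
        # Prediction error: did acting on this stakeholder's welfare
        # lead to a better- or worse-than-expected outcome?
        delta = outcome - self.expected_outcome
        # Pull the weight toward shares that beat expectations.
        self.weights[stakeholder] += self.alpha * delta * (share_given - self.weights[stakeholder])
        # Track the moving expectation of outcomes.
        self.expected_outcome += self.alpha * delta


def market_payoff(share, endowment=10.0):
    # Market environment: cooperation with strangers is rewarded,
    # so giving more returns more (a stylized stand-in).
    offer = share * endowment
    return (endowment - offer) + 1.5 * offer

def non_market_payoff(share, endowment=10.0):
    # Non-market environment: generosity is not repaid,
    # so giving more simply costs the agent.
    offer = share * endowment
    return endowment - offer


random.seed(0)
for label, payoff_fn in [("market", market_payoff), ("non-market", non_market_payoff)]:
    agent = MoralWeightLearner(stakeholders=["stranger"])
    for _ in range(500):
        share = agent.choose_share("stranger")
        agent.update("stranger", share, payoff_fn(share))
    print(f"{label}: learned weight for stranger = {agent.weights['stranger']:.2f}")
```

Running this sketch, the learned weight for the stranger drifts upward in the market environment (where generosity is repaid) and stays near zero in the non-market environment, mirroring the qualitative contrast the abstract draws between the two simulated settings.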