Correlational estimates of language effects are biased and directionally unpredictable: Evidence from large-scale field experiments
Abstract
The principle that correlation does not imply causation is foundational to scientific reasoning, yet correlational analyses routinely inform conclusions, particularly in research on the effects of language. We quantify the divergence between correlational and causal estimates using two large-scale field experiment datasets: 7,797 experiments on Upworthy.com (45,674 headlines) and 153,787 experiments across 398 news outlets (416,009 headlines), in which linguistic features of headlines were experimentally manipulated and click-through rates measured. Across 50 language constructs, correlational and causal estimates diverged in the estimated direction of effect 20-50% of the time. Critically, the direction of bias was unpredictable: correlational models underestimated causal effects in one dataset and overestimated them in the other. Standard corrections, including platform fixed effects, failed to eliminate this bias. These findings demonstrate that correlational evidence in language research is not merely imprecise but can be systematically misleading, and that the direction of distortion cannot be anticipated without experimental ground truth.
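The core comparison can be illustrated with a toy simulation (not the paper's data or exact estimator): when a confounder, such as an outlet's baseline click-through rate, influences both how often a linguistic feature appears and the outcome, the pooled correlational slope can carry the opposite sign from the within-experiment causal contrast that randomization recovers. All parameter values below are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy setup: each "experiment" has its own baseline click-through rate.
# Confounding assumption: low-baseline experiments use the feature more often.
n_experiments = 5000
true_effect = 0.01  # assumed causal lift in CTR when the feature is present

baseline = rng.uniform(0.02, 0.10, n_experiments)
p_feature = np.clip(0.9 - 8 * baseline, 0.05, 0.95)

# Two headlines per experiment; feature presence follows the confounded process.
feat_a = rng.random(n_experiments) < p_feature
feat_b = rng.random(n_experiments) < p_feature
ctr_a = baseline + true_effect * feat_a + rng.normal(0, 0.005, n_experiments)
ctr_b = baseline + true_effect * feat_b + rng.normal(0, 0.005, n_experiments)

# Correlational estimate: pool all headlines and regress CTR on the feature.
x = np.concatenate([feat_a, feat_b]).astype(float)
y = np.concatenate([ctr_a, ctr_b])
corr_slope = np.cov(x, y)[0, 1] / np.var(x, ddof=1)

# Causal estimate: within-experiment difference where assignment differs,
# which cancels the shared baseline.
mask = feat_a != feat_b
causal = np.mean(np.where(feat_a[mask],
                          ctr_a[mask] - ctr_b[mask],
                          ctr_b[mask] - ctr_a[mask]))

print(f"correlational: {corr_slope:+.4f}  causal: {causal:+.4f}")
```

Under these assumed parameters the pooled slope comes out negative while the within-experiment contrast recovers the positive causal effect, a sign flip of the kind the abstract reports; with the confounding reversed, the bias would instead inflate the correlational estimate, matching the paper's point that the direction of distortion is unpredictable without experimental ground truth.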