Surprisal maps differently onto online measures of sentence processing

Alexandre Menezes Barroso

Abstract

Contextual predictability is a robust determinant of online sentence processing, but it is unclear whether predictability affects different online measures through a shared functional mapping. We address this question by comparing how surprisal relates to processing cost across multiple behavioral and electrophysiological measures in two complementary datasets: the Dutch RaCCooNS co-registered eye-tracking/EEG corpus and an English 205-sentence benchmark linking eye-movement, self-paced reading, and ERP data. Across both corpora, surprisal reliably predicted online processing cost. We then tested whether the same surprisal-to-cost mapping generalized across measures by estimating measure-specific power-law transformations under blocked cross-validation and confirmatory full-data models. The results did not support a single invariant linking function. Instead, the preferred mapping varied across measures within each corpus. In Dutch natural reading, first fixation was closest to ordinary surprisal, whereas later eye-movement measures and the fixation-related N400 favored steeper mappings. In the English benchmark, reading-time measures clustered in a compressed regime, the N400 was sublinear, and the P600 was superlinear. A targeted GAMM robustness analysis supported the main within-corpus dissociations. These findings argue against treating major online measures as interchangeable readouts of a single surprisal-linked cost signal. Surprisal remains central to sentence processing, but the mapping from surprisal to observable cost is itself part of the psycholinguistic theory that needs to be explained.
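As a rough illustration of what estimating a power-law mapping under blocked cross-validation can look like, the sketch below picks the exponent that best predicts a processing-cost measure from surprisal on held-out contiguous blocks. It is not the preprint's analysis: the data are synthetic, the model is a simple regression with no covariates or random effects, and every name in it (`fit_line`, `blocked_cv_mse`, `select_exponent`, `GRID`, `TRUE_EXPONENT`) is hypothetical.

```python
import random


def fit_line(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def blocked_cv_mse(surprisal, cost, exponent, n_blocks=5):
    """Held-out mean squared error of cost ~ intercept + slope * surprisal**exponent.

    Folds are contiguous blocks of the sequence (no shuffling), so
    neighbouring observations always fall in the same fold.
    """
    x = [s ** exponent for s in surprisal]
    n = len(x)
    bounds = [round(i * n / n_blocks) for i in range(n_blocks + 1)]
    sq_errors = []
    for start, stop in zip(bounds[:-1], bounds[1:]):
        # Train on everything outside the held-out block.
        intercept, slope = fit_line(x[:start] + x[stop:], cost[:start] + cost[stop:])
        sq_errors.extend(
            (cost[i] - (intercept + slope * x[i])) ** 2 for i in range(start, stop)
        )
    # Squared errors are pooled over all held-out observations.
    return sum(sq_errors) / n


def select_exponent(surprisal, cost, grid):
    """Return the candidate exponent with the lowest blocked-CV error."""
    return min(grid, key=lambda k: blocked_cv_mse(surprisal, cost, k))


# Synthetic data (an assumption for illustration): a sublinear mapping plus noise.
random.seed(0)
TRUE_EXPONENT = 0.5
surprisal = [random.uniform(0.5, 15.0) for _ in range(2000)]
cost = [200.0 + 30.0 * s ** TRUE_EXPONENT + random.gauss(0.0, 3.0) for s in surprisal]

GRID = [0.25, 0.5, 0.75, 1.0, 1.5, 2.0]  # exponent 1.0 is ordinary (linear) surprisal
best = select_exponent(surprisal, cost, GRID)
print("selected exponent:", best)
```

In this setup an exponent below 1 corresponds to a compressed (sublinear) mapping and an exponent above 1 to a steeper (superlinear) one. Running the same selection separately for each measure is one simple way to ask whether the measures share a linking function.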

Version published to 10.31234/osf.io/pshqy_v1 on OSF Preprints
Apr 8, 2026

Related articles

Beyond next-word prediction: hierarchical linguistic composition modulates LLM-brain alignment in time

This article has 2 authors:
1. Junyuan Zhao
2. Jonathan R. Brennan
This article has no evaluations
Latest version May 16, 2026

Word meaning, not surface statistics, is essential for predictive language processing

This article has 3 authors:
1. Andrey Zyryanov
2. Victoria Pierz
3. Yulia Oganian
This article has no evaluations
Latest version May 15, 2026

Temporal Dissociation of Syntactic Disambiguation and Memory Retrieval during Sentence Processing: Naturalistic MEG Evidence from Interpretable Models

This article has 5 authors:
1. Donald Dunagan
2. Dylan Scott Low
3. Shisen Yue
4. Lars Meyer
5. John T. Hale
This article has no evaluations
Latest version Apr 21, 2026
