Narrative Context Shifts Gaze from Visual to Semantic Salience
Abstract
Humans make over a hundred thousand eye movements daily to gather visual information. But what determines where we look? Current computational models typically link gaze behaviour to the visual features of isolated images, but eye movements are also strongly shaped by cognitive goals: observers gather information that helps them to understand, rather than merely represent, the world. Within this framework, observers should attend more to information that updates their understanding of the environment, and less to what is purely visually salient. Here we tested this hypothesis using a free-viewing paradigm with picture narratives in which we experimentally manipulated the meaningfulness of temporal context by presenting pictures either in a coherent (i.e., correct) order or in a temporally shuffled order. We developed a novel approach to quantify which visual information is semantically salient (i.e., important for understanding): we separately obtained language narratives for the images in each story and computed the contextual surprisal of visual objects using a large language model. We compared the ability of this semantic salience model to explain gaze behaviour against a state-of-the-art model of visual salience (DeepGaze-II). We found that individuals looked relatively more often and more quickly at semantically salient objects when images were presented in coherent compared to shuffled order. In contrast, visual salience did not account for gaze behaviour better in the coherent than in the shuffled order. These findings highlight how internal contextual models guide visual sampling and demonstrate that language models offer a powerful tool for capturing gaze behaviour in richer, meaningful settings.
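The core quantity in the semantic salience measure is contextual surprisal: the negative log probability a language model assigns to an object's mention given the narrative so far. The sketch below illustrates the arithmetic only; the paper uses a large language model, whereas here a hand-made toy conditional distribution stands in for the model's probabilities, and all object names and probability values are invented for illustration.

```python
import math

def surprisal(prob: float) -> float:
    """Contextual surprisal in bits: -log2 of the probability the
    (language) model assigns to the object given the story context."""
    return -math.log2(prob)

# Toy stand-in for LLM next-word probabilities, given a coherent
# kitchen-scene narrative vs. a shuffled (context-free) one.
# All values below are invented for illustration.
p_coherent = {"knife": 0.30, "umbrella": 0.01}
p_shuffled = {"knife": 0.05, "umbrella": 0.05}

# In a coherent context, an expected object is low-surprisal and an
# out-of-place object is high-surprisal; shuffling flattens the gap.
print(round(surprisal(p_coherent["knife"]), 2))
print(round(surprisal(p_coherent["umbrella"]), 2))
print(round(surprisal(p_shuffled["knife"]), 2))
```

Under this measure, the semantically salient objects in a coherent narrative are exactly those whose mention is hard to predict from the preceding context, which is why shuffling the picture order should weaken the link between surprisal and gaze.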