Discourse context and co-speech gestures jointly shape hierarchical prediction during the processing of a multimodal narrative
Abstract
Understanding one another in daily communication depends on predicting language from prior discourse context and visual signals, such as co-speech gestures. However, it remains unclear how discourse context and gestures jointly shape neural predictions during naturalistic language processing. Here, participants watched multimodal narratives containing spontaneously produced gestures during fMRI scanning. Leveraging transformer-based computational modeling, we disentangled linguistic uncertainty from contextual informativity at the sentence level, and observed that these dissociated measures engaged neural regions associated with predictive processing across multiple levels of multimodal representation. Further, greater gesture availability reduced the neural cost associated with these predictive processes and lessened reliance on discourse context, revealing a push-pull synergy between gestures and context. Our findings extend hierarchical predictive processing frameworks, demonstrating that gestures and discourse jointly, rather than additively, constrain neural predictions at multiple representational scales. These results underscore the critical and dynamically integrated role of multimodal predictive mechanisms in everyday communication.
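As a rough illustration of how the two sentence-level measures named in the abstract can be separated with a transformer language model, the sketch below estimates per-token surprisal (a proxy for contextual informativity) and next-token entropy (a proxy for linguistic uncertainty) and averages them over a sentence. This is a minimal sketch only: GPT-2, the Hugging Face `transformers` API, and mean aggregation over tokens are assumptions standing in for whatever model and aggregation the authors actually used.

```python
# Hedged sketch: GPT-2 and mean-over-tokens aggregation are assumptions,
# not the paper's actual pipeline.
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

def sentence_metrics(context: str, sentence: str):
    """Return mean surprisal and mean next-token entropy (both in bits)
    for `sentence`, conditioned on the preceding discourse `context`."""
    ctx_ids = tokenizer(context, return_tensors="pt").input_ids
    sent_ids = tokenizer(sentence, return_tensors="pt").input_ids
    input_ids = torch.cat([ctx_ids, sent_ids], dim=1)

    with torch.no_grad():
        logits = model(input_ids).logits  # shape: (1, seq_len, vocab)

    surprisals, entropies = [], []
    for pos in range(ctx_ids.shape[1], input_ids.shape[1]):
        # Predictive distribution over the token at `pos`, given tokens < pos.
        probs = torch.softmax(logits[0, pos - 1], dim=-1)
        token_id = input_ids[0, pos]
        surprisals.append(-torch.log2(probs[token_id]).item())       # informativity proxy
        entropies.append(-(probs * torch.log2(probs)).sum().item())  # uncertainty proxy

    return sum(surprisals) / len(surprisals), sum(entropies) / len(entropies)

# Example usage with a hypothetical narrative fragment:
surprisal, entropy = sentence_metrics(
    "The speaker gestured toward the map while describing the route.",
    " Then she traced the river with her finger.",
)
print(f"mean surprisal: {surprisal:.2f} bits, mean entropy: {entropy:.2f} bits")
```

Because entropy is computed before each token is observed and surprisal after, the two quantities can diverge, which is what makes it possible to relate them to distinct neural signatures of prediction.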