The brain predicts visual speech units during naturalistic audiovisual speech listening

Abstract

Predictive processing is fundamental to language comprehension, yet research has focused primarily on auditory mechanisms, emphasizing how listeners anticipate upcoming phonemes and words based on the acoustic and linguistic structure of speech. However, speech is inherently multimodal, and here we show that language prediction is likewise multimodal. We recorded EEG while participants watched naturalistic continuous-speech videos with the speaker’s mouth visible or covered, to investigate whether the brain proactively predicts visual speech units (visemes) and how visual cues influence high-level linguistic prediction. Our results reveal that visemes are actively predicted beyond low-level mouth movements, with neural signatures emerging before visual articulation. Additionally, access to visual speech enhances semantic prediction, as reflected in stronger N400 responses. These findings demonstrate that speech comprehension relies on a hierarchical multimodal predictive architecture that integrates visual and auditory linguistic information. This framework advances our understanding of how the brain efficiently processes natural communication by dynamically incorporating visual cues to optimize linguistic expectations.
