Perceptual learning of speaker-specific gesture-speech temporal alignment: Effects on word recognition
Abstract
Perceptual learning helps listeners cope with the lack of invariance in speech perception. Exposure to a nontypical phoneme in disambiguating contexts can bias subsequent perception. Such learning is driven not only by lexical context but also by visual cues. One such visual cue is the beat gesture, a simple up-and-down hand movement that usually co-occurs with stressed syllables in everyday conversation. Beat gestures can influence stress perception and thus bias word recognition. However, gesture-speech temporal alignment varies substantially across speakers. The current study examined whether listeners can learn about speaker-specific gesture-speech alignment patterns from disambiguating auditory stress cues. Across three experiments, one group of participants was exposed to a speaker who consistently gestured earlier than the stressed syllables, while the other group was exposed to the same speaker gesturing later. After exposure, both groups categorized minimal stress pairs (e.g., VOORnaam–voorNAAM) with ambiguous auditory stress cues and a beat gesture placed midway between the two target syllables. A reliable group difference emerged when the test stimuli were sufficiently ambiguous: word recognition was biased by the gesture-speech alignment learnt during exposure. However, this learning appeared not to generalize to novel words. Our results demonstrate that learnt gesture-speech alignment can be applied to word recognition for the same speaker, optimizing face-to-face interaction.