What Are You Still Waiting For? Real-Time Lexical Access Requires Encapsulated Auditory Memory

Read the full article See related articles

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Speech perception entails the integration of multiple acoustic cues that unfold asynchronously. While conventional work posits that listeners immediately update higher-level interpretations as cues become available, emerging evidence suggests that processing of certain speech sounds (e.g., voiceless sibilant fricatives) may be buffered, in which categorical judgments are temporarily withheld until the arrival of subsequent cues (such as a following vocoid). This study examined the mechanisms underlying this buffered integration, testing whether its release is triggered by specific incoming information or by a fixed temporal threshold. Additionally, we examine potential sources of this buffering effect. Using the Visual World Paradigm (VWP), we tested this by manipulating frication duration for /s/ and /ʃ/ to canonical, doubled, and tripled lengths. Participants (n=56) completed a VWP task to determine when recognition of a fricative occurred, and a Visual Analog Scaling (VAS) task to assess categorization profiles and determine whether these profiles predict individual differences in buffering. The VWP results replicated prior findings: for canonical-length fricatives, listeners delayed processing until frication offset. For double-length fricatives, this buffering was extended, with listeners withholding decisions until the entire fricative duration had elapsed. When fricative lengths were tripled, partial commitment emerged during the frication period, yet final decisions remained contingent on later cues. These results suggest that buffering is largely governed by the availability of acoustic information over time, though listeners may integrate partial information at extreme durations. VAS results revealed that listeners with greater sensitivity to secondary cues showed faster recognition. Together, these findings underscore the flexible nature of speech perception, highlighting how listeners strategically modulate processing to optimize cue integration.

Article activity feed