Sound localization is biased by simultaneous, and delayed by preceding visual distractors

Read the full article See related articles

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.
Log in to save this article

Abstract

Vision can shape auditory perception, especially when visual cues occur at different times and locations than sounds. Simultaneous but spatially misaligned lights bias the perceived location of a sound—a phenomenon known as the ventriloquism effect. Temporally misaligned lights can also affect the latency of auditory responses. However, it remains unclear how multiple visual stimuli that differ from sounds in both space and time jointly influence localization behaviour. We investigated how visual distractors, spatially misaligned by 10°, presented before and/or during a target sound influence localization accuracy and response latency in a rapid head-pointing task. Human listeners localized brief (150 ms) broadband noise bursts with an average root-mean-square error of 5° and a baseline latency of 252 ms. Simultaneous visual cues induced the ventriloquism effect, in which the perceived sound location was biased by 1.8°. Response latency also increased by 21 ms (273 ms). Preceding visual stimuli (2 s duration) did not induce a bias, but increased latency by 55 ms (307 ms). Introducing a 200 ms gap between the preceding light and the sound reduced this latency increase to 24 ms (276 ms), still not inducing a significant bias. When we presented both a preceding and a simultaneous light on opposite sides of the sound, localization reflected the bias induced by the simultaneous light (1.8°) and the latency increase induced by the preceding light (by 48 ms). These findings reveal a dissociation in audiovisual integration: preceding visual stimuli primarily influence when a sound is responded to (latency), while simultaneous stimuli influence where it is perceived (accuracy). This supports causal inference models of multisensory integration and suggests distinct underlying mechanisms for spatial and temporal processing of sounds in sensorimotor circuits.

Article activity feed