Robust spatial hearing beyond primary interaural cues in humans over deep neural networks
Abstract
Spatial hearing allows humans to localize sound sources in the azimuth plane using interaural time (ITD) and level (ILD) differences, but the contribution of additional auditory features remains unclear. To investigate this, we measured human localization performance with natural and artificial stimuli that selectively included or excluded ITD and ILD as primary interaural cues. As expected, human listeners relied synergistically on ITD and ILD for accurate azimuth localization. Moreover, even when both primary cues were absent, localization performance remained above chance level. To investigate possible computational mechanisms underlying this robust performance, we compared human performance with that of state-of-the-art deep neural networks (DNNs) optimized for sound localization. In contrast to humans, DNNs demonstrated high accuracy only for stimuli that resembled their training regime but failed when primary interaural cues were absent. This human-DNN misalignment highlights a fundamental distinction in sensory processing strategies, potentially arising from the simplicity bias inherent in DNN training, with human reliance on a wider range of auditory features likely reflecting evolutionary pressures favoring adaptability across diverse acoustic environments. Together, our results demonstrate the robustness of human spatial hearing beyond primary interaural cues and point to promising directions for advancing artificial systems and informing clinical applications, such as cochlear implants and auditory prosthetics.