Brain Representations of Natural Sound Statistics
Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
Natural sound textures, such as rain or crackling fire, are defined by time-averaged summary statistics that shape their perceptual identity. Variations in these statistics provide a controlled means to examine how the human brain processes complex auditory structure. In this study, we investigated how such statistics are represented along the ascending auditory pathway, within auditory cortex, and in high-level regions involved in natural pattern analysis. Using fMRI, we measured brain responses to synthetic sound textures in which higher-order statistical structure was systematically degraded while preserving the overall texture category. Participants listened to sounds with varying levels of naturalness, defined by their statistical fidelity to real-world textures, and performed a perceptual comparison task, judging whether two sequentially presented sounds matched in naturalness. We observed that increasing naturalness elicited stronger BOLD responses across bilateral auditory cortex for both reference and test sounds. Activity in medial temporal lobe regions, including the entorhinal cortex and hippocampus, was modulated by naturalness in a position-dependent manner. The entorhinal cortex activity was modulated only during the test sound, suggesting a role in perceptual-mnemonic comparison. The hippocampal connectivity with auditory cortex increased when reference textures were more degraded or less natural, indicating top-down inference under uncertainty. Together, these findings highlight the interplay between bottom-up encoding and memory-based mechanisms in supporting judgments of auditory realism based on summary statistics.