Neural Construction of Temporal Hierarchies in Speech Processing
Abstract
Understanding spoken language requires the brain to transform continuous acoustic input into hierarchically organized units such as syllables, words, and phrases. Neural oscillations are known to align with these units, but it remains unclear whether this alignment predominantly reflects structured linguistic representations, statistical regularities, prosodic patterns, or a combination of these factors, let alone how these factors interact. Using magnetoencephalography (MEG), we systematically manipulated the availability of structural cues (prosodic, statistical, and linguistic) in synthesized speech streams in both Dutch and Mandarin Chinese. We found that statistical regularities alone can elicit hierarchical neural tracking, but that distinct cortical spatiotemporal dynamics emerged when additional cues (prosodic or linguistic) were present. Neural phase and power responses jointly reflected the type and strength of the available cues, revealing how structured information sharpens the brain's temporal alignment to speech; phase and power nevertheless dissociated in their relationships to structure and content. Based on these neural findings, we propose a theoretical model outlining how temporal hierarchies are constructed across time, frequency, and space. Bivariate analyses and encoding simulations further validated our model and clarified how different types of cues are represented and integrated over time. Together, our MEG and modeling results suggest that the brain engages a generalized mechanism for organizing perceptual units in speech into temporal hierarchies, but that cortical dynamics are sensitive to the type of information available, as reflected in cue-dependent, coordinated changes in both phase and power across cortical regions.