Features underlying speech versus music as categories of auditory experience

Abstract

Classifying whether an auditory signal is music or speech is important for both humans and computer systems, and critical for many applications. Although previous literature suggests that music and speech are easily separable categories, our approaches to studying the distinction have arguably biased findings in that direction: prior studies often use material from different sound sources and provide response-label categories. Here, we use stimulus material from the dùndún drum, a speech surrogate that can signal either speech-related or musical content and is largely unfamiliar to Western listeners. We first replicate standard speech-music categorization results (N=108). Then, we depart from the typical experimental procedure by asking new participants (N=180) to freely sort and then label the stimulus material. Hierarchical clustering of participants' stimulus groupings shows that the speech/music distinction emerges but is not primary. By reverse engineering the acoustic features and semantic labels underlying participants' auditory categories, we provide an empirical argument for moderating the long-standing, but nevertheless disputable, perceptual salience of the music and speech categories.