Measurement and comparison of acoustic space use in vocalizations of humans and close primate relatives
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
The human larynx, compared to those of closely related primates, lies deeper in the throat and lacks vocal membranes and air sacs. These shifts are usually analyzed regarding their acoustic effects on vowel-like vocalizations, since the evolution of speech was long thought to require an expansion of vocal range driven by vocal tract modifications. However, vowels are just one type of phoneme, and speech is just one class of human utterance. To understand the evolutionary underpinnings of known shifts in human vocal morphology, a broader bioacoustic comparison is needed. Specifically, the range of sounds used in human speech must be compared to that employed in other human vocalizations and in the repertoires of extant close primate relatives. Here, we measure the acoustic-feature space occupied by human speech, non-linguistic, and musical vocalizations along with the calls of chimpanzees, bonobos, and chacma baboons. We use Mel-frequency cepstral coefficients to create an acoustic space depicting the spectro-temporal features of over 750,000 brief vocal segments sourced from published databases and other verified sources. Speech and song occupied significantly less volume in this acoustic space than human non-linguistic vocalizations. In addition, the acoustic-feature volumes of speech and song were not statistically distinct from those of non-human primates. These results suggest that speech was not enabled by an expansion of human vocal acoustic space. Anatomical shifts unique to humans may have led to an elaboration of non-linguistic utterances, but learned vocalizations use a surprisingly small fraction of this space. Our understanding of human vocal evolution will be further informed by additional systematic comparisons of the function and homology of non-speech vocalizations, along with the collection and incorporation of more complete non-human primate vocal datasets, especially from Gorilla and Orangutan.