Low-dimensional learned feature spaces quantify individual and group differences in vocal repertoires

Jack Goffinet
Samuel Brudner
Richard Mooney
John Pearson

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Increases in the scale and complexity of behavioral data pose an increasing challenge for data analysis. A common strategy involves replacing entire behaviors with small numbers of handpicked, domain-specific features, but this approach suffers from several crucial limitations. For example, handpicked features may miss important dimensions of variability, and correlations among them complicate statistical testing. Here, by contrast, we apply the variational autoencoder (VAE), an unsupervised learning method, to learn features directly from data and quantify the vocal behavior of two model species: the laboratory mouse and the zebra finch. The VAE converges on a parsimonious representation that outperforms handpicked features on a variety of common analysis tasks, enables the measurement of moment-by-moment vocal variability on the timescale of tens of milliseconds in the zebra finch, provides strong evidence that mouse ultrasonic vocalizations do not cluster as is commonly believed, and captures the similarity of tutor and pupil birdsong with qualitatively higher fidelity than previous approaches. In all, we demonstrate the utility of modern unsupervised learning approaches to the quantification of complex and high-dimensional vocal behavior.

Version published to 10.7554/elife.67855 on eLife
May 14, 2021
Version published to 10.1101/811661 on bioRxiv
Oct 21, 2019

Measurement and comparison of acoustic space use in vocalizations of humans and close primate relatives

This article has 3 authors:
1. Hans T. Bilger
2. Michael J. Ryan
3. Julia A. Clarke
This article has no evaluationsLatest version Jun 16, 2026
A detailed investigation of Shared Variance Component Analysis as a tool to characterize neural dimensionality

This article has 2 authors:
1. Alejandro Carballosa
2. Alessandro Torcini
This article has no evaluationsLatest version May 4, 2026
An annotated bioacoustic dataset of dog vocalizations and related sounds during dog-human social play

This article has 10 authors:
1. Laura V. Cuaya
2. Paula Pérez-Fraga
3. Raúl Hernández-Pérez
4. Louisa Pillwax
5. Ines Waldecker
6. Csenge Reisinger
7. Tamás Faragó
8. Sasha Winkler
9. Ludwig Huber
10. Claus Lamm
This article has no evaluationsLatest version Apr 23, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Measurement and comparison of acoustic space use in vocalizations of humans and close primate relatives

A detailed investigation of Shared Variance Component Analysis as a tool to characterize neural dimensionality

An annotated bioacoustic dataset of dog vocalizations and related sounds during dog-human social play