Vocal Call Locator Benchmark (VCL) for localizing rodent vocalizations from multi-channel audio

Ralph E Peterson
Aramis Tanelus
Christopher Ick
Bartul Mimica
Niegil Francis
Violet J Ivan
Aman Choudhri
Annegret L Falkner
Mala Murthy
David M Schneider
Dan H Sanes
Alex H Williams

Abstract

Understanding the behavioral and neural dynamics of social interactions is a goal of contemporary neuroscience. Many machine learning methods have emerged in recent years to make sense of the complex video and neurophysiological data that result from these experiments. Less focus has been placed on understanding how animals process acoustic information, including social vocalizations. A critical step to bridge this gap is determining the senders and receivers of acoustic information in social interactions. While sound source localization (SSL) is a classic problem in signal processing, existing approaches are limited in their ability to localize animal-generated sounds in standard laboratory environments. Advances in deep learning methods for SSL are likely to help address these limitations; however, there are currently no publicly available models, datasets, or benchmarks to systematically evaluate SSL algorithms in the domain of bioacoustics. Here, we present the VCL Benchmark: the first large-scale dataset for benchmarking SSL algorithms in rodents. We acquired synchronized video and multi-channel audio recordings of 767,295 sounds with annotated ground truth sources across 9 conditions. The dataset provides benchmarks that evaluate SSL performance on real data, simulated acoustic data, and a mixture of real and simulated data. We intend for this benchmark to facilitate knowledge transfer between the neuroscience and acoustic machine learning communities, which have had limited overlap.

Version published to 10.1101/2024.09.20.613758 on bioRxiv
Sep 21, 2024

