DCUSV: Deep Clustering of Ultrasonic Vocalizations in Rodents

Sabah Shahnoor Anis
Devin M. Kellis
Kris Ford Kaigler
Marlene A. Wilson
Christian O’Reilly

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Analyzing ultrasonic vocalizations (USVs) is critical for understanding rodents’ emotional states and social behaviors. This work presents Deep Clustering of USVs (DCUSV), an automated deep clustering pipeline for analyzing preprocessed USV contours that addresses key challenges in effectively clustering USVs and revealing distinct patterns in rodent vocal behavior. DCUSV employs a dense autoencoder to compress high-dimensional spectrograms into a latent space suitable for clustering, followed by a combination of Uniform Manifold Approximation and Projection (UMAP), Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN), and Agglomerative clustering, with hyperparameter optimization, to group USVs based on their spectro-temporal features. Clustering is evaluated using the Silhouette Coefficient, Calinski-Harabasz Index, and Davies-Bouldin Index. In addition, Principal Component Analysis (PCA), t-distributed Stochastic Neighbor Embedding (t-SNE), and UMAP are used to visualize and analyze the clustering results. Performance benchmarks revealed that DCUSV achieves 1.50× higher Silhouette score than K-Means and Deep Embedded Clustering (DEC), and 1.29× higher Silhouette score than HDBSCAN; 3.07×, 3.11×, and 29.15× higher Calinski-Harabasz scores than K-Means, DEC, and HDBSCAN, respectively; and 2.28×, 2.31×, and 1.92× lower Davies-Bouldin scores (lower is better) than K-Means, DEC, and HDBSCAN, respectively. Applying DCUSV revealed four distinct call-type families, closely aligning with manually defined categories without requiring manual grouping. Furthermore, DCUSV identified statistically significant shifts in call-type distributions across experimental conditions, demonstrating its ability to capture behaviorally meaningful changes in vocal expression. Thus, DCUSV enables robust analysis of USV structure and uncovers novel patterns in rodent vocal behavior.

Version published to 10.21203/rs.3.rs-9068431/v1 on Research Square
Mar 25, 2026

Low-dimensional neural dynamics underlying rhythmic vocal behavior in songbirds

This article has 6 authors:
1. Fiamma L. Leites
2. Cecilia T. Herbert
3. Santiago Boari
4. Felipe I. Cignoli
5. Gabriel B. Mindlin
6. Ana Amador
This article has no evaluationsLatest version Mar 31, 2026
Acoustic Features and Recognition of Distress Calls in <em>Rhinolophus ferrumequinum</em>: A Study Combining Machine Learning and Playback Experiments

This article has 5 authors:
1. Jingyan Hao
2. Jiaxi Li
3. Bingbing Wang
4. Meiyan Guo
5. Xiao Tan
This article has no evaluationsLatest version Mar 31, 2026
Selective approach behavior toward context-dependent ultrasonic vocalizations in male mice

This article has 5 authors:
1. Katsumasa Takahashi
2. Kazuma Hase
3. Takumi Miyajima
4. Jumpei Matsumoto
5. Tetsufumi Ito
This article has no evaluationsLatest version Apr 10, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Low-dimensional neural dynamics underlying rhythmic vocal behavior in songbirds

Acoustic Features and Recognition of Distress Calls in <em>Rhinolophus ferrumequinum</em>: A Study Combining Machine Learning and Playback Experiments

Selective approach behavior toward context-dependent ultrasonic vocalizations in male mice