Enhancing Soundscape Characterization and Pattern Analysis Using Low-Dimensional Deep Embeddings on a Large-Scale Dataset

Daniel Alexis Nieto-Mora
Leonardo Duque-Muñoz
Juan David Martínez-Vargas

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Soundscape monitoring has become an increasingly important tool for studying ecological processes and supporting habitat conservation. While many recent advances focus on identifying species through supervised learning, there is growing interest in understanding the soundscape as a whole considering patterns that go beyond individual vocalizations. This broader view requires unsupervised approaches capable of capturing meaningful structures related to temporal dynamics, frequency content, spatial distribution, and ecological variability. In this study, we present a fully unsupervised framework for analyzing large-scale soundscape data using deep learning. We applied a convolutional autoencoder (Soundscape-Net) to extract acoustic representations from over 60,000 recordings collected across a grid-based sampling design in the Rey Zamuro Reserve, Colombia. Dimensionality reduction methods (UMAP and PaCMAP) were used to project the learned features, followed by clustering with KMeans and DBSCAN to explore latent acoustic structures. To interpret and validate the resulting clusters, we combined multiple strategies: spatial mapping through interpolation, analysis of acoustic index variance to understand cluster structure, and graph-based connectivity analysis to identify ecological relationships between recording sites. Our results demonstrate that this approach can uncover both local and broad-scale patterns in the soundscape, providing a flexible and interpretable pathway for unsupervised ecological monitoring.

Version published to 10.20944/preprints202508.0808.v1
Aug 12, 2025

Stratified Active Learning for Spatiotemporal Generalisation in Large-Scale Bioacoustic Monitoring

This article has 3 authors:
1. Ben McEwen
2. Corentin Bernard
3. Dan Stowell
This article has no evaluationsLatest version Sep 5, 2025
Detection, communication, and individual identification with deep audio embeddings: A case study with North Atlantic right whales

This article has 5 authors:
1. Irina Tolkova
2. Holger Klinck
3. Dana A. Cusano
4. Anke Kügler
5. Susan E. Parks
This article has no evaluationsLatest version Jul 16, 2025
DualStack: Multi-Resolution Spectrogram Fusion Improves Bird Sound Classification for Ecological Monitoring

This article has 3 authors:
1. Chitrang Patel
2. Tanishka Gupta
3. Sapan Mankad
This article has no evaluationsLatest version Aug 11, 2025

Listed in

Abstract

Article activity feed

Related articles

Stratified Active Learning for Spatiotemporal Generalisation in Large-Scale Bioacoustic Monitoring

Detection, communication, and individual identification with deep audio embeddings: A case study with North Atlantic right whales

DualStack: Multi-Resolution Spectrogram Fusion Improves Bird Sound Classification for Ecological Monitoring