AudioSet-Tools: A Python Framework for Taxonomy-Aware AudioSet Curation and Reproducible Audio Research

Stefano Giacomelli
Marco Giordano
Claudia Rinaldi
Fabio Graziosi

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

This work presents AudioSet-Tools, a modular and composable Python framework designed to streamline the creation of task-specific datasets derived from Google’s AudioSet. Despite its extensive coverage, AudioSet suffers from weak labeling, class imbalance, and a loosely structured taxonomy, which limit its practical applicability in machine listening workflows. AudioSet-Tools addresses these issues through configurable taxonomy-aware label filtering and class re-balancing strategies. The framework includes automated routines for data download and transformation, enabling reproducible and semantically consistent dataset generation for both downstream fine-tuning and pre-training of machine/deep learning models. While domain-agnostic, we showcase its versatility through AudioSet-EV, a curated subset focused on emergency vehicle siren recognition — a socially relevant and technically challenging use case that exemplifies the structural and semantic gaps in AudioSet taxonomy. We further provide an extensive comparative benchmark of AudioSet-EV against state-of-the-art emergency vehicle corpora, with source code and datasets openly released on GitHub and Zenodo, to foster transparency and reproducibility in real-world audio signal processing research.

Version published to 10.21203/rs.3.rs-6957428/v1 on Research Square
Jun 24, 2025

Teacher-Student Framework for Short-Context Classification with Domain Adaptation and Data Augmentation

This article has 6 authors:
1. Fu Lei
2. Haoran Zheng
3. Beichen Liu
4. Zhejun Zhao
5. Lipeng Liu
6. Xuan Li
This article has no evaluationsLatest version May 30, 2025
Interpretability-Guided Adaptation for Robust DGA Detection with Large Language Models

This article has 3 authors:
1. Reynier Leyva La O
2. Carlos A. Catania
3. Tatiana S. Parlanti
This article has no evaluationsLatest version Jun 13, 2025
Semantic Encoding in Medical LLMs for Vocabulary Standardisation

This article has 3 authors:
1. Samuel Mainwood
2. Aashish Bhandari
3. Sonika Tyagi
This article has no evaluationsLatest version Jun 17, 2025

Listed in

Abstract

Article activity feed

Related articles

Teacher-Student Framework for Short-Context Classification with Domain Adaptation and Data Augmentation

Interpretability-Guided Adaptation for Robust DGA Detection with Large Language Models

Semantic Encoding in Medical LLMs for Vocabulary Standardisation