Enhancing Environmental Sound Classification Performance through Data Fusion: A Comparative Machine Learning Analysis
Abstract
Environmental Sound Classification (ESC) has become a fundamental component of intelligent acoustic systems, enabling applications such as smart cities, environmental monitoring, and public safety. This study proposes a comprehensive feature-level fusion framework for machine learning-based ESC. We extract complementary features from the UrbanSound8K dataset: time-domain attributes, namely the Zero Crossing Rate (ZCR) and Root Mean Square (RMS) energy, and frequency-domain descriptors, namely Mel-Frequency Cepstral Coefficients (MFCC) and Chroma features, which are then concatenated into an enriched representation space. To ensure robustness, multiple preprocessing configurations were evaluated across various window sizes, hop lengths, and sampling rates. Seven classifiers, including a Multi-Layer Perceptron (MLP), XGBoost, and a Support Vector Machine (SVM), were systematically compared using both individual and fused feature sets. The results demonstrate that feature-level fusion consistently enhances classification performance, achieving a maximum accuracy of 94.4% with the MLP model and significantly outperforming baseline configurations that rely on individual features. These findings confirm that integrating heterogeneous acoustic features at the feature level substantially improves the generalization and robustness of environmental sound recognition, offering a scalable pathway for real-world acoustic scene analysis and intelligent monitoring infrastructures.

Our main contributions are summarized as follows:

1. A feature-level fusion strategy is proposed, integrating time-domain (ZCR, RMS) and frequency-domain (MFCC, Chroma) acoustic features to construct a robust and discriminative representation for environmental sound classification (see the extraction sketch after this list).
2. An extensive experimental setup is designed, enabling a detailed performance analysis across diverse acoustic preprocessing configurations by systematically varying the window size, hop length, and sampling rate.
3. A systematic evaluation of multiple machine learning classifiers (SVM, K-NN, Decision Tree, Random Forest, Naive Bayes, XGBoost, and MLP) is conducted to assess the impact of feature fusion on classification performance (an illustrative comparison loop also follows this list).
4. Performance comparisons demonstrate that the fused feature set significantly outperforms individual feature inputs, achieving a peak classification accuracy of 94.4% with the MLP model, thereby validating the efficacy of the proposed fusion approach.
5. The results validate the suitability of the proposed system for real-world acoustic monitoring tasks, including smart city surveillance and urban environmental sound recognition.
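Since the abstract does not include code, the following is a minimal sketch of the described extraction-and-fusion step using librosa. The function name, the mean-pooling of frame-level features over time, and the default window size (n_fft), hop length, and sampling rate are our illustrative assumptions, not the authors' exact configuration.

```python
import numpy as np
import librosa

def extract_fused_features(path, sr=22050, n_fft=2048, hop_length=512, n_mfcc=13):
    """Build one fused feature vector from time- and frequency-domain descriptors."""
    y, sr = librosa.load(path, sr=sr)

    # Time-domain descriptors (computed per frame)
    zcr = librosa.feature.zero_crossing_rate(y, frame_length=n_fft, hop_length=hop_length)
    rms = librosa.feature.rms(y=y, frame_length=n_fft, hop_length=hop_length)

    # Frequency-domain descriptors (computed per frame)
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc, n_fft=n_fft, hop_length=hop_length)
    chroma = librosa.feature.chroma_stft(y=y, sr=sr, n_fft=n_fft, hop_length=hop_length)

    # Feature-level fusion: summarize each descriptor by its mean over frames,
    # then concatenate into a fixed-length vector (1 + 1 + n_mfcc + 12 values).
    return np.concatenate([
        zcr.mean(axis=1),
        rms.mean(axis=1),
        mfcc.mean(axis=1),
        chroma.mean(axis=1),
    ])
```

The preprocessing grid described in the abstract corresponds to sweeping the sr, n_fft, and hop_length arguments of this function and re-extracting the dataset for each configuration.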
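For the classifier comparison, a sketch along the following lines could reproduce the evaluation loop. The hyperparameters are placeholders rather than the paper's settings, X and y are assumed to hold the fused feature vectors and the ten UrbanSound8K class labels, and a faithful reproduction should respect the dataset's predefined ten folds instead of the plain cross-validation shown here.

```python
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.neural_network import MLPClassifier
from xgboost import XGBClassifier

# X: rows of fused feature vectors from extract_fused_features
# y: integer labels for the ten UrbanSound8K classes
classifiers = {
    "SVM": SVC(kernel="rbf"),
    "K-NN": KNeighborsClassifier(n_neighbors=5),
    "Decision Tree": DecisionTreeClassifier(),
    "Random Forest": RandomForestClassifier(n_estimators=200),
    "Naive Bayes": GaussianNB(),
    "XGBoost": XGBClassifier(),
    "MLP": MLPClassifier(hidden_layer_sizes=(256, 128), max_iter=500),
}

for name, clf in classifiers.items():
    # Standardize the fused features before each classifier,
    # then report mean cross-validated accuracy.
    pipe = make_pipeline(StandardScaler(), clf)
    scores = cross_val_score(pipe, X, y, cv=5, scoring="accuracy")
    print(f"{name}: {scores.mean():.3f}")
```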