Improving Speaker Identification in Reverberant Environments Using MFCCs and Comb Filtering with Neural Network Classification
Abstract
Reverberation presents a persistent challenge to the accuracy of speaker identification systems, especially in real-world acoustic settings. This paper proposes a robust and lightweight framework that enhances speaker recognition performance under reverberant conditions by combining comb filtering, Mel-Frequency Cepstral Coefficients (MFCCs), and a neural network classifier. The comb filter is applied as a preprocessing stage to suppress delayed reflections and reduce temporal smearing of the speech signal prior to feature extraction. Experimental evaluations were conducted across multiple reverberation levels (RT60 = 0.3 s to 0.9 s) and noise conditions (SNR from 30 dB down to 0 dB). Results show that the proposed system outperforms baseline and transformation-based methods, achieving a recognition accuracy of 85.4% at RT60 = 0.9 s compared to 70.2% for the unfiltered baseline, and up to 97.6% in low-reverberation scenarios. Additionally, the comb filter introduces a non-invertible transformation that enables cancelable biometric templates, reinforcing the system's security. The proposed method strikes a balance between effectiveness, simplicity, and privacy, making it well suited for real-time speaker identification in reverberant and noisy environments.
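To make the preprocessing idea concrete, the sketch below implements an inverse (feedback) comb filter that cancels a single dominant reflection before feature extraction. This is a minimal illustration, not the paper's implementation: the abstract does not specify the filter structure or parameters, so the single-echo model, the delay `delay` (in samples), and the attenuation `alpha` are all assumptions for demonstration.

```python
import numpy as np

def comb_filter(x, delay, alpha):
    """Inverse (feedback) comb filter: y[n] = x[n] - alpha * y[n - delay].

    Under the simplifying assumption that the reverberant signal is
    x[n] = s[n] + alpha * s[n - delay] (one reflection), this recursion
    recovers the dry signal s exactly. Real rooms have many reflections,
    so in practice it only attenuates the dominant echo.
    """
    y = np.asarray(x, dtype=float).copy()
    for n in range(delay, len(y)):
        y[n] -= alpha * y[n - delay]
    return y

# Demonstration on a synthetic single-echo signal.
rng = np.random.default_rng(0)
s = rng.standard_normal(1000)          # "dry" speech stand-in
delay, alpha = 50, 0.6                 # assumed echo parameters
echo = s.copy()
echo[delay:] += alpha * s[:-delay]     # add one delayed reflection
restored = comb_filter(echo, delay, alpha)
```

In the full pipeline described above, the filtered signal would then be passed to an MFCC front end and the resulting coefficients to the neural network classifier. Because the recursion is applied sample by sample and depends on its own past outputs, discarding or perturbing the parameters yields a transformation that is hard to invert without them, which is the property the abstract exploits for cancelable templates.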