A Review of Deep Learning for Speech Recognition and Its Application in Advanced Hearing Assistance for the Elderly
Abstract
The aging global population is experiencing a rising prevalence of age-related hearing loss. Conventional hearing aids often fail in noisy environments, leading to user dissatisfaction. Recent advances in deep learning, particularly in automatic speech recognition (ASR) and intelligent noise filtering, highlight the potential for a new generation of hearing aids. This paper explores the transformative potential of evolving deep learning architectures to address the critical limitation of noise suppression. We review the progression of models for auditory processing: from Deep Neural Networks (DNNs) for basic noise reduction, to Convolutional Neural Networks (CNNs), which analyse spectral features in audio spectrograms, to Recurrent Neural Networks (RNNs) and sequence-to-sequence (seq2seq) models, and finally to Transformer models, which build on the seq2seq framework and further improve the handling of temporal speech patterns. We conclude that integrating these sophisticated models into next-generation hearing aids is essential for dramatically improving speech intelligibility in complex settings. This technological evolution promises to enhance quality of life for the aging population by reducing listening effort and promoting social engagement.
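To make the spectrogram-domain processing attributed to CNNs above more concrete, the following minimal sketch (not drawn from any of the reviewed systems; it assumes PyTorch, and the class name SpectrogramMaskCNN is hypothetical) shows a small convolutional network that predicts a time-frequency mask to suppress noise in a log-mel spectrogram.

# Illustrative sketch only: a tiny CNN that maps a noisy spectrogram to a
# time-frequency mask for speech enhancement. Shapes and layer sizes are
# assumptions for demonstration, not parameters from the reviewed work.
import torch
import torch.nn as nn

class SpectrogramMaskCNN(nn.Module):
    def __init__(self, channels: int = 16):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(1, channels, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.Conv2d(channels, 1, kernel_size=3, padding=1),
            nn.Sigmoid(),  # mask values constrained to [0, 1]
        )

    def forward(self, noisy_spec: torch.Tensor) -> torch.Tensor:
        # noisy_spec: (batch, 1, n_mels, n_frames)
        mask = self.net(noisy_spec)
        return mask * noisy_spec  # estimated clean spectrogram

if __name__ == "__main__":
    model = SpectrogramMaskCNN()
    noisy = torch.randn(2, 1, 80, 200).abs()  # dummy batch of spectrograms
    enhanced = model(noisy)
    print(enhanced.shape)  # torch.Size([2, 1, 80, 200])

In practice, such a mask-based front end would be trained on pairs of noisy and clean recordings and could precede an ASR or hearing-aid amplification stage; the recurrent and Transformer models discussed later in the review extend this idea by modelling longer temporal context across frames.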