Performance Analysis of Speech Recognition Models in Automated Scoring of the QuickSIN Test
Abstract
Purpose
Best practices in audiology recommend assessing speech understanding in noisy environments, especially for those with communication difficulties. Speech-in-noise (SiN) assessments such as the QuickSIN are used for validating signal processing in hearing aids (HAs) and are linked to HA satisfaction. This project seeks to enhance QuickSIN test efficiency by applying recent advancements in automatic speech recognition (ASR) technologies.
Method
Twenty-three adults with sensorineural hearing loss were fitted bilaterally with Unitron Moxi HAs and were administered the QuickSIN test in low and high reverberation environments. Testing was performed with two different HA programs: an omnidirectional program and a fixed directional microphone program. QuickSIN sentences were presented from 0° azimuth and competing babble from either 0°, laterally from 90° or 270°, or simultaneously from 90°, 180°, and 270° azimuths. Participants’ verbal responses to QuickSIN stimuli were scored by an audiologist and were recorded in parallel for offline transcription and scoring by ASR models from Amazon, Microsoft, NVIDIA, and Picovoice. The ASR-derived QuickSIN scores were compared to the corresponding audiologist-derived scores.
Results
Repeated-measures ANOVA results revealed that all ASR models overestimated QuickSIN scores across most test conditions. Bland-Altman analyses showed that, relative to manual scoring by an experienced audiologist, the Amazon ASR model had the least bias and the narrowest limits of agreement.
Conclusions
Some ASR models, such as Amazon's, demonstrated performance comparable to that of an audiologist in automatically scoring QuickSIN tests. However, further refinements are necessary to increase the robustness of the ASR models in test conditions yielding low SNR loss.