Energy Measurement and Secure Low-Power Optimization for Speech Systems on Intelligent Terminals

David R Mitchell
Hannah L. Wong
Thomas J Evans
Sophia K Turner

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

Speech algorithms on intelligent terminals face strict limits of power, memory, and heat, but most earlier work has focused on model size or FLOPs rather than real energy. This study measured the energy use of automatic speech recognition (ASR) and keyword spotting (KWS) on smartphones, embedded boards, and wearables. In total, 108–120 runs were carried out under controlled and office conditions. Methods included quantization, structured pruning, frame-rate subsampling, and runtime scheduling with dynamic voltage and frequency scaling (DVFS) and NPU offloading. The results showed that energy per inference dropped by 26–33% on average, with maximum savings near 40%. Accuracy stayed stable, with WER changes ≤0.2 and KWS F1 changes ≤0.3. A regression test confirmed that parameter count and feature size were the main factors linked to energy cost. The study suggests that direct energy measurement combined with model simplification and scheduling gives a practical way to build secure and low-power speech systems. The limits are the small number of devices, short test time, and no far-field or multilingual data. These gaps should be addressed in future work.

Version published to 10.20944/preprints202510.0757.v1
Oct 10, 2025

Energy Use of Speech Algorithms on Intelligent Terminals

This article has 3 authors:
1. Jonathan M Clarke
2. Mei Lin
3. Rachel A. Foster
This article has no evaluationsLatest version Oct 10, 2025
Energy Efficient Speech Algorithms for Intelligent Terminals with Pruning and Compression

This article has 5 authors:
1. Michael R Thompson
2. Eleanor J Smith
3. Daniel K. Brown
4. Charlotte L. Evans
5. Henry P. Wilson
This article has no evaluationsLatest version Oct 10, 2025
Mixed Signal Design Using C++ and DSP Acceleration for Low Latency and Secure Speech Systems

This article has 5 authors:
1. Ethan J. Turner
2. Grace L. Mitchell
3. Liam R. Scott
4. Chloe A. Wright
5. Isabella M. Parker
This article has no evaluationsLatest version Oct 9, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Energy Use of Speech Algorithms on Intelligent Terminals

Energy Efficient Speech Algorithms for Intelligent Terminals with Pruning and Compression

Mixed Signal Design Using C++ and DSP Acceleration for Low Latency and Secure Speech Systems