Synergistic Entropy Coding and Quantization Enable Efficient On-Device Neural Networks
Abstract
Deploying deep neural networks (DNNs) on edge devices poses significant challenges due to constrained memory, compute, and energy resources. Conventional model compression pipelines, comprising separate stages of pruning, quantization, and entropy coding, often fail to deliver optimal trade-offs between efficiency and accuracy. In this work, we propose Synergistic Entropy–Quantization (SyE-C²Q), a unified compression framework that jointly optimizes quantization precision and entropy coding based on the statistical structure of model parameters. By aligning quantization levels with symbol probabilities and adapting quantization step sizes to entropy estimates, SyE-C²Q reduces the bit-rate while maintaining high inference accuracy. Extensive experiments on MobileNet-V2 and ResNet-18 with CIFAR-10 and ImageNet demonstrate that SyE-C²Q achieves up to 3.6× model compression, <1% accuracy degradation, and ~40% energy savings compared to conventional post-training quantization techniques. Furthermore, the compressed models exhibit improved inference latency and memory utilization on ARM-based edge hardware. Unlike traditional pipelines, SyE-C²Q integrates entropy-guided quantization directly into the compression loop, establishing a new benchmark in the design of resource-efficient, deployable deep learning systems.
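The abstract describes adapting quantization step sizes to entropy estimates of the weight distribution. The sketch below is a minimal, illustrative take on that general idea, not the authors' SyE-C²Q algorithm: it bisects a uniform quantization step size until the empirical entropy of the quantized symbols meets a target bit budget. The function names (`entropy_guided_quantize`, `symbol_entropy`), the `target_bits` parameter, and the bisection heuristic are assumptions introduced for illustration.

```python
# Illustrative sketch only: uniform quantization with an entropy-guided step
# size. The bisection-on-step heuristic is an assumption, not the paper's
# SyE-C2Q method.
import numpy as np

def symbol_entropy(symbols):
    """Shannon entropy (bits per symbol) of a discrete symbol array."""
    _, counts = np.unique(symbols, return_counts=True)
    p = counts / counts.sum()
    return float(-(p * np.log2(p)).sum())

def entropy_guided_quantize(weights, target_bits=4.0, iters=20):
    """Adapt a uniform quantization step so the empirical entropy of the
    quantized symbols approaches a target bit-rate budget."""
    w = weights.ravel()
    lo, hi = 1e-8, float(np.abs(w).max()) + 1e-8  # bracket for the step size
    step = hi
    for _ in range(iters):
        step = 0.5 * (lo + hi)
        q = np.round(w / step).astype(np.int64)
        if symbol_entropy(q) > target_bits:
            lo = step   # entropy too high: coarsen (larger step)
        else:
            hi = step   # under budget: refine (smaller step)
    q = np.round(w / step).astype(np.int64)
    return q.reshape(weights.shape), step, symbol_entropy(q)

# Usage on a synthetic, roughly Laplacian weight tensor.
rng = np.random.default_rng(0)
w = rng.laplace(scale=0.05, size=(256, 256)).astype(np.float32)
q, step, bits = entropy_guided_quantize(w, target_bits=4.0)
w_hat = q * step
print(f"step={step:.5f}, entropy={bits:.2f} bits/weight, "
      f"MSE={np.mean((w - w_hat) ** 2):.2e}")
```

Under this assumed scheme, the resulting symbol stream would then be passed to an entropy coder (e.g., arithmetic or Huffman coding) whose code lengths match the measured symbol probabilities; the joint optimization of that coder with the step size is what the paper's unified framework addresses.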