Hardware-Efficient Neural Network Implementation: A Power-Accuracy Trade-off Analysis for Quantized Classification Neural Networks
Abstract
This paper presents a comprehensive analysis of power-accuracy trade-offs in quantized neural network implementations for Application-Specific Integrated Circuit (ASIC) design. A three-layer feedforward neural network trained on the Wisconsin Breast Cancer dataset is implemented using a complete design flow from PyTorch model training to ASIC synthesis. The study evaluates 14-bit, 16-bit, and 18-bit uniform post-training quantization schemes and their impact on classification accuracy, power consumption, and area utilization. A lookup table (LUT) based sigmoid activation function is employed to reduce computational complexity in the hardware implementation. The design is synthesized with the Cadence Stratus High-Level Synthesis (HLS) tool, targeting a 500 MHz operating frequency in GPDK 45 nm technology. Results demonstrate that 18-bit quantization achieves 95.6% accuracy with 2.44 mW power consumption and 183,963 GE (Gate Equivalent) area, the best balance between computational precision and hardware efficiency among the evaluated configurations. The 16-bit implementation provides a reasonable compromise with 89.4% accuracy, 1.819 mW power, and 162,379 GE area, while the 14-bit version degrades sharply to 64.9% accuracy at 1.924 mW, consuming less power than the 18-bit design but, notably, more than the 16-bit one.
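The abstract names two techniques without detailing their parameters: uniform post-training quantization and a LUT-based sigmoid. The sketch below is a minimal Python/NumPy illustration of both, assuming a signed fixed-point format, a 256-entry table, an input range of [-8, 8], and a 12-bit fractional split; these parameters are illustrative assumptions, not values taken from the paper.

```python
import numpy as np

def quantize_uniform(x, n_bits, frac_bits):
    """Uniform fixed-point quantization: round to the nearest multiple of
    2**-frac_bits, then clip to the signed n_bits range."""
    step = 2.0 ** -frac_bits
    q_min = -(2 ** (n_bits - 1)) * step
    q_max = (2 ** (n_bits - 1) - 1) * step
    return np.clip(np.round(x / step) * step, q_min, q_max)

def make_sigmoid_lut(n_entries=256, x_min=-8.0, x_max=8.0,
                     n_bits=16, frac_bits=12):
    """Precompute sigmoid over [x_min, x_max] and quantize the outputs to the
    same fixed-point format as the datapath (entry count and range assumed)."""
    xs = np.linspace(x_min, x_max, n_entries)
    ys = 1.0 / (1.0 + np.exp(-xs))
    return quantize_uniform(ys, n_bits, frac_bits)

def sigmoid_lut(x, lut, x_min=-8.0, x_max=8.0):
    """Nearest-entry table lookup; inputs outside the table range saturate to
    the first/last entry, as a hardware ROM-based sigmoid typically would."""
    idx = np.round((x - x_min) / (x_max - x_min) * (len(lut) - 1))
    idx = np.clip(idx, 0, len(lut) - 1).astype(int)
    return lut[idx]
```

In such a flow, trained PyTorch weights would be passed through `quantize_uniform` at 14, 16, or 18 bits before export to HLS, and the LUT would replace the exact sigmoid in the synthesized datapath; the accuracy gap between the 14-bit and 18-bit results reflects how much resolution the rounding and clipping steps remove.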