Less is More: Quantization of Deep Neural Networks

Abstract

This project investigates the effects of weight quantization on deep neural network performance, focusing on image datasets. Weight quantization reduces the precision of model weights, which shrinks model size and computational requirements and makes models suitable for resource-constrained devices. The project explores the impact of these techniques on model accuracy, training efficiency, and generalization. The work is motivated by the need for efficient and effective deep learning models for image classification that can be deployed readily in real-world applications.
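To make the idea concrete, here is a minimal sketch of symmetric uniform weight quantization, the simplest form of the technique the abstract describes. The function names, the 8-bit width, and the use of NumPy are illustrative assumptions, not details taken from the article itself:

```python
import numpy as np

def quantize_weights(w, num_bits=8):
    """Symmetric uniform quantization of a float weight array to signed ints.

    The largest weight magnitude is mapped to the largest representable
    integer, so each quantized value approximates w / scale.
    """
    qmax = 2 ** (num_bits - 1) - 1          # e.g. 127 for 8-bit signed
    scale = np.max(np.abs(w)) / qmax        # one scale for the whole tensor
    q = np.clip(np.round(w / scale), -qmax, qmax).astype(np.int8)
    return q, scale

def dequantize_weights(q, scale):
    """Recover a float approximation of the original weights."""
    return q.astype(np.float32) * scale

# Example: quantize random float32 weights and inspect the rounding error.
w = np.random.randn(256, 128).astype(np.float32)
q, scale = quantize_weights(w, num_bits=8)
w_hat = dequantize_weights(q, scale)
print(q.dtype, float(np.max(np.abs(w - w_hat))))
```

Storing `q` (int8) instead of `w` (float32) cuts the weight storage by 4x; the per-tensor `scale` is the only extra value needed to reconstruct approximate weights, at the cost of a bounded rounding error of at most half the scale per weight.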