TOPIC: Impact of FP16 Quantization on MobileNetV3Large Performance for Mung Bean Defect Classification.

Abstract

In resource-constrained environments, deploying deep learning models for real-time image classification is challenging due to limited computational power and memory. Existing solutions often rely on full-precision (FP32) models, which are computationally expensive and impractical for embedded systems. This study addresses the efficient deployment of deep learning models by evaluating the impact of 16-bit floating-point (FP16) quantization on the performance of MobileNetV3Large for mung bean seed defect classification. The proposed approach targets the limitations of current methods, which offer high accuracy but at the cost of large model size and slow inference. A dataset of 6,598 high-resolution images was constructed, with samples classified into five defect categories: broken, immature, infected, normal, and rotten. The baseline FP32 MobileNetV3Large model achieved a test accuracy of 94.85% with a model size of 16.2 MB and an inference speed of 3.5 frames per second (FPS). After FP16 quantization, the model size was reduced to 8.27 MB and inference speed increased to 8 FPS, a substantial improvement in memory and speed efficiency. Although accuracy dropped slightly to 93.86% (a reduction of 0.99 percentage points), this trade-off is acceptable for real-time applications on embedded platforms. These findings highlight the practical advantages of FP16 quantization for deploying lightweight yet accurate deep learning models in resource-constrained environments, and they support its viability for real-time agricultural applications such as automated seed sorting.
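The abstract does not state which quantization toolchain was used. As a minimal sketch, assuming a trained Keras model and TensorFlow Lite post-training quantization (a common route for deploying MobileNetV3 on embedded hardware), FP16 conversion looks roughly like the following; the file names are hypothetical placeholders, not the authors' artifacts:

```python
# Sketch: FP16 post-training quantization with TensorFlow Lite.
# Assumes a trained 5-class MobileNetV3Large classifier saved as a
# Keras model; paths below are illustrative, not from the paper.
import tensorflow as tf

# Load the trained classifier (hypothetical file name).
model = tf.keras.models.load_model("mobilenetv3large_mungbean.h5")

converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
# Restrict weight storage to float16, roughly halving model size.
converter.target_spec.supported_types = [tf.float16]

tflite_fp16 = converter.convert()
with open("mobilenetv3large_mungbean_fp16.tflite", "wb") as f:
    f.write(tflite_fp16)
```

Because FP16 stores each weight in 2 bytes instead of 4, this kind of conversion roughly halves on-disk size, consistent with the reported reduction from 16.2 MB to 8.27 MB.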
