Generalized Quantization of Faster R-CNN

Tamás Menyhárt
Róbert Lakatos

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

The Faster Region-based Convolutional Network (Faster R-CNN) is an efficient object detection model. However, its large size and significant computational requirements limit its applicability in embedded systems and real-time environments. Quantization is a proven method for reducing models' size and computational requirements, but there is currently no open-source general implementation for quantizing Faster R-CNN. The main reason is that individual architecture components need to be quantized separately due to their structural characteristics. We present a general Faster R-CNN quantization algorithm, for which our implementation is open-source and compatible with the PyTorch ecosystem. Our solution reduces the model size by 67.2\% and the detection time by 50.4\% while maintaining the accuracy measured on the test data within an error margin of 8.2\% and a standard deviation of ± 3.4\%. It also allows for the visualization of model errors by extracting the model's internal activation maps, supporting a more efficient understanding of its behavior. We demonstrate that the proposed method can effectively quantize Faster R-CNN, enabling the model to run on low-power hardware. This is particularly important in applications such as autonomous vehicles, embedded sensor systems, and real-time security surveillance, where fast and energy-efficient object detection is crucial.

Version published to 10.20944/preprints202510.0354.v1
Oct 6, 2025

Revisiting Convolutional Design for Efficient CNNs: An Empirical Study on Embedded AI Platforms

This article has 1 author:
1. Onur Erdem Korkmaz
This article has no evaluationsLatest version Aug 25, 2025
Visual Localisation Using Deep Learning and Graph Neural Networks: Approaches and Evaluation

This article has 1 author:
1. Dinesh Kumar Koilada
This article has no evaluationsLatest version Aug 20, 2025
Accelerated Border Tracking in Binary Images with GPUs

This article has 4 authors:
1. Pedro Alonso-Jordá
2. Roberto Díaz-Cano
3. Enrique S. Quintana-Ortí
4. Francesc Folch
This article has no evaluationsLatest version Oct 10, 2025

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

Revisiting Convolutional Design for Efficient CNNs: An Empirical Study on Embedded AI Platforms

Visual Localisation Using Deep Learning and Graph Neural Networks: Approaches and Evaluation

Accelerated Border Tracking in Binary Images with GPUs