A Comprehensive Comparative Analysis of Convolutional Neural Network Architectures for Image Classification and Object Detection Tasks

Fahim Al Islam
Saif Hossain
Monir Hosen

Read the full article

Discuss this preprint

Start a discussion What are Sciety discussions?

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

This paper presents a comprehensive empirical evaluation of convolutional neural network (CNN) architectures across diverse computer vision tasks, encompassing multi-class image classification and bounding box object detection. We systematically compare five distinct model configurations: a custom- designed CNN architecture, VGG-16 with frozen pre-trained weights, VGG-16 with fine-tuned weights, ResNet-18 with frozen weights, and ResNet-18 with fine-tuned weights. Our experiments span five domain-specific datasets: agricultural imagery (Paddy and Mango classification), infrastructure assess- ment (Road and Footpath condition classification), and urban transportation (Rickshaw detection). We evaluate model performance using standard metrics including precision, recall, F1-score, and accuracy, while simultaneously analyzing computational efficiency through training time, GPU power consump- tion, memory utilization, and parameter counts. Our findings reveal that transfer learning with unfrozen weights consistently achieves superior classification performance, with VGG-16 demonstrating excep- tional results on the Mango dataset (F1=0.94) and Road dataset (F1=1.00). For object detection, ResNet-18 with unfrozen weights exhibits the highest precision-recall balance (F1=0.77). We further ob- serve that frozen backbone strategies significantly reduce computational overhead but often at the cost of model accuracy, particularly for complex classification tasks. This study provides actionable insights for practitioners selecting CNN architectures under varying computational and accuracy constraints.

Version published to 10.21203/rs.3.rs-8749762/v1 on Research Square
Feb 3, 2026

A Controlled Multi-dataset Evaluation of Custom CNNs, Pretrained Feature Extractors, and Transfer Learning

This article has 2 authors:
1. Tanzeem Maliat
2. Asfin Jannat Shamsi
This article has no evaluationsLatest version Jan 9, 2026
A Comparative Study of Different CNN Architectures for Real-World Image Classification in Bangladesh

This article has 3 authors:
1. Md. Sadmin Tahmid Khan
2. Ahad Bin Islam Shoeb
3. Arif Billah
This article has no evaluationsLatest version Jan 12, 2026
Balancing Accuracy and Efficiency: A Comparative Study of CNN Models on Versatile Image Datasets

This article has 2 authors:
1. Imran Zahid
2. Naima Hasan
This article has no evaluationsLatest version Feb 2, 2026

Discuss this preprint

Listed in

Abstract

Article activity feed

Related articles

A Controlled Multi-dataset Evaluation of Custom CNNs, Pretrained Feature Extractors, and Transfer Learning

A Comparative Study of Different CNN Architectures for Real-World Image Classification in Bangladesh

Balancing Accuracy and Efficiency: A Comparative Study of CNN Models on Versatile Image Datasets