Assessing the 3D position of a car with a single 2D camera using mainstream DCNN models
Abstract
Deep Convolutional Neural Networks (DCNNs) are regarded as one of the foundations of computer vision due to their unparalleled ability to process visual data. This work explores the use of DCNNs to estimate the orientation of vehicles from a single 2D image. The experiments comprised 48 training scenarios, spanning four dataset variations and 12 models, each evaluated on four key metrics. Overall, the best-performing architecture was EfficientNet-B2, achieving an accuracy of 97.22% consistently across all dataset variations and demonstrating robustness to the preprocessing techniques applied. ResNet18 also delivered competitive results, achieving the highest recorded accuracy of 98.61% on the original dataset, while MobileNetV2 performed exceptionally well on the augmented and no-background datasets, likewise reaching 98.61%. EfficientNet-B5 initially underperformed on the original dataset but improved significantly with augmentation, achieving 97.22% accuracy. The study revealed that dataset preprocessing played a crucial role in model performance, with augmentation and background removal significantly boosting accuracy. Classification errors were further analyzed using SHAP values, highlighting the importance of specific car features, such as the front and rear sections, in determining orientation. Overall, the results confirmed that vehicle orientation estimation can be effectively approached as a classification problem. The ResNet family proved highly robust, while EfficientNet-B2 emerged as a strong contender due to its consistency. MobileNetV2's efficiency and strong performance make it a viable option for real-time applications. Future work should explore transformer-based architectures and evaluate model performance on real-world datasets with varying environmental conditions.
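The core idea of treating orientation estimation as a classification problem can be sketched as follows: a continuous yaw angle is discretized into a fixed number of orientation classes, which a DCNN then predicts from the image. The bin count below is a hypothetical choice for illustration only; the abstract does not state the class granularity used in the study.

```python
def yaw_to_class(yaw_deg: float, num_bins: int = 8) -> int:
    """Map a continuous yaw angle (degrees) to a discrete orientation class.

    num_bins=8 (45-degree sectors) is an assumed example value,
    not the granularity used in the paper.
    """
    yaw = yaw_deg % 360.0            # normalise to [0, 360)
    bin_width = 360.0 / num_bins     # width of each orientation sector
    # shift by half a bin so class 0 is centred on 0 degrees (facing forward)
    return int(((yaw + bin_width / 2) // bin_width) % num_bins)
```

Framed this way, the network's output layer simply has `num_bins` logits, and standard classification metrics (such as the accuracy figures reported above) apply directly.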