Urban Road Defect Detection: A Hybrid EfficientNetV2-B0 and CBAM Framework with Real-Time Computer Vision Optimization

Sarah Ezz
Nashaat M. Hussain Hassan
Ayman Mahmoud Othman
Ahmed Monier
Ahmed Ehab

Abstract

Road defect detection is essential to road maintenance and safety, but current approaches rarely achieve both the required accuracy and real-time processing. This work introduces a hybrid deep learning architecture that combines EfficientNetV2-B0 with the Convolutional Block Attention Module (CBAM) for high-precision, real-time, multi-class road defect detection. EfficientNetV2-B0 provides strong feature extraction, and CBAM's attention mechanism focuses the network on important defect regions, improving detection accuracy while maintaining computational efficiency. We tested the system on a curated dataset of 1200 images covering four defect classes (cracks, potholes, patches, and surface defects), where it achieved 97% accuracy and 21 ms inference per image on GPU hardware. Comparative experiments show that the hybrid approach offers a better speed-accuracy trade-off than standalone CNNs (EfficientNetV2-B0: 93.5% accuracy) and Vision Transformers (ViT-Tiny: 97.1% accuracy but 70 ms latency). These results are further supported by a case study in 6th of October City, Giza, Egypt, in which the system recognized and classified important pavement distresses in real urban environments, including fine cracks (98% accuracy), hazardous potholes (96% recall), and complex surface defects (97% precision). The proposed system is well suited to infrastructure monitoring applications, with real-time processing (21.5 ms per image) and low computational overhead (1.42 billion FLOPs). This work supports automated infrastructure monitoring by providing a scalable, high-accuracy, low-latency solution for road defect detection.
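
To make the speed-accuracy comparison concrete, the short sketch below converts the per-image latencies quoted in the abstract into approximate throughput. It is only illustrative arithmetic, not the authors' evaluation code: the function names are hypothetical, and it assumes images are processed one at a time with no batching or pipelining.

```python
# Figures taken from the abstract. The abstract quotes both 21 ms and 21.5 ms
# for the hybrid model; 21 ms is used here.
MODELS = {
    "EfficientNetV2-B0 + CBAM (hybrid)": {"accuracy": 97.0, "latency_ms": 21.0},
    "ViT-Tiny": {"accuracy": 97.1, "latency_ms": 70.0},
}


def frames_per_second(latency_ms: float) -> float:
    """Images per second, assuming one image is processed at a time."""
    return 1000.0 / latency_ms


def throughput_table(models: dict) -> dict:
    """Map each model name to its approximate throughput, rounded to 0.1 FPS."""
    return {
        name: round(frames_per_second(stats["latency_ms"]), 1)
        for name, stats in models.items()
    }


def speedup(slow_latency_ms: float, fast_latency_ms: float) -> float:
    """How many times faster the lower-latency model is per image."""
    return round(slow_latency_ms / fast_latency_ms, 2)


if __name__ == "__main__":
    for name, fps in throughput_table(MODELS).items():
        print(f"{name}: about {fps} images/s at {MODELS[name]['accuracy']}% accuracy")
    ratio = speedup(
        MODELS["ViT-Tiny"]["latency_ms"],
        MODELS["EfficientNetV2-B0 + CBAM (hybrid)"]["latency_ms"],
    )
    print(f"Hybrid model is about {ratio}x faster per image than ViT-Tiny")
```

Under these assumptions the hybrid model handles roughly 48 images per second against roughly 14 for ViT-Tiny, while the reported accuracy gap is 0.1 percentage points.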

Version published to 10.21203/rs.3.rs-7150970/v1 on Research Square
Sep 1, 2025

Related articles

RoadNet: A High-Precision Transformer-CNN Framework for Road Defect Detection via UAV-Based Visual Perception

This article has 4 authors:
1. Long Gou
2. Yadong Liang
3. Xingyu Zhang
4. Jianfeng Yang
This article has no evaluations. Latest version: Oct 9, 2025

YOLO-Defect: A New Multi-Scale YOLO-Based Deep Neural Network with Feature Enhancement for Multiclass Bridge Surface Defect Detection

This article has 5 authors:
1. Haihao Tang
2. Xiaobo Zhang
3. Yutao Liu
4. Donghai Zhai
5. Yongle Li
This article has no evaluations. Latest version: Oct 6, 2025

Pedestrian Detection in Aerial Image Based on Convolutional Neural Network with Attention Mechanism and Multi-scale Prediction

This article has 3 authors:
1. Jiaxi Yang
2. Jiaquan Shen
3. Shitong Wang
This article has no evaluations. Latest version: Aug 27, 2025
