EfficientCrackFusion: An EfficientNet-ViT Hybrid with Cross-Attention Fusion for Concrete Crack Detection

Aditi Shah
Shveta Mahajan

Abstract

Detecting cracks in concrete and pavement structures is essential for preserving the integrity of infrastructure, but manual inspection is slow and subject to inspector bias. Convolutional neural networks automate feature extraction for crack detection, but their limited receptive fields capture mainly short-range patterns. Vision Transformers address this limitation through self-attention, but used on their own they require extensive computational resources. EfficientCrackFusion is a novel hybrid architecture that uses a Cross-Attention Fusion Gate and a Squeeze-and-Excitation block to combine the EfficientNet-B0 and ViT-Base networks for efficient local and global feature representation. To obtain an unbiased assessment, the researchers used perceptual hashing together with FAISS similarity grouping to build a group-aware data-splitting method that prevents data leakage between splits. Tested on 40,000 images, the model achieved 99.11% accuracy with an F1-score of 0.9947, surpassing existing models while maintaining a compact design suitable for outdoor structural health assessments.
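
The group-aware splitting idea can be illustrated with a small sketch. The abstract does not give implementation details, so everything here is an assumption: a simple average hash stands in for whichever perceptual hash the authors used, a brute-force Hamming comparison stands in for FAISS similarity search, and the function names (`average_hash`, `group_near_duplicates`, `group_aware_split`) are hypothetical. The point is only that near-duplicate images are grouped first and whole groups are then assigned to one split.

```python
import random
from collections import defaultdict

def average_hash(pixels):
    """Simple perceptual hash: 1 where a pixel is brighter than the image mean."""
    flat = [p for row in pixels for p in row]
    mean = sum(flat) / len(flat)
    return tuple(1 if p > mean else 0 for p in flat)

def hamming(a, b):
    """Number of differing bits between two equal-length hashes."""
    return sum(x != y for x, y in zip(a, b))

def group_near_duplicates(hashes, max_distance):
    """Union-find grouping of hashes within max_distance bits.
    Brute-force stand-in (assumption) for a FAISS similarity search."""
    parent = list(range(len(hashes)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i in range(len(hashes)):
        for j in range(i + 1, len(hashes)):
            if hamming(hashes[i], hashes[j]) <= max_distance:
                parent[find(i)] = find(j)
    return [find(i) for i in range(len(hashes))]

def group_aware_split(group_ids, test_fraction, seed=0):
    """Assign whole groups to test until the target size is reached."""
    groups = defaultdict(list)
    for idx, gid in enumerate(group_ids):
        groups[gid].append(idx)
    keys = sorted(groups)
    random.Random(seed).shuffle(keys)
    target = test_fraction * len(group_ids)
    train, test = [], []
    for key in keys:
        (test if len(test) < target else train).extend(groups[key])
    return sorted(train), sorted(test)

# Toy 4x4 grayscale "images": two near-duplicate pairs and one distinct image.
vertical = [[10, 10, 200, 200] for _ in range(4)]
vertical_brighter = [[p + 5 for p in row] for row in vertical]
horizontal = [[200] * 4, [200] * 4, [10] * 4, [10] * 4]
horizontal_noisy = [[200] * 4, [200] * 4, [220, 10, 10, 10], [10] * 4]
checker = [[10, 200, 10, 200], [200, 10, 200, 10]] * 2

images = [vertical, vertical_brighter, horizontal, horizontal_noisy, checker]
hashes = [average_hash(img) for img in images]
group_ids = group_near_duplicates(hashes, max_distance=2)
train_idx, test_idx = group_aware_split(group_ids, test_fraction=0.4)
```

Because assignment happens per group rather than per image, a near-duplicate of a training image cannot end up in the test set, which is the leakage the abstract describes. The pairwise loop is quadratic, so at the scale of 40,000 images an indexed similarity search would be the practical choice.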

Version published to 10.21203/rs.3.rs-8935135/v1 on Research Square
Mar 18, 2026

Related articles

SAC-YOLO: Efficient Multi-Scale Feature Fusion for Transmission Line Defect Detection

This article has 6 authors:
1. Haotian Yin
2. Fanghua Liu
3. Jiankang Yuan
4. Juntao Fan
5. Chaojie Xu
6. Ruibo Tan
This article has no evaluations. Latest version: Apr 17, 2026

YOLO-MAFD: A Collaborative Detection Framework for Automated Recognition of Bridge Steel Structural Components

This article has 4 authors:
1. Guixiang Xue
2. Jiaxiang Li
3. Changhai Xu
4. Junfei Zhang
This article has no evaluations. Latest version: Apr 6, 2026

Optimization of Multi-Scale Feature Extraction and Loss Functions in YOLOv8 for Insulator Defect Detection

This article has 6 authors:
1. Meng Su
2. Shuailun Geng
3. Hong Yu
4. Shuai Zhou
5. Lihua Zhou
6. Jiao Luo
This article has no evaluations. Latest version: Apr 19, 2026
