Lightweight YOLOv8-Obb Optimization with Hybrid Attention and Dynamic Feature Reconstruction for Remote Sensing Object Detection

Xilai Ju
Yongjian Li
Genghua Huang
Zhen Yan
Xiyuan Wu
Song Ji


Abstract

To address the challenges of multi-scale object recognition and complex background interference in remote sensing images, this paper proposes a lightweight improved algorithm based on YOLOv8-obb. By integrating a hybrid local-channel attention mechanism (MLCA), dynamic upsampling (DySample), and a reparameterized cross-scale feature aggregation module (RepNCSPELAN), the algorithm achieves dual enhancements in detection accuracy and computational efficiency. The MLCA module enhances feature discriminability in complex backgrounds through a dual-path mechanism combining local and global pooling. The DySample module employs content-aware sampling point generation to improve feature reconstruction for multi-scale targets. The RepNCSPELAN module reduces model parameters by 29% while preserving cross-scale feature fusion capabilities. Experimental results on the DOTA dataset demonstrate a 2.3% improvement in mAP50, with parameters reduced to 2.2M and FLOPs decreased by 27%. Cross-dataset validation on DIOR further confirms a 1.5% mAP50 gain. Compared to mainstream lightweight models (e.g., YOLOv5n, YOLOv8n), the proposed algorithm exhibits superior performance in accuracy (65.3% vs. 60.8%), parameter efficiency (2.2M vs. 3.1M), and computational cost (6.2G vs. 8.5G FLOPs). Ablation studies validate the efficacy of each module, while visualizations highlight robustness in dense small-object detection and rotated target localization. This work provides an efficient solution for real-time remote sensing object detection in complex scenarios and offers a novel technical pathway for lightweight deep learning model design.
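The percentage reductions quoted in the abstract can be cross-checked against the paired figures it gives. The short Python sketch below does only that arithmetic. It assumes that the 3.1M-parameter and 8.5G-FLOPs figures are the baseline the 29% and 27% reductions refer to, which the abstract does not state explicitly; the variable and function names are illustrative and not taken from the paper.

```python
# Figures quoted in the abstract: proposed model vs. the compared lightweight model.
# Assumption: the 3.1M / 8.5G values are the baseline that the quoted
# percentage reductions are measured against (the abstract does not say so).
proposed = {"map50": 65.3, "params_m": 2.2, "gflops": 6.2}
compared = {"map50": 60.8, "params_m": 3.1, "gflops": 8.5}


def relative_reduction(before: float, after: float) -> float:
    """Return the relative reduction from `before` to `after`, in percent."""
    return (before - after) / before * 100.0


param_cut = relative_reduction(compared["params_m"], proposed["params_m"])
flops_cut = relative_reduction(compared["gflops"], proposed["gflops"])
map_gap = proposed["map50"] - compared["map50"]  # absolute percentage points

print(f"parameter reduction: {param_cut:.1f}%")
print(f"FLOPs reduction: {flops_cut:.1f}%")
print(f"mAP50 gap: {map_gap:.1f} points")
```

Under that assumption the reductions round to the 29% and 27% stated in the abstract. The mAP50 gap computed here is against the compared lightweight models and is a separate figure from the 2.3% gain reported on DOTA, for which the abstract gives no baseline value.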

Version published to 10.20944/preprints202504.1716.v1
Apr 21, 2025

Related articles

ECBAM-CVT-SOD: An Enhanced YOLOv8 Architecture with Multimodal Attentional Fusion for Innovative Low-altitude Remote Sensing in Small Object Detection

This article has 3 authors:
1. Tieshan Zhang
2. Zhong Ren
3. Shaoyuan Xi
This article has no evaluations
Latest version Apr 21, 2025

SEMA-YOLO: Lightweight Small Object Detection in Remote Sensing Image via Shallow-layer Enhancement and Multi-scale Adaptation

This article has 5 authors:
1. Zhenchuan Wu
2. Hang Zhen
3. Xiaoxinxi Zhang
4. Xuechen Bai
5. Xinghua Li
This article has no evaluations
Latest version Apr 14, 2025

MambaRetinaNet: A Multi-Scale Convolution and Mamba Fusion-Based Remote Sensing Object Detection Model

This article has 8 authors:
1. Junjie Chen
2. Jieru Wei
3. Gang Wu
4. Jichang Yang
5. Jiandong Shang
6. Hengliang Guo
7. Dujuan Zhang
8. Shengguang Zhu
This article has no evaluations
Latest version Mar 31, 2025