Fusion Guard: A Multi-Scale Sequential Fusion Framework for Small Target Detection in Unmanned Aerial Vehicle Scenarios Using YOLO-World

Li-Juan Liu
Ke Lv
Yanhua Ma
Hong Tian

Abstract

This paper presents Fusion Guard, a multi-scale sequence fusion framework built on YOLO-World to improve small target detection in UAV scenarios. To overcome the limitations of traditional models in feature convergence and detail preservation, Fusion Guard incorporates three key modules, Triple Feature Encoding, Scale Sequence Feature Fusion, and Feature Sum Aggregator, which together strengthen feature extraction and integration. Selective Boundary Aggregation applies a bidirectional dynamic fusion approach to enhance feature complementarity, and a small object detection layer fed with high-resolution features significantly improves detection performance. To further improve target localization, the WIoU v3 loss function is incorporated into the model. On the VisDrone2019 dataset, the model achieves a 5.7% increase in mAP@0.5 and a 3.2% improvement in mAP@0.5:0.95, and precision and recall improve by about 5%. On the DOTA dataset, the model achieves notable performance gains while reducing parameters by 0.5M and model size by 0.7 MB, and it also supports customizable detection categories. Overall, Fusion Guard demonstrates strong performance and versatility in detecting small objects across various tasks.
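For readers unfamiliar with the reported metrics, the sketch below shows how the IoU thresholds behind mAP@0.5 and mAP@0.5:0.95 apply to a single predicted box. It is a minimal illustration, not the authors' evaluation code: the function names and the two example boxes are hypothetical, and a full mAP computation would also need confidence ranking and per-class averaging, which are omitted here.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    # Clamp to zero when the boxes do not overlap.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# IoU thresholds 0.50, 0.55, ..., 0.95 averaged over by mAP@0.5:0.95.
THRESHOLDS = [round(0.5 + 0.05 * i, 2) for i in range(10)]


def matched_thresholds(pred, gt):
    """Thresholds at which the prediction counts as a correct match."""
    value = iou(pred, gt)
    return [t for t in THRESHOLDS if value >= t]


# Hypothetical 10x10 pixel target and a prediction shifted right by 2 pixels.
gt = (10, 10, 20, 20)
pred = (12, 10, 22, 20)
print(round(iou(pred, gt), 3))
print(matched_thresholds(pred, gt))
```

In this made-up case a two-pixel shift on a ten-pixel target already lowers the IoU to about 0.67, so the prediction counts as correct at the 0.5 threshold but fails the stricter thresholds from 0.7 upward. This sensitivity is one reason localization quality matters so much for small targets.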

Version published to 10.21203/rs.3.rs-7708917/v1 on Research Square
Oct 9, 2025

Related articles

CSM-DETR: Construction Site Monitoring via Mamba-Enhanced Detection Transformer for UAV Aerial Imagery

This article has 1 author:
1. Long Zhang
This article has no evaluations
Latest version Jan 19, 2026

An Improved RT-DETR Algorithm for Small-Object Detection in UAV Aerial Images

This article has 4 authors:
1. Qiyu Long
2. Zhixun Liang
3. Peng Chen
4. Peng Tang
This article has no evaluations
Latest version Jan 6, 2026

MLRT-DETR: A Multi-Directional and Multi-Level Small Object Detection Algorithm from UAV Perspective

This article has 5 authors:
1. Yongxian Song
2. Qi Zhang
3. Yan Yan
4. Quan Chen
5. Zeyu Ma
This article has no evaluations
Latest version Dec 16, 2025
