TAF-YOLO: A Small-Object Detection Network for UAV Aerial Imagery via Visible and Infrared Adaptive Fusion
Abstract
Detecting small objects in UAV-captured aerial imagery is a critical yet challenging task, hindered by small object size, complex backgrounds, and subtle inter-class differences. Single-modal methods lack the robustness required for all-weather operation, while existing multimodal solutions are often too computationally expensive for deployment on resource-constrained UAVs. To this end, we propose TAF-YOLO, a lightweight and efficient multimodal detection framework designed to balance accuracy and efficiency. First, we design an early fusion module, the Two-branch Adaptive Fusion Network (TAFNet), which adaptively integrates visible and infrared information at both the pixel and channel levels before the feature extractor, maximizing complementary information while minimizing redundancy. Second, we introduce a Large Adaptive Selective Kernel (LASK) module that dynamically expands the receptive field using multi-scale convolutions and spatial attention, preserving crucial details of small objects during downsampling. Finally, we optimize the feature neck by replacing PANet's bidirectional path with a more efficient top-down pathway, enhanced by a Dual-Stream Attention Bridge (DSAB) that injects high-level semantics into low-level features, improving localization without significant computational overhead. On the VEDAI benchmark, TAF-YOLO achieves 67.2% mAP50, outperforming the CFT model by 2.7% and surpassing seven other YOLO variants. Our work presents a practical and powerful solution for real-time, all-weather object detection on resource-constrained UAVs.
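The central idea of the early fusion step, blending visible and infrared inputs with gates at both the pixel and channel levels, can be illustrated with a minimal NumPy sketch. This is a hypothetical simplification: in the paper's TAFNet the gates `w_pix` and `w_ch` would be produced by learned layers, whereas here they are supplied as plain arrays.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def adaptive_fuse(vis, ir, w_pix, w_ch):
    """Illustrative two-branch fusion (simplified, not the paper's exact TAFNet).
    vis, ir: feature maps of shape (C, H, W);
    w_pix: pixel-level gate logits, shape (1, H, W);
    w_ch:  channel-level gate logits, shape (C, 1, 1).
    """
    a_pix = sigmoid(w_pix)   # per-pixel preference for the visible branch
    a_ch = sigmoid(w_ch)     # per-channel preference for the visible branch
    alpha = a_pix * a_ch     # combined gate, broadcast to (C, H, W), in (0, 1)
    # Convex combination: each fused element lies between vis and ir.
    return alpha * vis + (1.0 - alpha) * ir

# Toy example with random maps standing in for the two modalities.
C, H, W = 4, 8, 8
rng = np.random.default_rng(0)
vis = rng.normal(size=(C, H, W))
ir = rng.normal(size=(C, H, W))
fused = adaptive_fuse(vis, ir,
                      rng.normal(size=(1, H, W)),
                      rng.normal(size=(C, 1, 1)))
print(fused.shape)  # (4, 8, 8)
```

Because the gate is squashed into (0, 1), the fusion is a per-element convex combination, so regions where one modality is uninformative (e.g. visible imagery at night) can be down-weighted without discarding the other branch.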