IR-WSANet: An Efficient Lightweight Network for Real-Time Infrared Small Target Detection in UAV Applications
Abstract
Infrared small target detection (IRSTD) has significant application value in military operations, early warning and surveillance, aerospace, and other fields. However, traditional detection methods are challenged by the small pixel footprint of targets, strong background noise, and sparse infrared image features, which leads to insufficient accuracy and robustness. This paper proposes IR-WSANet, a lightweight network based on an improved YOLOv10 that enhances infrared small target detection through a joint frequency-spatial optimization strategy. First, discrete wavelet transform convolution (DWaveletConv) is introduced into the backbone network; its multi-band feature decomposition strengthens the fusion of high-frequency details and low-frequency semantics and suppresses noise interference. Second, we design a cooperative module (POS-SHSA) that integrates POSConvEmbedding with a partial-channel single-head self-attention (SHSA) mechanism, combining local spatial features with global context information to improve the localization accuracy of small targets. Experiments on the SIDD and HIT-UAV datasets verify the effectiveness of the model: IR-WSANet reaches an mAP of 97.2%, 82.6%, and 82.8% on SIDD-City, SIDD-Mountain, and HIT-UAV, respectively, which is 2.8% to 14.1% higher than the baseline YOLOv10, and the largest F1-score improvement reaches 14.8%, while the model maintains a low computational cost (27.9 GFLOPs) and real-time performance (42.8 FPS). The results show that IR-WSANet significantly improves infrared small target detection in complex scenes by combining frequency-domain filtering enhancement with a space-channel dual attention mechanism.
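To make the idea of multi-band feature decomposition concrete, the following is a minimal sketch of a single-level 2D wavelet transform in plain Python. It is not the paper's DWaveletConv layer: the abstract does not name the wavelet family, so the Haar wavelet is assumed here, and the function names `haar_dwt2` and `haar_idwt2` are hypothetical. The sketch only shows how an image splits into one low-frequency band and three high-frequency detail bands, and that the split is lossless.

```python
def haar_dwt2(img):
    """Single-level 2D Haar transform (assumed wavelet, for illustration).

    img is a list of rows with even height and width. Returns four
    half-resolution sub-bands: one low-frequency band (ll) and three
    high-frequency detail bands (lh, hl, hh).
    """
    h, w = len(img), len(img[0])
    if h % 2 or w % 2:
        raise ValueError("height and width must be even")
    ll, lh, hl, hh = ([[0.0] * (w // 2) for _ in range(h // 2)] for _ in range(4))
    for i in range(h // 2):
        for j in range(w // 2):
            # The four pixels of one 2x2 block.
            a, b = img[2 * i][2 * j], img[2 * i][2 * j + 1]
            c, d = img[2 * i + 1][2 * j], img[2 * i + 1][2 * j + 1]
            ll[i][j] = (a + b + c + d) / 2.0  # scaled block sum (smooth content)
            lh[i][j] = (a - b + c - d) / 2.0  # difference across columns
            hl[i][j] = (a + b - c - d) / 2.0  # difference across rows
            hh[i][j] = (a - b - c + d) / 2.0  # diagonal difference
    return ll, lh, hl, hh


def haar_idwt2(ll, lh, hl, hh):
    """Inverse of haar_dwt2: rebuilds the full-resolution image."""
    h, w = len(ll), len(ll[0])
    img = [[0.0] * (2 * w) for _ in range(2 * h)]
    for i in range(h):
        for j in range(w):
            s, x, y, z = ll[i][j], lh[i][j], hl[i][j], hh[i][j]
            img[2 * i][2 * j] = (s + x + y + z) / 2.0
            img[2 * i][2 * j + 1] = (s - x + y - z) / 2.0
            img[2 * i + 1][2 * j] = (s + x - y - z) / 2.0
            img[2 * i + 1][2 * j + 1] = (s - x - y + z) / 2.0
    return img
```

Under this assumption, a flat background contributes only to `ll`, while a single bright pixel (a stand-in for a small target) also produces nonzero responses in the three detail bands. This is one way to see why processing the bands separately could help separate small targets from smooth background clutter.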