An Improved RT-DETR Algorithm for Small-Object Detection in UAV Aerial Images
Discuss this preprint
Start a discussion What are Sciety discussions?Listed in
This article is not in any list yet, why not save it to one of your lists.Abstract
To address the challenges of UAV aerial imagery, including the prevalence of small objects, complex background interference, and difficulty in feature extraction that lead to high missed detection rates and compromise detection accuracy in existing RT-DETR algorithms, this paper proposes an improved small-object-oriented detector named MSFE-DETR (Multi-Scale Feature Enhancement DETR). A CMFE (CSP-MultiScale Feature Enhancement) module is integrated into the shallow backbone layers to enhance feature representation of small objects and alleviate feature loss caused by scale and background complexity. In deeper layers of backbone, the C2f module is employed to preserve fine-grained details and improve target–background discrimination, while multi-scale feature fusion further prevents small object information degradation. In addition, Deformable Attention (DAttention) is incorporated to adaptively focus on small target regions, retaining spatial positional information and suppressing background noise. The head integrates MPCA and FSA modules, where MPCA progressively fuses adjacent-scale features to complementarily enhance small object representations and suppress background interference, and FSA further improves detail enhancement and robustness. Moreover, an Inner-SIoU loss is proposed by combining Inner-IoU with SIoU, improving localization accuracy, convergence speed, and robustness in complex scenes. Experimental results on the VisDrone 2019 dataset show that MSFE-DETR outperforms RT-DETR-r18 by 2.0% in Precision, 2.1% in Recall, and 2.4% in mAP@0.5, with only a slight increase in computational cost.