Multiscale Feature Optimization for Accurate Small Object Detection in Remote Sensing Imagery
Abstract
Detecting small, overlapping objects in high-resolution remote sensing imagery is crucial for applications such as smart city monitoring and disaster response, yet severe feature confusion and spatial misalignment hinder accurate localization. This paper introduces Multiscale SOG-DETR, a systematic redesign of the RT-DETR framework tailored for small-object detection in remote sensing. We propose a lightweight Multiscale Overlapping-Object Decoupling Network (MOODNet) that significantly reduces feature entanglement in overlapping regions. In addition, a specialized fusion neck, comprising the Residual Spatial-Alignment Progressive Fusion Module (SAPFM), E-CGAFusion, and WTConv2d modules, enhances multiscale semantic focus and preserves high-frequency details at low computational cost. On the RSOD, VisDrone2019, and NWPU VHR-10 datasets, Multiscale SOG-DETR achieves higher detection accuracy with significantly fewer parameters than the baseline RT-DETR, improving AP@IoU=0.50 by 3.1%, 3.0%, and 5.2%, and AP@IoU=0.50:0.95 by 5.1%, 2.1%, and 8.5%, respectively. These results position Multiscale SOG-DETR as a promising solution for efficient and accurate small-object detection in remote sensing. The source code is publicly available at https://github.com/AaronWang-code/Multiscale-SOG-DETR.
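To illustrate the wavelet principle that WTConv2d-style layers build on (the module itself is defined in the authors' code; this is only a generic sketch, not the paper's implementation), a single-level 2D Haar decomposition splits a feature map into one low-frequency sub-band and three high-frequency detail sub-bands, each at half resolution. Convolving in this domain lets a network attend to high-frequency detail, which is exactly what small objects contribute, at reduced spatial cost:

```python
def haar2d(x):
    """Single-level 2D Haar decomposition of an even-sized 2D array.

    Returns (LL, LH, HL, HH) sub-bands, each half the input height
    and width. LL carries the low-frequency average; LH, HL, and HH
    carry high-frequency detail (horizontal, vertical, and diagonal
    edges) that a wavelet-based convolution can process separately.
    """
    h, w = len(x), len(x[0])
    LL, LH, HL, HH = [], [], [], []
    for i in range(0, h, 2):
        ll, lh, hl, hh = [], [], [], []
        for j in range(0, w, 2):
            a, b = x[i][j], x[i][j + 1]
            c, d = x[i + 1][j], x[i + 1][j + 1]
            ll.append((a + b + c + d) / 2)  # low-pass average
            lh.append((a - b + c - d) / 2)  # horizontal detail
            hl.append((a + b - c - d) / 2)  # vertical detail
            hh.append((a - b - c + d) / 2)  # diagonal detail
        LL.append(ll); LH.append(lh); HL.append(hl); HH.append(hh)
    return LL, LH, HL, HH


# A flat region has no high-frequency content: all detail bands are zero.
LL, LH, HL, HH = haar2d([[1, 1], [1, 1]])
print(LL, LH, HL, HH)  # [[2.0]] [[0.0]] [[0.0]] [[0.0]]
```

A sharp vertical edge, by contrast, concentrates its energy in the LH band, which is why preserving these sub-bands helps retain the fine detail that small remote-sensing objects depend on.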