DINOray: A Tightly Integrated De-occlusion Model for the Detection of Prohibited Items

Boyan Sun
Zhijian Wang
Yi Xiao Sun
Xiang Zhou
Wenyu Zhang
Haigang Zhang

Read the full article

Listed in

This article is not in any list yet, why not save it to one of your lists.

Abstract

In today’s society, the increasing occurrence of terrorist actsand other criminal behaviors highlights the crucial necessity of securityscreening to safeguard areas from potential threats. However, most of theexisting mainstream target detection methods are designed for process-ing natural images, and do not take into account the special differencethat X-ray security images have perspective. The prohibited items withinthese security images exhibit size and shape variations under variousimaging angles, often rendering them uninformative to the models. Inaddition, the security images frequently display varying degrees of over-lapping and occlusion between targets and backgrounds, which makes theaccurate detection of prohibited items more difficult. To address thesechallenges in security screening tasks, this paper proposes a novel modelbased on DINO, which is referred to as DINOray. It integrates a Cross-Scale Tight Integration module and an Occlusion Removal module. Bymitigating noise and occlusion, DINOray extracts more salient informa-tion about prohibited items and tightly integrates it with the output ofthe encoder, which includes global features, in order to supplement lo-cal features. These enhanced features are then passed to the decoder toenhance the detection accuracy of the model. The experimental resultsshow that DINOray outperforms DINO by achieving 3.5% and 1.1% en-hancements in the detection accuracy of prohibited items on the PIDray and CLCXray open-source security screening datasets, respectively. Thisdemonstrates its superior performance in the security screening field.

Version published to 10.21203/rs.3.rs-4309020/v1 on Research Square
Apr 26, 2024

Enhancing Object Detection Algorithm for Size-Insensitive Performance

This article has 3 authors:
1. Hyoju Kim
2. Soomin Kwon
3. Jaeoh Kim
This article has no evaluationsLatest version Sep 22, 2025
Scene Text Detection Using Attention with Depthwise Separable Convolutions for Mobile Applications

This article has 2 authors:
1. Ramalakshmi Subbukalai
2. Vani Vijayan
This article has no evaluationsLatest version Sep 11, 2025
Spectral Pyramid Pooling and Fused Keypoint Generation in ResNet-50 for Robust 3D Object Detection

This article has 3 authors:
1. R. Ramana
2. V. Vasudevan
3. B. S. Murugan
This article has no evaluationsLatest version Aug 21, 2025

Listed in

Abstract

Article activity feed

Related articles

Enhancing Object Detection Algorithm for Size-Insensitive Performance

Scene Text Detection Using Attention with Depthwise Separable Convolutions for Mobile Applications

Spectral Pyramid Pooling and Fused Keypoint Generation in ResNet-50 for Robust 3D Object Detection