Enhancing Cross-Modality Person Re-Identification through Attention-Guided Asymmetric Feature Learning
Abstract
The main objective of the visible-infrared person re-identification (VI-ReID) task is to match images of pedestrians captured under visible light with their corresponding infrared (IR) representations. The main challenge lies in the cross-modal and intra-modal discrepancies between VIS and IR images, which often reduce recognition accuracy. To address these challenges, this paper uses a middle modality generator (MMG) that converts pedestrian images into middle-modality (M-modality) ones, transforming the dual-modality task into a tri-modality one. VI-ReID methods commonly employ a dual-branch network to extract features from the two modalities. To better extract features across three modalities, however, we propose a four-branch parameter-sharing network (FBPN) and explore its parameter-sharing capabilities. Experiments on standard datasets show that the FBPN effectively reduces modality disparities and mitigates background interference through a lightweight channel attention module. Furthermore, we introduce an Asymmetric Multi-Granularity Feature Learning (AMFL) module to further reduce modality discrepancy. The proposed method achieves a 70.60\% mean Average Precision in the All-Search mode on SYSU-MM01 and 86.7\% mean Average Precision in the VtI mode on RegDB, outperforming existing approaches.
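To make the "lightweight channel attention" idea concrete, below is a minimal sketch in PyTorch of a squeeze-and-excitation style channel attention block, one common realization of such a module; the class name, reduction ratio, and structure are illustrative assumptions, not the paper's exact design.

```python
# Illustrative sketch only: a lightweight channel attention block that
# re-weights feature channels so informative (pedestrian) channels are
# emphasized over background ones. Details here are assumptions.
import torch
import torch.nn as nn


class ChannelAttention(nn.Module):
    """Squeeze-and-excitation style channel attention (hypothetical sketch)."""

    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)  # squeeze: global spatial average per channel
        self.fc = nn.Sequential(             # excitation: small bottleneck MLP
            nn.Linear(channels, channels // reduction, bias=False),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels, bias=False),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, _, _ = x.shape
        weights = self.fc(self.pool(x).view(b, c)).view(b, c, 1, 1)
        return x * weights                    # channel-wise re-weighting of the feature map


# Usage sketch: re-weight a backbone feature map of shape (batch, 2048, 18, 9).
attn = ChannelAttention(channels=2048)
out = attn(torch.randn(4, 2048, 18, 9))
```

Because the block only adds a global pooling step and a small two-layer MLP per branch, it keeps the four-branch network lightweight while letting each branch suppress background-dominated channels.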