Multi-Scale Feature Fusion for Cross-Modality Person Re-Identification: The MSJLNet Approach
Abstract
Visible-infrared person re-identification (VI-ReID) is challenging due to the large modality discrepancy between visible and infrared images. Traditional two-stream networks often fail to preserve the semantic guidance provided by data augmentation as network depth increases. To address this, we propose the Multi-Scale Joint Learning Network (MSJLNet), which employs a novel four-stream architecture that separates the data-augmented branches from the original branches and focuses on extracting robust, color-agnostic modal features. An Information Purification Module (IPM) with a channel attention mechanism dynamically filters noise and suppresses redundant color information in the augmented branches. A Joint Semantic Learning Module (JSLM) then fuses global detail features with color-agnostic features, improving the model's discriminative ability. Extensive experiments on the SYSU-MM01 and RegDB datasets demonstrate the superior performance of MSJLNet, which achieves 79.94% Rank-1 accuracy and 74.96% mAP on SYSU-MM01, and 93.14% Rank-1 accuracy and 87.22% mAP on RegDB. The proposed approach offers new insights for enhancing cross-modality feature learning. Code is available at https://github.com/1849714926/MSJLNet.
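For illustration only, below is a minimal PyTorch sketch of what a channel-attention-based purification step of the kind the abstract describes might look like. The class name ChannelAttentionIPM, the squeeze-and-excitation structure, and the reduction hyperparameter are all assumptions for exposition, not the authors' implementation; see the linked repository for the actual MSJLNet code.

import torch
import torch.nn as nn

class ChannelAttentionIPM(nn.Module):
    # Hypothetical sketch of an Information Purification Module (IPM),
    # assuming a squeeze-and-excitation style channel attention: per-channel
    # statistics produce gating weights so that redundant (e.g. color-specific)
    # channels in the augmented branches can be suppressed.
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)  # squeeze: one statistic per channel
        self.fc = nn.Sequential(             # excitation: learn channel weights
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
            nn.Sigmoid(),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, _, _ = x.shape
        w = self.fc(self.pool(x).view(b, c)).view(b, c, 1, 1)
        return x * w  # reweight channels, down-weighting noisy/redundant ones

# Example: purify a batch of augmented-branch feature maps
feats = torch.randn(4, 256, 24, 12)
ipm = ChannelAttentionIPM(channels=256)
print(ipm(feats).shape)  # torch.Size([4, 256, 24, 12])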