Deeply Supervised Self-Attention Learning Model for Person Re-Identification

Abstract

Person Re-Identification (Re-ID) involves matching pedestrian images across camera networks with non-overlapping views. The task is challenging due to variations in illumination, viewpoint, background clutter, and occlusion, which often degrade model performance and consistency. Addressing these issues requires features that are robust to such variations. We introduce a deep self-attention module for Re-ID, designed to learn and fuse spatial and cross-channel relationships throughout the feature extraction process. By decomposing attention into channel and spatial dimensions, our approach enhances the robustness of the learned features. The module uses global feature aggregation and normalization, yielding more discriminative and complementary features. We formulate Re-ID as a fine-grained classification problem, optimizing the model with a multi-class cross-entropy loss augmented by losses from deeply supervised intermediate layers, partitioned regions, and the final classifier. Extensive experiments on the DukeMTMC-ReID and Market-1501 datasets show that our model surpasses the baseline, with Rank-1 accuracy improvements of 6.6% and 6.2%, respectively. It also delivers competitive, and in some cases superior, performance relative to current state-of-the-art methods. (Code for this work is available at https://github.com/bmiftah/DSP-Person-ReID.)
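The abstract does not spell out the module's internals; the sketch below is a minimal PyTorch illustration of one plausible reading, in which attention is decomposed into a channel branch and a spatial branch, each driven by global aggregation (average and max pooling), and the deep-supervision objective sums cross-entropy terms from intermediate, part-region, and final classifiers. The reduction ratio, kernel size, residual fusion, and equal loss weighting are all assumptions for illustration, not details taken from the paper.

```python
# Illustrative sketch only: layer shapes, reduction ratio, and loss weights
# are assumptions, not the authors' reported configuration.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ChannelAttention(nn.Module):
    """Aggregate global spatial context into per-channel weights."""
    def __init__(self, channels: int, reduction: int = 16):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, _, _ = x.shape
        avg = self.mlp(F.adaptive_avg_pool2d(x, 1).view(b, c))
        mx = self.mlp(F.adaptive_max_pool2d(x, 1).view(b, c))
        w = torch.sigmoid(avg + mx).view(b, c, 1, 1)
        return x * w

class SpatialAttention(nn.Module):
    """Aggregate across channels to weight spatial locations."""
    def __init__(self, kernel_size: int = 7):
        super().__init__()
        self.conv = nn.Conv2d(2, 1, kernel_size, padding=kernel_size // 2)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        avg = x.mean(dim=1, keepdim=True)          # (b, 1, h, w)
        mx, _ = x.max(dim=1, keepdim=True)         # (b, 1, h, w)
        w = torch.sigmoid(self.conv(torch.cat([avg, mx], dim=1)))
        return x * w

class AttentionBlock(nn.Module):
    """Channel attention followed by spatial attention, fused residually."""
    def __init__(self, channels: int):
        super().__init__()
        self.ca = ChannelAttention(channels)
        self.sa = SpatialAttention()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x + self.sa(self.ca(x))

def deeply_supervised_loss(intermediate_logits, part_logits, final_logits, labels):
    """Sum cross-entropy over intermediate-layer, part-region, and final
    classifiers. Equal weighting of the terms is an assumption."""
    loss = F.cross_entropy(final_logits, labels)
    for logits in list(intermediate_logits) + list(part_logits):
        loss = loss + F.cross_entropy(logits, labels)
    return loss

# Example: apply the block to a hypothetical mid-level Re-ID feature map.
block = AttentionBlock(channels=256)
feat = block(torch.randn(8, 256, 24, 12))
```

In this reading, the block can be dropped after each backbone stage so that spatial and cross-channel relationships are refined throughout feature extraction, while the auxiliary classifiers provide the deep supervision described above.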
